Just as the principles of physics guide engineers who design bridges and jet airliners, so the principles of population genetics touch all of our lives in many, if unseen, ways. In Chapter 19, you’ll see how population genetics figures prominently in the search for genes that contribute to disease risk in people, using concepts such as linkage disequilibrium, described in this chapter. In this final section of the chapter, we will examine four other areas in which the principles of population genetics are being to applied to issues affecting modern societies.
Conservation biologists attempting to save endangered wild species, and zookeepers attempting to maintain small populations of captive animals, often perform population genetic analyses. Above, we discussed how a genetic bottleneck caused a loss of genetic variation in the California condor and an increase in the frequency of a lethal form of dwarfism. Bottlenecks may also increase the level of inbreeding in a population, perhaps leading to a decline in fitness through inbreeding depression. The issue is complex, however, because inbreeding is not always associated with a decline in fitness. Inbreeding can sometimes help purge deleterious recessive alleles from a population. Purifying selection is more effective at eliminating deleterious recessive alleles since the homozygous recessive class becomes more frequent in inbred populations. Thus, conservation biologists have debated whether they should attempt to maximize genetic diversity and minimize inbreeding or deliberately subject zoo populations to inbreeding with the goal of purging deleterious alleles.
To help address this question, researchers looked for evidence of successful purging among zoo populations. Let’s define inbreeding depression as delta (δ)
where Wf is the fitness of inbred individuals and w0 the fitness of non-
In Chapter 2, we saw how alleles for genetic disorders could be traced in pedigrees and we discussed how to calculate the risk that a couple will have a child who inherits such a disorder. Population genetic principles allow us to extend this type of analysis. We will consider two examples.
The disease allele for cystic fibrosis (CF) occurs at a frequency of about 0.025 in Caucasians. In the pedigree for a Caucasian family below, individual II-
One of II-
, or 1/8. We now extend the calculation to determine the probability that III-
chance she will transmit the disease allele to III-
The frequency of cystic fibrosis among Caucasians is p2 = (0.025)2 = 0.000625. These calculations tell us that individuals who have a first cousin with cystic fibrosis have a 0.003 ÷ 0.000625 = 4.9-
Here is another application of population genetics to assessing disease risk. Sickle-
Using this equation, we obtain
This represents a 2.2-
Criminals can leave DNA evidence at the scene of a crime in the form of blood, semen, hairs, or even buccal cells from saliva on a cigarette butt. The polymerase chain reaction (PCR) enables forensic scientists to amplify very tiny amounts of DNA and determine the genotype of the individual who left the specimen. If the DNA found at the crime scene matches that of the suspect, then they “may be” the same individual. The key phrase here is “may be,” and this is where population genetics comes into play. Let’s see how this works.
Consider two microsatellite loci, each with multiple alleles: A1, A2, … An and B1, B2, … Bn. Forensic scientists determine that a DNA specimen from a crime scene and the suspect are both A3/A8 B1/B7. They have determined that there is a “match” between the evidence and the suspect. Does the match prove that the DNA evidence came from the suspect? Does it prove that that the suspect was at the crime scene?
What population geneticists do with this type of evidence is to test a specific hypothesis: The evidence came from someone other than the suspect. This is what statisticians call the “null hypothesis,” or the hypothesis that is considered true unless the evidence shows that it is very unlikely (see Chapter 4). To perform the test, we calculate the probability of observing a match between the evidence and the suspect, given that the suspect and the person who left the evidence are different individuals. Symbolically, we write
where “|” means “given.” If this probability is very small, then we can reject the null hypothesis and argue in favor of an alternative hypothesis: The evidence was left by the suspect. We never formally prove the suspect left the evidence since there could be alternative hypotheses such as The evidence was left by the suspect’s identical twin.
To calculate the probability of observing a match between the evidence and the suspect if the evidence is from a different individual, we need to know the frequencies of the microsatellite alleles in the population.
A4 |
0.03 |
A6 |
0.05 |
B1 |
0.01 |
B7 |
0.12 |
Prob(match | different individuals) is the same as the probability that the evidence came from a randomly chosen individual. We can calculate this probability using the allele frequencies above. First, we will assume that the Hardy–
To combine these two probabilities, we need to make one more assumption. We need to assume that the two loci are independent; that is, that the loci are at linkage equilibrium. By making this assumption, we can apply the product rule for independent events (see Chapter 2) and determine that
Thus, the probability under the null hypothesis that the evidence came from someone other than the suspect is 7.2 × 10−6, or about 7 in a million. That’s a small probability, and so the null hypothesis seems unlikely in this case. However, if Prob(match | different individuals) were 0.1, then 10 percent of the population would be a match and could have left the evidence. In that case, we would not want to reject the null hypothesis.
Two microsatellites do not provide very much power to discriminate, so the FBI in the United States uses a set of 13 microsatellites. Microsatellite loci typically have large numbers of alleles (10 to 20 or more); therefore, the number of possible genotypes based on 13 microsatellites is astronomically large. With 10 alleles per locus, there are 55 possible genotypes at each locus and 5513, or 4.2 × 1022, possible multilocus genotypes for 13 loci. The FBI has also assembled a database called CODIS (Combined DNA Index System) that contains the frequencies of different alleles at these loci in the population, including data specific to different ethnic groups and regions of the country.
In this chapter, we have reviewed the basic principles of population genetics and discussed many applications to human genetics. Basic population genetic theory has been around for nearly 100 years, but only in the last decade has the development of high-
What does the future hold? Soon, the sequencing of a human genome may cost little more than a new bicycle. A college student might swab the inside of her mouth with a Q-