5.18: DNA is an individual identifier: the uses and abuses of DNA fingerprinting.

In another time, Colin Pitchfork, a murderer and rapist, would have walked free. But in 1987, he was captured and convicted, betrayed by his DNA, and is now serving two life sentences in prison. Pitchfork’s downfall began when he raped and murdered two 15-year-old high school girls in a small village in England in the 1980s. The police thought they had their perpetrator when a man confessed, but only to the second murder. He denied any involvement with the first murder, which perplexed the police because the details of the two crimes strongly suggested that the same person committed both.

At the time, British biologist Alec Jeffreys made the important discovery that there are small pieces of DNA in human chromosomes that are tremendously variable in their base sequences. In much the same way that each person has a driver’s license number or social security number that differs from everyone else’s, these DNA fragments are so variable that it is extremely unlikely that two people would have identical sequences at these locations. Thus, a comparison between these regions in a person’s DNA sample and in DNA-containing evidence left at a crime scene would enable police to determine whether the evidence sample came from that person.

Jeffreys analyzed DNA left by the murderer-rapist on the two victims and found that it did indeed come from a single person, and that person was not the man who had confessed to one of the crimes. The original suspect was released and has the distinction of being the first person cleared of a crime through DNA fingerprinting. To track down the criminal, police then requested blood samples from all men in the area of the crimes who were between 18 and 35 years old—a practice that many viewed as an invasion of privacy. The police collected and analyzed more than 5,000 blood samples. This procedure led them to Colin Pitchfork, whose DNA matched perfectly the DNA left on both victims, and ultimately was the evidence responsible for his conviction (FIGURE 5-44). (He almost slipped through, having persuaded a friend to give a blood sample in his name. But when the friend was overheard telling the story in a pub, police tracked down Pitchfork to get a blood sample.)

Figure 5.44: Betrayed by his DNA.

221

DNA fingerprinting is now used extensively in forensic investigations, in much the same way that regular fingerprints have been used for the past 100 years. But regular fingerprinting is limited in its usefulness for many crimes because, often, no fingerprints are left behind. DNA samples are frequently left behind, though, usually in the form of semen, blood, hair, skin, or other tissue. As a consequence, this technology has been directly responsible for bringing thousands of criminals to justice and, as we saw at the beginning of the chapter, for establishing the innocence of more than 300 people wrongly convicted of murder and other capital crimes. Let’s examine how DNA fingerprinting is done, why it is such a powerful forensic tool, and why it is not foolproof.

The DNA from different humans is almost completely identical. More than 99.9% of the DNA sequences of two individuals are the same, because we’re all of the same species and thus share a common evolutionary history. Even so, in comparing two individuals’ genomes of three billion base pairs each, a one-tenth of a percent difference still translates to about three million base-pair differences. These differences are responsible for the fact that all individuals have their own unique genome. (The lone exception? Identical twins, whose DNA is exactly the same.) Thus, when we are trying to evaluate whether the DNA from a crime scene matches that from a suspect, the analysis focuses on the parts of our DNA that differ. There are thousands of these highly variable regions in the human genome.

Among these thousands of variable regions, one particular type is used for the determination of a person’s genetic fingerprint. These regions are called STRs (for short tandem repeats) and are characterized by a short sequence (commonly four or five nucleotides) that repeats over and over within the region. The number of repeats is what varies among individuals.

An individual—we’ll call her Individual A—has two copies of each chromosome, one from her mother and one from her father. At one STR region (on chromosome 2, for example), the number of times the sequence repeats is likely to differ on the maternal and the paternal copies of that chromosome in Individual A. The sequence may repeat 3 times on the maternal copy of her chromosome 2, and it may repeat 14 times on the paternal copy. In such a case, Individual A is said to have two different alleles for this STR region: 3 and 14. In contrast, in Individual B, for the same STR region on chromosome 2, the sequence may repeat 5 and 11 times (FIGURE 5-45).

Figure 5.45: Biotechnology in forensics. Forensic scientists can use highly variable regions of DNA to genetically link a person to DNA-containing evidence left at a crime scene.

For an STR region within the human genome, there typically are about 10 different alleles that occur within a population, and each is shared by about 10% of all individuals in a population. So the likelihood that two individuals carry the same two alleles is about 1 in 100. This is unlikely, but given enough people, many are likely to carry the same alleles. If two different STR regions are analyzed, the likelihood that two individuals have the same four alleles is 1/10 × 1/10 × 1/10 × 1/10, or 1/10,000. This is much rarer, but still would lead to multiple individuals within the same large city carrying the same four alleles. The real power of DNA fingerprinting comes from simultaneously determining the alleles an individual carries (that is, their genotype) not simply at one or two STR locations but at 13 different STR locations. This is the number used by the FBI in constructing DNA fingerprints in the United States, and it makes the probability that two individuals would have exactly the same genotype extremely low (see Figure 5-45).

222

For each STR locus analyzed, an individual’s genotype is determined by using PCR to amplify that region, then measuring the length of the STR region using a technique called electrophoresis, which allows researchers to see how long a stretch of DNA is. The length of the region can then be used to determine the number of times the STR is repeated. For a single STR region, an individual’s genotype is expressed by two numbers, reflecting the number of STR repeats in the copies inherited from the mother and from the father. And a person’s full DNA fingerprint is a string of 26 numbers that includes the two numbers for each of 13 STRs.

Q

Question 5.14

What is a DNA fingerprint?

223

In court, a suspect’s genotype might be compared with the DNA fingerprint obtained from evidence found at the crime scene. DNA samples from different people will produce different 26-number “fingerprints,” whereas different samples of DNA from one person will have exactly the same genotype (FIGURE 5-46).

Figure 5.46: Your unique identifier: a DNA fingerprint. DNA fingerprints are being used to match a suspect’s DNA to DNA found at the scene of a crime.

In the end, despite universally accepted methods, DNA fingerprinting still is not foolproof. Numerous incidences of human error—accidental as well as intentional—have been documented, ranging from mislabeled test tubes to tissue from a suspect being added to evidence from a crime scene. So we should not blindly draw conclusions solely from this one type of evidence. Nonetheless, DNA fingerprinting is an increasingly valuable tool for law enforcement. The FBI, for example, has reported that nearly one-third of their suspects are cleared immediately by DNA testing, and that many more criminals, because of DNA fingerprinting, now plead guilty to the crimes they have committed.

TAKE-HOME MESSAGE 5.18

Comparisons of highly variable DNA regions can be used to identify tissue specimens and determine the individual from whom they came.

What is an STR, and how is the analysis of STRs useful?

224