7.2 DNA Structure

Even before the structure of DNA was elucidated, genetic studies indicated that the hereditary material must have three key properties:

  1. Because essentially every cell in the body of an organism has the same genetic makeup, faithful replication of the genetic material at every cell division is crucial. Thus, the structural features of DNA must allow faithful replication. These structural features will be considered later in this chapter.

  2. Because it must encode the constellation of proteins expressed by an organism, the genetic material must have informational content. How the information coded in DNA is deciphered to produce proteins will be the subject of Chapters 8 and 9.

  3. Because hereditary changes, called mutations, provide the raw material for evolutionary selection, the genetic material must be able to change on rare occasion. Nevertheless, the structure of DNA must be stable so that organisms can rely on its encoded information. We will consider the mechanisms of mutation in Chapter 16.

DNA structure before Watson and Crick

Consider the discovery of the double-helical structure of DNA by Watson and Crick as the solution to a complicated three-dimensional puzzle. To solve this puzzle, Watson and Crick used a process called “model building” in which they assembled the results of earlier and ongoing experiments (the puzzle pieces) to form the three-dimensional puzzle (the double-helix model). To understand how they did so, we first need to know what pieces of the puzzle were available to Watson and Crick in 1953.

The building blocks of DNA The first piece of the puzzle was knowledge of the basic building blocks of DNA. As a chemical, DNA is quite simple. It contains three types of chemical components: (1) phosphate, (2) a sugar called deoxyribose, and (3) four nitrogenous bases—adenine, guanine, cytosine, and thymine. The sugar in DNA is called “deoxyribose” because it has only a hydrogen atom (H) at the 2′-carbon atom, unlike ribose (a component of RNA), which has a hydroxyl (OH) group at that position. Two of the bases, adenine and guanine, have a double-ring structure characteristic of a type of chemical called a purine. The other two bases, cytosine and thymine, have a single-ring structure of a type called a pyrimidine.

The carbon atoms in the bases are assigned numbers for ease of reference. The carbon atoms in the sugar group also are assigned numbers—in this case, the number is followed by a prime (1′, 2′, and so forth).

The chemical components of DNA are arranged into groups called nucleotides, each composed of a phosphate group, a deoxyribose sugar molecule, and any one of the four bases (Figure 7-5). It is convenient to refer to each nucleotide by the first letter of the name of its base: A, G, C, or T. The nucleotide with the adenine base is called deoxyadenosine 5′-monophosphate, where the 5′ refers to the position of the carbon atom in the sugar ring to which the single (mono) phosphate group is attached.

Figure 7-5: Structure of the four DNA nucleotides
Figure 7-5: These nucleotides, two with purine bases and two with pyrimidine bases, are the fundamental building blocks of DNA. The sugar is called deoxyribose because it is a variation of a common sugar, ribose, that has one more oxygen atom (position indicated by the red arrow).

265

Chargaff’s rules of base composition The second piece of the puzzle used by Watson and Crick came from work done several years earlier by Erwin Chargaff. Studying a large selection of DNAs from different organisms (Table 7-1), Chargaff established certain empirical rules about the amounts of each type of nucleotide found in DNA:

  1. The total amount of pyrimidine nucleotides (T + C) always equals the total amount of purine nucleotides (A + G).

  2. The amount of T always equals the amount of A, and the amount of C always equals the amount of G. But the amount of A + T is not necessarily equal to the amount of G + C, as can be seen in the right-hand column of Table 7-1. This ratio varies among different organisms but is virtually the same in different tissues of the same organism.

Organism

Tissue

Adenine

Thymine

Guanine

Cytosine

Escherichia coli (K12)

26.0

23.9

24.9

25.2

1.00

Diplococcus pneumoniae

29.8

31.6

20.5

18.0

1.59

Mycobacterium tuberculosis

15.1

14.6

34.9

35.4

0.42

Yeast

31.3

32.9

18.7

17.1

1.79

Paracentrotus lividus

(sea urchin)

Sperm

32.8

32.1

17.7

18.4

1.85

Herring

Sperm

278

27.5

22.2

22.6

1.23

Rat

Bone marrow

28.6

28.4

21.4

21.5

1.33

Human

Thymus

30.9

29.4

19.9

19.8

1.52

Human

Liver

30.3

30.3

19.5

19.9

1.53

Human

Sperm

30.7

31.2

19.3

18.8

1.62

* Defined as moles of nitrogenous constituents per 100 g-atoms phosphate in hydrolysate.

Source: Data from E. Chargaff and J. Davidson, eds., The Nucleic Acids. Academic Press, 1955.

Table 7-1: Molar properties of Bases* in DNAs from Various sources

266

X-ray diffraction analysis of DNA The third and most controversial piece of the puzzle came from X-ray diffraction data on DNA structure that were collected by Rosalind Franklin when she was in the laboratory of Maurice Wilkins (Figure 7-6). In such experiments, X rays are fired at DNA fibers, and the scatter of the rays from the fibers is observed by catching the rays on photographic film, on which the X rays produce spots. The angle of scatter represented by each spot on the film gives information about the position of an atom or certain groups of atoms in the DNA molecule. This procedure is not simple to carry out (or to explain), and the interpretation of the spot patterns requires complex mathematical treatment that is beyond the scope of this text. The available data suggested that DNA is long and skinny and that it has two similar parts that are parallel to each other and run along the length of the molecule. The X-ray data showed the molecule to be helical (spiral-like). Unknown to Rosalind Franklin, her best X-ray picture was shown to Watson and Crick by Maurice Wilkins, and it was this crucial piece of the puzzle that allowed them to deduce the three-dimensional structure that could account for the X-ray spot patterns.

Figure 7-6: Rosalind Franklin’s critical experimental result
Figure 7-6: Rosalind Franklin (left) and her X-ray diffraction pattern of DNA (right).
[Left) Science Source; (right) Rosalind Franklin/Science Source.]

267

The double helix

A 1953 paper by Watson and Crick in the journal Nature began with two sentences that ushered in a new age of biology: “We wish to suggest a structure for the salt of deoxyribose nucleic acid (D.N.A.). This structure has novel features which are of considerable biological interest.”1 The structure of DNA had been a subject of great debate since the experiments of Avery and co-workers in 1944. As we have seen, the general composition of DNA was known, but how the parts fit together was not known. The structure had to fulfill the main requirements for a hereditary molecule: the ability to store information, the ability to be replicated, and the ability to mutate.

Figure 7-7: Watson and Crick’s DNA model
Figure 7-7: James Watson and Francis Crick with their DNA model.
[A. Barrington Brown/Science Source.]

The three-dimensional structure derived by Watson and Crick is composed of two side-by-side chains (“strands”) of nucleotides twisted into the shape of a double helix (Figure 7-7). The two nucleotide strands are held together by hydrogen bonds between the bases of each strand, forming a structure like a spiral staircase (Figure 7-8a). The backbone of each strand is formed of alternating phosphate and deoxyribose sugar units that are connected by phosphodiester linkages (Figure 7-8b). We can use these linkages to describe how a nucleotide chain is organized. As already mentioned, the carbon atoms of the sugar groups are numbered 1′ through 5′. A phosphodiester linkage connects the 5′-carbon atom of one deoxyribose to the 3′-carbon atom of the adjacent deoxyribose. Thus, each sugar–phosphate backbone is said to have a 5′-to-3′ polarity, or direction, and understanding this polarity is essential in understanding how DNA fulfills its roles. In the double-stranded DNA molecule, the two backbones are in opposite, or antiparallel, orientation (see Figure 7-8b).

Each base is attached to the 1′-carbon atom of a deoxyribose sugar in the backbone of each strand and faces inward toward a base on the other strand. Hydrogen bonds between pairs of bases hold the two strands of the DNA molecule together. The hydrogen bonds are indicated by dashed lines in Figure 7-8b.

Figure 7-8: The structure of DNA
Figure 7-8: (a) A simplified model showing the helical structure of DNA. The sticks represent base pairs, and the ribbons represent the sugar–phosphate backbones of the two antiparallel chains. (b) An accurate chemical diagram of the DNA double helix, unrolled to show the sugar–phosphate backbones (blue) and base-pair rungs (purple, orange). The backbones run in opposite directions; the 5′ and 3′ ends are named for the orientation of the 5′ and 3′ carbon atoms of the sugar rings. Each base pair has one purine base, adenine (A) or guanine (G), and one pyrimidine base, thymine (T) or cytosine (C), connected by hydrogen bonds (red dashed lines).

Two complementary nucleotide strands paired in an antiparallel manner automatically assume a double-helical conformation (Figure 7-9), mainly through the interaction of the base pairs. The base pairs, which are flat planar structures, stack on top of one another at the center of the double helix (see Figure 7-9a). Stacking adds to the stability of the DNA molecule by excluding water molecules from the spaces between the base pairs. The most stable form that results from base stacking is a double helix with two distinct sizes of grooves running in a spiral: the major groove and the minor groove, which can be seen in both the ribbon and the space-filling models of Figure 7-9a and 7-9b. Most DNA–protein associations are in major grooves. A single strand of nucleotides has no helical structure; the helical shape of DNA depends entirely on the pairing and stacking of the bases in the antiparallel strands. DNA is a right-handed helix; in other words, it has the same structure as that of a screw that would be screwed into place by using a clockwise turning motion.

Figure 7-9: Two representations of the DNA double helix
Figure 7-9: The ribbon diagram (a) highlights the stacking of the base pairs, whereas the space-filling model (b) shows the major and minor grooves.

The double helix accounted nicely for the X-ray data and successfully accounted for Chargaff’s data. By studying models that they made of the structure, Watson and Crick realized that the observed radius of the double helix (known from the X-ray data) would be explained if a purine base always pairs (by hydrogen bonding) with a pyrimidine base (Figure 7-10). Such pairing would account for the (A + G) = (T + C) regularity observed by Chargaff, but it would predict four possible pairings: T A, T···G, C A, and C···G. Chargaff’s data, however, indicate that T pairs only with A, and C pairs only with G. Watson and Crick concluded that each base pair consists of one purine base and one pyrimidine base, paired according to the following rule: G pairs with C, and A pairs with T.

Figure 7-10: Base pairing in DNA
Figure 7-10: The pairing of purines with pyrimidines accounts exactly for the diameter of the DNA double helix determined from X-ray data. That diameter is indicated by the vertical dashed lines.

268

Note that the G–C pair has three hydrogen bonds, whereas the A–T pair has only two (see Figure 7-8b). We would predict that DNA containing many G–C pairs would be more stable than DNA containing many A–T pairs. In fact, this prediction is confirmed. Heat causes the two strands of DNA double helix to separate (a process called DNA melting or DNA denaturation); DNAs with higher G + C content can be shown to require higher temperatures to melt because of the greater attraction of the G–C pairing.

269

KEY CONCEPT

DNA is a double helix composed of two nucleotide chains held together by complementary pairing of A with T and G with C.

Watson and Crick’s discovery of the structure of DNA is considered by some to be the most important biological discovery of the twentieth century and led to their being awarded the Nobel Prize with Maurice Wilkins in 1962 (Rosalind Franklin died of cancer in 1958 and the prize is not awarded posthumously). The reason that this discovery is considered so important is that the double helix model, in addition to being consistent with earlier data about DNA structure, fulfilled the three requirements for a hereditary substance:

270

  1. The double-helical structure suggested how the genetic material might determine the structure of proteins. Perhaps the sequence of nucleotide pairs in DNA dictates the sequence of amino acids in the protein specified by that gene. In other words, some sort of genetic code may write information in DNA as a sequence of nucleotides and then translate it into a different language of amino acid sequences in protein. Just how it is done is the subject of Chapter 9.

  2. If the base sequence of DNA specifies the amino acid sequence, then mutation is possible by the substitution of one type of base for another at one or more positions. Mutations will be discussed in Chapter 16.

  3. As Watson and Crick stated in the concluding words of their 1953 Nature paper that reported the double-helical structure of DNA: “It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material.”2 To geneticists at the time, the meaning of this statement was clear, as we see in the next section.