Nucleic acids are polymers that store, transmit, and express hereditary (genetic) information. This information is encoded in the sequences of monomers that make up nucleic acids. There are two types of nucleic acids: DNA (deoxyribonucleic acid) and RNA (ribonucleic acid). DNA stores and transmits genetic information. Through RNA intermediates, the information encoded in DNA is used to specify the amino acid sequences of proteins. As you will see later in this chapter, proteins are essential for both metabolism and structure. Certain specialized RNA molecules also play roles in metabolism. Ultimately, nucleic acids and the proteins encoded by them determine the metabolic functions of an organism.
Nucleic acids are polymers composed of monomers called nucleotides. A nucleotide consists of three components: a nitrogen-containing base, a pentose sugar, and one to three phosphate groups (FIGURE 3.1). Molecules consisting of a pentose sugar and a base—but no phosphate group—are called nucleosides. The nucleotides that make up nucleic acids contain just one phosphate group—they are nucleoside monophosphates.
Go to ACTIVITY 3.1 Nucleic Acid Building Blocks
PoL2e.com/ac3.1
The bases of the nucleic acids take one of two chemical forms: a six-membered single-ring structure called a pyrimidine, or a fused double-ring structure called a purine (see Figure 3.1). In DNA, the pentose sugar is deoxyribose, which differs from the ribose found in RNA by the absence of one oxygen atom (see Figure 2.9).
During the formation of a nucleic acid, new nucleotides are added to an existing chain one at a time. The pentose sugar in the last nucleotide of the existing chain and the phosphate on the new nucleotide undergo a condensation reaction (see Figure 2.8) and the resulting linkage is called a phosphodiester bond. The phosphate on the new nucleotide is attached to the 5′ (5 prime) carbon atom of its sugar, and the bond occurs between it and the 3’ (3 prime) carbon on the last sugar of the existing chain. Because each nucleotide is added to the 3′ carbon of the last sugar, nucleic acids are said to grow in the 5′ to 3 direction (FIGURE 3.2).
Nucleic acids can be oligonucleotides, with a few to about 20 nucleotide monomers, or longer polynucleotides:
In addition to differing in their sugar groups, DNA and RNA also differ in their bases and general structures (TABLE 3.1). Four bases are found in DNA: adenine (A), cytosine (C), guanine (G), and thymine (T). RNA also contains adenine, cytosine, and guanine, but the fourth base in RNA is uracil (U) rather than thymine. The lack of a hydroxyl group at the 2′ position of the deoxyribose sugar in DNA makes the structure of DNA less flexible than that of RNA. As we describe below, DNA is composed of two polynucleotide strands whereas RNA is usually single-stranded. However, a long RNA can fold up on itself, forming a variety of structures.
39
The key to understanding the structure and function of both DNA and RNA is the principle of complementary base pairing. In DNA, adenine and thymine always pair (A-T), and cytosine and guanine always pair (C-G):
In RNA, the base pairs are A-U and C-G. Base pairs are held together primarily by hydrogen bonds. As you can see, there are polar C O and N—H covalent bonds in the nucleotide bases (see Concept 2.2 for a discussion of polar covalent bonds). Hydrogen bonds form between the partial negative charge (δ−) on an oxygen or nitrogen atom of one base, and the partial positive charge (δ+) on a hydrogen atom of another base. Complementary base pairing occurs because the arrangements of polar bonds in the nucleotide bases favor the pairing of bases as they occur (C with G, and A with U or T).
Individual hydrogen bonds are relatively weak, but there are so many of them in DNA and RNA that collectively they provide a considerable force of attraction. However, this attraction is not as strong as that provided by multiple covalent bonds. This means that base pairs are relatively easy to separate with a modest input of energy. As you will see in Chapters 9 and 10, the breaking and making of hydrogen bonds in nucleic acids is vital to their roles in living systems. Let’s now look in a little more detail at the structures of RNA and DNA.
RNA Usually, RNA is single-stranded (FIGURE 3.3A). However, many single-stranded RNA molecules fold up into three-dimensional structures, because of hydrogen bonding between nucleotides in separate portions of the molecules (FIGURE 3.3B). An RNA strand can also fold back on itself to form a double-stranded helix. This results in a three-dimensional surface for the bonding and recognition of other molecules. It is important to realize that this folding occurs by complementary base pairing, and the structure is thus determined by the particular order of bases in the RNA molecule.
DNA Usually, DNA is double-stranded; that is, it consists of two separate polynucleotide strands of the same length (FIGURE 3.4A). The two polynucleotide strands are antiparallel: they run in opposite directions so that their 5′ ends are at opposite ends of the double-stranded molecule. In contrast to RNA’s diversity in three-dimensional structure, DNA is remarkably uniform. The A-T and G-C base pairs are about the same size (each is a purine paired with a pyrimidine), and the two polynucleotide strands form a “ladder” that twists into a double helix (FIGURE 3.4B). The sugar–phosphate groups form the sides of the ladder, and the bases with their hydrogen bonds form the rungs on the inside. The double helix is almost always right-handed:
40
Go to ACTIVITY 3.2 DNA Structure
PoL2e.com/ac3.2
DNA is a purely informational molecule. The information is encoded in the sequence of bases carried in its strands. For example, the information encoded in the sequence TCAGCA is different from the information in the sequence CCAGCA. DNA has two functions in terms of information:
The details of these important processes are described in Chapters 9 and 10, but it is important to realize several things at this point:
41
5′-TCAGCA-3′
3′-AGTCGT-5′
Because DNA carries hereditary information from one generation to the next, a theoretical series of DNA molecules stretches back through the lineage of every organism to the beginning of biological evolution on Earth, about 3.8 billion years ago. The genomes of organisms gradually accumulate changes in their DNA base sequences over evolutionary time. Therefore closely related living species should have more similar base sequences than species that are more distantly related.
Over the past two decades there have been remarkable developments in technologies for determining the order of nucleotides in DNA molecules (DNA sequencing), and in computer technologies to analyze these sequences. These advances have enabled scientists to determine the entire DNA base sequences of whole organisms, including the human genome, which contains about 3 billion base pairs. These studies have confirmed many of the evolutionary relationships that were inferred from more traditional comparisons of body structure, biochemistry, and physiology. Traditional comparisons had indicated that the closest living relative of humans (Homo sapiens) is the chimpanzee (genus Pan). In fact, the chimpanzee genome shares nearly 99 percent of its DNA base sequence with the human genome. Increasingly, scientists turn to DNA analyses to figure out evolutionary relationships when other comparisons are not possible or are not conclusive. For example, DNA studies revealed a close relationship between starlings and mockingbirds that was not expected on the basis of their anatomy or behavior.
42
For more on the use of DNA sequences to reconstruct the evolutionary history of life, see Concept 16.2
Nucleic acids are largely informational molecules that encode proteins. We will now turn to a discussion of proteins—the most structurally and functionally diverse class of macromolecules.
Go to ANIMATED TUTORIAL 3.1 Macromolecules: Nucleic Acids and Proteins
PoL2e.com/at3.1