Proteins are linear polymers formed by linking the α-carboxyl group of one amino acid to the α-amino group of another amino acid. This type of linkage is called a peptide bond or an amide bond. The formation of a dipeptide from two amino acids is accompanied by the loss of a water molecule (Figure 2.13). The equilibrium of this reaction lies on the side of hydrolysis rather than synthesis under most conditions. Hence, the biosynthesis of peptide bonds requires an input of free energy. Nonetheless, peptide bonds are quite stable kinetically because the rate of hydrolysis is extremely slow; the lifetime of a peptide bond in aqueous solution in the absence of a catalyst approaches 1000 years.
A series of amino acids joined by peptide bonds form a polypeptide chain, and each amino acid unit in a polypeptide is called a residue. A polypeptide chain has directionality because its ends are different: an α-amino group is present at one end and an α-carboxyl group at the other. The amino end is taken to be the beginning of a polypeptide chain; by convention, the sequence of amino acids in a polypeptide chain is written starting with the amino-
36
A polypeptide chain consists of a regularly repeating part, called the main chain or backbone, and a variable part, comprising the distinctive side chains (Figure 2.15). The polypeptide backbone is rich in hydrogen-
A unit of mass very nearly equal to that of a hydrogen atom. Named after John Dalton (1766–
Kilodalton (kDa)
A unit of mass equal to 1000 daltons
Most natural polypeptide chains contain between 50 and 2000 amino acid residues and are commonly referred to as proteins. The largest single polypeptide known is the muscle protein titin, which consists of more than 27,000 amino acids. Polypeptide chains made of small numbers of amino acids are called oligopeptides or simply peptides. The mean molecular weight of an amino acid residue is about 110 g mol–1, and so the molecular weights of most proteins are between 5500 and 220,000 g mol–1. We can also refer to the mass of a protein, which is expressed in units of daltons; one dalton is equal to one atomic mass unit. A protein with a molecular weight of 50,000 g mol–1 has a mass of 50,000 daltons, or 50 kDa (kilodaltons).
In some proteins, the linear polypeptide chain is cross-
37
In 1953, Frederick Sanger determined the amino acid sequence of insulin, a protein hormone (Figure 2.17). This work is a landmark in biochemistry because it showed for the first time that a protein has a precisely defined amino acid sequence consisting only of l amino acids linked by peptide bonds. This accomplishment stimulated other scientists to carry out sequence studies of a wide variety of proteins. Currently, the complete amino acid sequences of more than 2,000,000 proteins are known. The striking fact is that each protein has a unique, precisely defined amino acid sequence. The amino acid sequence of a protein is referred to as its primary structure.
A series of incisive studies in the late 1950s and early 1960s revealed that the amino acid sequences of proteins are determined by the nucleotide sequences of genes. The sequence of nucleotides in DNA specifies a complementary sequence of nucleotides in RNA, which in turn specifies the amino acid sequence of a protein. In particular, each of the 20 amino acids of the repertoire is encoded by one or more specific sequences of three nucleotides (Section 4.6).
Knowing amino acid sequences is important for several reasons. First, knowledge of the sequence of a protein is usually essential to elucidating its function (e.g., the catalytic mechanism of an enzyme). In fact, proteins with novel properties can be generated by varying the sequence of known proteins. Second, amino acid sequences determine the three-
38
Examination of the geometry of the protein backbone reveals several important features. First, the peptide bond is essentially planar (Figure 2.18). Thus, for a pair of amino acids linked by a peptide bond, six atoms lie in the same plane: the α-carbon atom and CO group of the first amino acid and the NH group and α-carbon atom of the second amino acid. The nature of the chemical bonding within a peptide accounts for the bond’s planarity. The bond resonates between a single bond and a double bond. Because of this partial double-
The partial double-
Two configurations are possible for a planar peptide bond. In the trans configuration, the two α-carbon atoms are on opposite sides of the peptide bond. In the cis configuration, these groups are on the same side of the peptide bond. Almost all peptide bonds in proteins are trans. This preference for trans over cis can be explained by the fact that steric clashes between groups attached to the α-carbon atoms hinder the formation of the cis configuration but do not arise in the trans configuration (Figure 2.20). By far the most common cis peptide bonds are X Pro linkages. Such bonds show less preference for the trans configuration because the nitrogen of proline is bonded to two tetrahedral carbon atoms, limiting the steric differences between the trans and cis forms (Figure 2.21).
In contrast with the peptide bond, the bonds between the amino group and the α-carbon atom and between the α-carbon atom and the carbonyl group are pure single bonds. The two adjacent rigid peptide units can rotate about these bonds, taking on various orientations. This freedom of rotation about two bonds of each amino acid allows proteins to fold in many different ways. The rotations about these bonds can be specified by torsion angles (Figure 2.22). The angle of rotation about the bond between the nitrogen and the α-carbon atoms is called phi (ϕ). The angle of rotation about the bond between the α-carbon and the carbonyl carbon atoms is called psi (ψ). A clockwise rotation about either bond as viewed from the nitrogen atom toward the α-carbon atom or from the α-carbon atom toward the carbonyl group corresponds to a positive value. The ϕ and ψ angles determine the path of the polypeptide chain.
A measure of the rotation about a bond, usually taken to lie between −180 and +180 degrees. Torsion angles are sometimes called dihedral angles.
39
Are all combinations of ϕ and ψ possible? Gopalasamudram Ramachandran recognized that many combinations are forbidden because of steric collisions between atoms. The allowed values can be visualized on a two-
The ability of biological polymers such as proteins to fold into well-
40