10.3 Carbohydrates Are Attached to Proteins to Form Glycoproteins

✓ 2 Differentiate among the types of glycoproteins in regard to structure and function.

A carbohydrate group can be covalently attached to a protein to form a glycoprotein. Such modifications are not rare, as 50% of the proteome consists of glycoproteins. We will examine three classes of glycoproteins. In the first class, referred to simply as glycoproteins, the protein constituent is the largest component by weight. This versatile class plays a variety of biochemical roles. Many glycoproteins are components of cell membranes, where they take part in processes such as cell adhesion and the binding of sperm to eggs. Other glycoproteins are formed by linking carbohydrates to soluble proteins. Many of the proteins secreted from cells are glycosylated, or modified by the attachment of carbohydrates, including most proteins present in the serum component of blood.

The second class of glycoproteins comprises the proteoglycans. The protein component of proteoglycans is conjugated to a particular type of polysaccharide called a glycosaminoglycan. Carbohydrates make up a much larger percentage by weight of the proteoglycan compared with simple glycoproteins. Proteoglycans function as structural components and lubricants.

The third class of glycoproteins are the mucins, or mucoproteins, which, like proteoglycans, are predominately carbohydrate. Mucins, a key component of mucus, serve as lubricants. N-Acetylgalactosamine is usually the carbohydrate moiety bound to the protein in mucins. N-Acetylgalactosamine is an example of an amino sugar, so named because an amino group replaces a hydroxyl group.

Carbohydrates May Be Linked to Asparagine, Serine, or Threonine Residues of Proteins

Figure 10.15: Glycosidic bonds between proteins and carbohydrates. A glycosidic bond links a carbohydrate to the side chain of asparagine (N-linked) or to the side chain of serine or threonine (O-linked). The glycosidic bonds are shown in red. abbreviations: GlcNAc, N-acetylglucosamine; GalNac, N-acetylgalactosamine.

In all classes of glycoproteins, sugars are attached either to the amide nitrogen atom in the side chain of asparagine (termed an N-linkage) or to the hydroxyl oxygen atom in the side chain of serine or threonine (termed an O-linkage), as shown in Figure 10.15, a process called glycosylation. All N-linked oligosaccharides have in common a pentasaccharide core consisting of three mannoses, a six-carbon sugar, and two N-acetylglucosamines, a glucosamine in which the nitrogen atom binds to an acetyl group. Additional sugars are attached to this core to form the great variety of oligosaccharide patterns found in glycoproteins (Figure 10.16).

Figure 10.16: N-linked oligosaccharides. A pentasaccharide core (shaded gray) is common to all N-linked oligosaccharides and serves as the foundation for a wide variety of N-linked oligosaccharides, two of which are illustrated: (A) high-mannose type; (B) complex type. Detailed chemical formulas and schematic structures are shown for each type (for the key to the scheme, see Figure 10.17). Abbreviations for sugars: Fuc, fucose; Gal, galactose; GalNAc, N-acetylgalactosamine; GlcNAc, N-acetylglucosamine; Man, mannose; Sia, sialic acid.

178

!clinic! CLINICAL INSIGHT: The Hormone Erythropoietin Is a Glycoprotein

Let us look at a glycoprotein present in the blood serum that has dramatically improved treatment for anemia, particularly that induced by cancer chemotherapy. The glycoprotein hormone erythropoietin (EPO) is secreted by the kidneys and stimulates the production of red blood cells by bone marrow, the tissue in interior of bones. EPO is composed of 165 amino acids and is N-glycosylated at three asparagine residues and O-glycosylated on a serine residue (Figure 10.17), making the mature EPO 40% carbohydrate by weight. Glycosylation enhances the stability of the protein in the blood; unglycosylated protein has only about 10% of the bioactivity of the glycosylated form because the protein is rapidly removed from the blood by the kidney. The availability of recombinant human EPO has greatly aided the treatment of anemias. Some endurance athletes have used recombinant human EPO to increase the red-blood-cell count and hence their oxygen-carrying capacity, a practice prohibited by most professional sports organizations. Drug-testing laboratories are able to distinguish some forms of prohibited recombinant EPO from natural EPO in athletes by detecting differences in the glycosylation patterns through the use of isoelectric focusing.

Figure 10.17: Oligosaccharides attached to erythropoietin. Erythropoietin has oligosaccharides linked to three asparagine residues and one serine residue. The structures shown are approximately to scale. The carbohydrate structures represented in the amino acid residues are depicted symbolically by employing a scheme (shown in the key, which also applies to Figures 10.16, 10.23, and 10.24) that is becoming widely used.

Proteoglycans, Composed of Polysaccharides and Protein, Have Important Structural Roles

The second class of glycoproteins that we consider comprises the proteoglycans, proteins attached to a particular type of polysaccharide called glycosaminoglycans, as stated earlier. The glycosaminoglycan makes up as much as 95% of the biomolecule by weight, and so the proteoglycan resembles a polysaccharide more than a protein. Proteoglycans not only function as lubricants and structural components in connective tissue, but also mediate the adhesion of cells to the extracellular matrix, and bind factors that regulate cell proliferation.

179

The properties of proteoglycans are determined primarily by the glycosaminoglycan component. Many glycosaminoglycans are made of repeating units of disaccharides containing a derivative of an amino sugar, either glucosamine or galactosamine (Figure 10.18). At least one of the two sugars in the repeating unit has a negatively charged carboxylate or sulfate group. The major glycosaminoglycans in animals are chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan sulfate, and hyaluronate. Mucopolysaccharidoses are a collection of diseases, such as Hurler disease, that result from the inability to degrade glycosaminoglycans (Figure 10.19). Although precise clinical features vary with the disease, all mucopolysaccharidoses result in skeletal deformities and reduced life expectancies.

Figure 10.18: Repeating units in glycosaminoglycans. Structural formulas for five repeating units of important glycosaminoglycans illustrate the variety of modifications and linkages that are possible. Amino groups are shown in blue and negatively charged groups in red. Hydrogen atoms have been omitted for clarity. The right-hand structure is a glucosamine derivative in each case. The parent amino sugars, β-d-glucosamine and β-d-galactosamine, are shown for reference.
Figure 10.19: Hurler disease. Formerly called gargoylism, Hurler disease is a mucopolysaccharidosis having symptoms that include wide nostrils, a depressed nasal bridge, thick lips and earlobes, and irregular teeth. In Hurler disease, glycosaminoglycans cannot be degraded. The excess of these molecules is stored in the soft tissue of the facial regions, resulting in the characteristic facial features.

!clinic! CLINICAL INSIGHT: Proteoglycans Are Important Components of Cartilage

The proteoglycan aggrecan and the protein collagen are key components of cartilage. The triple helix of collagen provides structure and tensile strength, whereas aggrecan serves as a shock absorber (Figure 10.20). The protein component of aggrecan is a large molecule composed of 2397 amino acids (Figure 10.21). Many aggrecans are linked together by hyaluronate, a glycosaminoglycan. The aggrecan molecule itself is decorated with the glycosaminoglycans chondroitin sulfate and keratan sulfate. This combination of glycosaminoglycan and protein is especially suited to function as a shock absorber. Water is absorbed on the glycosaminoglycans, attracted by the many negative charges. Aggrecan can cushion compressive forces because the absorbed water enables the proteoglycan to spring back after having been deformed. When pressure is exerted, such as when the foot hits the ground in walking, water is squeezed from the glycosaminoglycan, cushioning the impact. When the pressure is released, the water rebinds. Osteoarthritis, the most common type of arthritis, can result from the proteolytic degradation of aggrecan and collagen in the cartilage in the joints due to inflammation. More typically, the cause is simple “wear and tear.” Glucosamine and chondroitin (Figure 10.18) are widely promoted as a treatment for osteoarthritis, but the benefits to anyone but the producers and marketers of the supplement are very ambiguous.

Figure 10.20: Cartilage acts as a shock absorber. The cartilage of a runner’s foot cushions the impact of each step that she takes. The repeating unit of chondroitin sulfate, a glycosaminoglycan component of cartilage, is shown on the right.
Figure 10.21: The structure of proteoglycan from cartilage. (A) Electron micrograph of a proteoglycan from cartilage (with color added). Proteoglycan monomers emerge laterally at regular intervals from opposite sides of a central filament of hyaluronan. (B) Schematic representation in which G stands for globular domain.

180

Although glycosaminoglycans may not seem to be familiar molecules, they are common throughout the biosphere. Chitin is a glycosaminoglycan found in the exoskeleton of insects, crustaceans, and arachnids and is, next to cellulose, the second most abundant polysaccharide in nature (Figure 10.22). Cephalopods, such as squid, have razor sharp beaks, which are made of extensively cross-linked chitin, to disable and consume prey.

Figure 10.22: Chitin, a glycosaminoglycan, is present in insect wings and the exoskeleton. Glycosaminoglycans are components of the exoskeletons of insects, crustaceans, and arachnids.

!clinic! CLINICAL INSIGHT: Mucins Are Glycoprotein Components of Mucus

The third class of glycoproteins that we examine comprises the mucins (mucoproteins). In mucins, the protein component is extensively glycosylated to serine or threonine residues by N-acetylgalactosamine (Figure 10.7). Mucins are capable of forming large polymeric structures and are common in mucous secretions. These glycoproteins are synthesized by specialized cells in the tracheobronchial, gastrointestinal, and urogenital tracts. Mucins are abundant in saliva where they function as lubricants.

181

A model of a mucin is shown in Figure 10.23A. The defining feature of the mucins is a region of the protein backbone termed the variable number of tandem repeats (VNTR) region, which is rich in serine and threonine residues that are O-glycosylated. Indeed, the carbohydrate moiety can account for as much as 80% of the molecule by weight. A number of core carbohydrate structures are conjugated to the protein component of mucin. Figure 10.23B shows one such structure.

Figure 10.23: Mucin structure. (A) A schematic representation of a mucoprotein. The VNTR region is highly glycosylated, forcing the molecule into an extended conformation. The Cys-rich domains and the D domain facilitate the polymerization of many such molecules. (B) An example of an oligosaccharide that is bound to the VNTR region of the protein. [Information from A. Varki, R. D. Cummings, J. D. Esko, H. H. Freeze, P. Stanley, C. R. Bertozzi, G. W. Hart, and M. E. Etzler, Essentials of Glycobiology, 2d ed. (Cold Spring Harbor Press, 2009), (Part A) p. 117, Fig. 9.1; (Part B) p. 118, Fig. 9.2.]

Mucins adhere to epithelial cells and act as a protective barrier; they also hydrate the underlying cells. In addition to protecting cells from environmental insults, such as stomach acid, inhaled chemicals in the lungs, and bacterial infections, mucins have roles in fertilization, the immune response, and cell adhesion. Mucins are overexpressed in bronchitis and cystic fibrosis, and the overexpression of mucins is also characteristic of adenocarcinomas—cancers of the glandular cells of epithelial origin.

!quickquiz! QUICK QUIZ 2

Which amino acids are used for the attachment of carbohydrates to proteins?

!bio! BIOLOGICAL INSIGHT: Blood Groups Are Based on Protein Glycosylation Patterns

The human ABO blood groups illustrate the effects of glycosyltransferases. Each blood group is designated by the presence of one of the three different carbohydrates, termed A, B, or O, on the surfaces of red blood cells (Figure 10.24). These structures have in common an oligosaccharide foundation called the O (or sometimes H) antigen. The A and B antigens differ from the O antigen by the addition of one extra monosaccharide, either N-acetylgalactosamine (for A) or galactose (for B) through an α-1,3 linkage to a galactose moiety of the O antigen.

Figure 10.24: Structures of A, B, and O oligosaccharide antigens. The carbohydrate structures represented in the upper part of this illustration are depicted symbolically (refer to the key in Figure 10.17).

Specific glycosyltransferases add the extra monosaccharide to the O antigen. There are three common genes encoding such glycosyltransferases, and each person inherits the genes for each type from each parent. The type A transferase specifically adds N-acetylgalactosamine, whereas the type B transferase adds galactose. These enzymes are identical in all but 4 of 354 positions. The O phenotype is the result of a mutation in the O transferase that results in the synthesis of inactive enzyme.

These structures have important implications for blood transfusions and other transplantation procedures. If an antigen not normally present in a person is introduced, the person’s immune system recognizes it as foreign. Adverse reactions can ensue, initiated by the intravascular destruction of the incompatible red blood cells.

182

Why are different blood types present in the human population? Suppose that a pathogenic organism such as a parasite expresses on its cell surface a carbohydrate antigen similar to one of the blood-group antigens. This antigen may not be readily detected as foreign in a person with the blood type that matches the parasite antigen, and the parasite will flourish. However, other people with different blood types will be protected. Hence, there will be selective pressure on human beings to vary blood type to prevent parasitic mimicry and a corresponding selective pressure on parasites to enhance mimicry. The constant “arms race” between pathogenic microorganisms and human beings drives the evolution of diversity of surface antigens within the human population.

Figure 10.25: The formation of a mannose 6-phosphate marker. A glycoprotein destined for delivery to lysosomes acquires a phosphate marker in the Golgi complex in a two-step process. First, N-acetylgalactosamine phosphotransferase adds a phospho-N-acetylglucosamine unit to the 6-OH group of a mannose residue; then, N-acetylgalactosaminidase removes the added sugar to generate a mannose 6-phosphate residue in the core oligosaccharide.

!clinic! CLINICAL INSIGHT: Lack of Glycosylation Can Result in Pathological Conditions

Although the role of carbohydrate attachment to proteins is not known in detail in most cases, data indicate that this glycosylation is important for the processing and stability of these proteins, as it is for EPO. Certain types of muscular dystrophy can be traced to improper glycosylation of dystroglycan, a membrane protein that links the extracellular matrix with the cytoskeleton (Section 1.4). Indeed, an entire family of severe inherited human diseases called congenital disorders of glycosylation has been identified. These pathological conditions reveal the importance of proper modification of proteins by carbohydrates and their derivatives.

An especially clear example of the role of glycosylation is provided by I-cell disease (also called mucolipidosis II), a lysosomal storage disease. Lysosomes are organelles that degrade and recycle damaged cellular components or material brought into the cell by endocytosis. Normally, a carbohydrate marker directs certain digestive enzymes from the Golgi complex to lysosomes where they function. In patients with I-cell disease, lysosomes contain large inclusions of undigested glycosaminoglycans and glycolipids, the “I” in the name of the disease. These inclusions are present because the enzymes normally responsible for the degradation of glycosaminoglycans are missing from affected lysosomes. Remarkably, the enzymes are present at very high levels in the blood and urine. Thus, active enzymes are synthesized, but, in the absence of appropriate glycosylation, they are exported instead of being sequestered in lysosomes. In other words, in I-cell disease, a whole series of enzymes is incorrectly addressed and delivered to the wrong location. Normally, these enzymes contain a mannose 6-phosphate residue as a component of an N-oligosaccharide that serves as the marker directing the enzymes from the Golgi complex to lysosomes. In I-cell disease, however, the attached mannose lacks a phosphate. I-cell patients are deficient in the N-acetylgalactosamine phosphotransferase catalyzing the first step in the addition of the phosphoryl group; the consequence is the mistargeting of eight essential enzymes (Figure 10.25). I-cell disease causes the patient to suffer severe psychomotor retardation and skeletal deformities, similar to those in Hurler disease. Remarkably, mutations in the phosphotransferase have also been linked to stuttering. Why some mutations cause stuttering while others cause I-cell disease is a mystery.