Advances in biotechnology are linked strongly to the development of physical techniques. The effort to characterize the entire genetic material, or genome, of organisms as simple as bacteria and as complex as Homo sapiens will lead to important new insights into the molecular mechanisms of disease, primarily through the discovery of previously unknown proteins encoded by the deoxyribonucleic acid (DNA) in genes. However, decoding genomic DNA will not always lead to accurate predictions of the amino acids present in biologically active proteins. Many proteins undergo chemical modification, such as cleavage into smaller proteins, after being synthesized in the cell. Moreover, it is known that one piece of DNA may encode more than one active protein. It follows that it is also important to describe the proteome, the full complement of functional proteins of an organism, by characterizing directly the proteins after they have been synthesized and processed in the cell.
The procedures of genomics and proteomics, the analysis of the genome and proteome, of complex organisms are time-consuming because of the very large number of molecules that must be characterized. For example, the human genome contains about 30 000 genes and the number of active proteins is likely to be much larger. Success in the characterization of the genome and proteome of any organism will depend on the deployment of very rapid techniques for the determination of the order in which molecular building blocks are linked covalently in DNA and proteins.
Many macromolecules, such as DNA, are charged and move in response to an electric field. This motion, with a drift speed given by eqn 80.10, is called electrophoresis and it depends on its net charge, size (and hence molar mass), and shape. The latter two factors are implied by the dependence of the drift speed on the frictional force. Consequently, experimental techniques based on electrophoresis are very useful to polymer chemists and biochemists in the characterization of macromolecules.
An important tool in genomics and proteomics is gel electrophoresis, in which biopolymers are separated on a slab of a porous gel, a semirigid dispersion of a solid in a liquid. Because the molecules must pass through the pores in the gel, the larger the macromolecule the less mobile it is in the electric field and, conversely, the smaller the macromolecule the more swiftly it moves through the pores. In this way, gel electrophoresis allows for the separation of components of a mixture according to their molar masses. Two common gel materials for the study of proteins and nucleic acids are agarose and cross-linked polyacrylamide. Agarose has large pores and is better suited for the study of large macromolecules, such as DNA and enzyme complexes. Polyacrylamide gels with varying pore sizes can be made by changing the concentration of acrylamide in the polymerization solution. In general, smaller pores form as the concentration of acrylamide is increased, making possible the separation of relatively small macromolecules by polyacrylamide gel electrophoresis (PAGE).
The separation of very large pieces of DNA, such as chromosomes, by conventional gel electrophoresis is not effective, making the analysis of genomic material rather difficult. Double-stranded DNA molecules are thin enough to pass through gel pores, but long and flexible DNA coils can become trapped in the pores and the result is impaired mobility along the direction of the applied electric field. This problem can be avoided with pulsed-field electrophoresis, in which a brief burst of the electric field is applied first along one direction and then along a perpendicular direction. In response to the switching back and forth between field directions, the DNA coils writhe about and eventually pass through the gel pores. In this way, the mobility of the macromolecule can be related to its molar mass.
We have seen that charge also determines the drift speed. For example, proteins of the same size but different net charge travel along the slab at different speeds. One way to avoid this problem and to achieve separation by molar mass is to denature the proteins in a controlled way. Sodium dodecyl sulfate is an anionic detergent that is very useful in this respect: it denatures proteins, whatever their initial shapes, into rods by forming a complex with them. Moreover, most protein molecules bind a constant number of ions, so the net charge per protein is well regulated. Under these conditions, different proteins in a mixture may be separated according to size only. The molar mass of each constituent protein is estimated by comparing its mobility in its rod-like complex form with standard sample of known molar mass. However, molar masses obtained by this method, often referred to as SDS-PAGE when polyacrylamide gels are used, are not as accurate as those obtained by mass spectrometry.
Another technique that deals with the effect of charge on drift speed takes advantage of the fact that the overall charge of proteins and other biopolymers depends on the pH of the medium. For instance, in acidic environments protons attach to basic groups and the net charge is positive; in basic media the net charge is negative as a result of proton loss. At the isoelectric point, the pH is such that there is no net charge on the biopolymer. Consequently, the drift speed of a biopolymer depends on the pH of the medium, with s = 0 at the isoelectric point (Fig. 1). Isoelectric focusing is an electrophoresis method that exploits the dependence of drift speed on pH. In this technique, a mixture of proteins is dispersed in a medium with a pH gradient along the direction of an applied electric field. Each protein in the mixture will stop moving at a position in the gradient where the pH is equal to the isoelectric point. In this manner, the protein mixture can be separated into its components.
Fig. 1. The plot of drift speed of the protein bovine serum albumin in water against pH. The isoelectric point of the macromolecule corresponds to the pH at which the drift speed in the presence of an electric field is zero.