Prokaryotic genomes have distinctive features

Bacterial and archaeal genomes have several notable features:

Beyond these similarities, there is great diversity among these single-celled organisms, reflecting the huge variety of environments in which they are found (described in Chapter 25). Let’s examine prokaryotic genomes in terms of functional and comparative genomics.

FUNCTIONAL GENOMICS As we described in Key Concept 17.1, functional genomics is the biological discipline that assigns functions to the products of gene sequences. You can see the various functions encoded by the genomes of three prokaryotes in Table 17.1. As an example, look at the first bacterial genome sequenced, that of Haemophilus influenzae, which lives in the human respiratory tract and can cause ear infections or, more seriously, meningitis in children. Its single circular chromosome has 1,830,138 bp. In addition to its origin of replication and the genes coding for rRNAs and tRNAs, this bacterial chromosome has 1,727 open reading frames with promoters nearby.

table 17.1 Gene Functions in Three Bacteria
Category Number of genes in:
E. coli H. influenzae M. genitalium
Total protein-coding genes 4,288 1,727 482
Biosynthesis of amino acids 131 68 1
Biosynthesis of cofactors 103 54 5
Biosynthesis of nucleotides 58 53 19
Cell envelope proteins 237 84 17
Energy metabolism 243 112 31
Intermediary metabolism 188 30 6
Lipid metabolism 48 25 6
DNA replication, recombination, and repair 115 87 32
Protein folding 9 6 7
Regulatory proteins 178 64 7
Transcription 55 27 12
Translation 182 141 101
Uptake of molecules from the environment 427 123 34

When the H. influenzae sequence was first announced 20 years ago, only 1,007 (58 percent) of the open reading frames coded for proteins with known functions. Since then, scientists have identified the functions of the rest of the encoded proteins. All of the major biochemical pathways and molecular functions are represented. For example, the genes that encode enzymes involved in glycolysis, fermentation, and electron transport have been identified. Gene sequences that code for membrane proteins have been identified as well, including those involved in active transport. An important finding was that highly infective strains of H. influenzae—but not noninfective strains—have genes for surface proteins that attach the bacterium to the human respiratory tract. These surface proteins are now a focus of research on possible treatments for H. influenzae infections.

COMPARATIVE GENOMICS Soon after the sequence of H. influenzae was announced, smaller (Mycoplasma genitalium: 580,073 bp) and larger (E. coli: 4,639,221 bp) prokaryotic sequences were completed. Thus began the era of comparative genomics. Scientists can identify genes that are present in one bacterium and missing in another, allowing them to relate these genes to bacterial function.

365

M. genitalium, for example, lacks enzymes needed to synthesize amino acids, whereas E. coli and H. influenzae both possess such enzymes. This finding reveals that M. genitalium must obtain all its amino acids from its environment (usually the human urogenital tract). Furthermore, E. coli has 55 genes that encode transcriptional activators, whereas M. genitalium has only 12. This relative lack of control over gene expression suggests that the biochemical flexibility of M. genitalium must be limited compared with that of E. coli.