Noncoding sequences called introns often appear between genes in eukaryotic chromosomes

Differences between prokaryotic and eukaryotic transcription are revealed when mRNA probes from prokaryotes and eukaryotes are incubated with their respective chromosomal DNAs:

300

The discovery that some stretches of eukaryotic DNA sequence are excluded from the mRNA that reaches the ribosome for translation initiated the question of whether this “extra” DNA actually gets transcribed. Does transcription “skip” these “extra” sequences, or are they transcribed and then somehow edited out of the mRNA transcript before it arrives at the ribosome? To find out, an experiment can be conducted in which the initial mRNA transcript in the cell nucleus—the precursor mRNA, or pre-mRNA (see Figure 14.2)—is hybridized with chromosomal DNA. If the pre-mRNA exhibits full, linear, loopfree hybridization with the template DNA, we may conclude that the noncoding, intervening regions of DNA, known as introns, do get transcribed but are then spliced out of the pre-mRNA in the nucleus. The experiment demonstrates that only the coding sequences (exons) remain in the mRNA that reaches the ribosome. The step of editing out the introns is known as RNA processing and is shown in Figure 14.7.

image
Figure 14.7 Transcription of a Eukaryotic Gene The β-globin gene diagrammed here is about 1,600 base pairs (bp) long. The three exons—the protein-coding sequences—contain codons for 146 amino acids plus a stop codon. The two introns—noncoding sequences of DNA containing almost 1,000 bp among them—are initially transcribed but are spliced out of the pre-mRNA transcript.

Introns interrupt, but do not scramble, the DNA sequence of a gene. The base sequences of the exons in the template strand, if joined and taken in order, form a continuous sequence that is complementary to that of the mature mRNA. In some cases, the separated exons often encode different functional regions, or domains, of the protein. For example, the globin polypeptides that make up hemoglobin each have two domains: one for binding to a nonprotein pigment called heme, and another for binding to the other globin subunits. These two domains are encoded by different exons in the globin genes. Most (but not all) eukaryotic genes contain introns, and in rare cases, introns are also found in prokaryotes. The largest human gene encodes a muscle protein called titin; it has 363 exons, which together code for 38,138 amino acids.