15: Transcription: DNA-
519
15.1 RNA Polymerases and Transcription Basics
15.2 Transcription in Bacteria
15.3 Transcription in Eukaryotes
In the early 1980s, it was clear that specialized proteins must exist for accurate and regulated mRNA synthesis from particular genes in mammalian cells. However, nobody had been able to identify such “transcription factors” or determine how this process of transcriptional activation works. The breakthrough in my laboratory came when we found out that human cell extracts contained a factor that can discriminate between two templates and somehow program the enzyme that reads DNA to choose the right promoter DNA and ignore all others. But how?
We decided to use a short piece of the active promoter DNA sequence as “bait” to fish out proteins that selectively bind this site. The challenge was to purify this activity away from the other 3,000 DNA-
I’ll never forget the feeling of profound excitement at having shared the discovery of such a fundamental protein in biology and, at the same time, having devised with my lab members a new means to isolate hundreds more of these key gene-
—Robert Tjian, on discovering the first specific eukaryotic transcription factor
520
Information encoded in the DNA of cells and viruses provides the instructions for making the RNA and protein molecules that carry out the activities essential for life. The first step in expression of this information is transcription, the enzymatic production of an exact complementary strand of RNA from a DNA template. Transcription thus involves the transfer of genetic information from DNA to RNA. For protein-
All RNA molecules, except for the RNA genomes of certain viruses, are derived from information stored in DNA. Transcription produces three major kinds of RNA, and many other types of RNA are generated in smaller amounts. As described in Chapter 6, messenger RNAs (mRNAs) encode the amino acid sequence of one or more polypeptides specified by a gene or set of genes. Transfer RNAs (tRNAs) read the information encoded in the mRNA and provide the appropriate amino acid to a growing polypeptide chain during protein synthesis. Ribosomal RNAs (rRNAs) are constituents of ribosomes, the intricate cellular machines that synthesize proteins. Other specialized RNAs have regulatory or catalytic functions or are precursor forms of the three main classes of RNA (Highlight 15-1).
When complete eukaryotic genome sequences became available, molecular biologists were excited to discover the extent of the transcriptome, the entire set of RNA transcripts produced in a cell. Initially, researchers focused on characterizing the transcription products of known genes. These included mRNAs and known stable noncoding RNAs (ncRNAs) such as rRNAs, tRNAs, small nuclear RNAs (snRNAs) involved in pre-
Unexpected levels of complexity began to emerge, however, beginning with the discovery of naturally occurring interfering RNAs, such as small interfering RNAs (siRNAs) and microRNAs (miRNAs), which have roles in the regulation of translation (as we describe in Chapter 22). Using a combination of microarrays and RNA-
In a parallel set of experiments, arrays of synthetic DNA oligonucleotides representing all nonrepetitive sequences in human chromosomes 21 and 22 were used to map the binding sites for three human transcription factors—
Some possible roles of previously undetected transcripts have emerged. For example, long noncoding RNAs (lncRNAs) are produced from regions that are either intergenic or antisense to genes. The functional significance of lncRNAs is not known, although several studies suggest roles for these transcripts in gene regulation. Shorter transcripts, particularly those that originate near gene promoters, fall into two somewhat arbitrary categories: molecules 20 to 200 nucleotides long are called small RNAs (sRNAs), and molecules of 200 to 1,000 nucleotides are called long RNAs (lRNAs). These categories include numerous subfamilies of transcripts defined by their abundance, longevity, and genomic origin. Although the sources of these transcripts are not yet fully worked out, at least some sRNAs may result from aborted or prematurely terminated transcription.
Whether pervasive transcription is simply a consequence of low-
Unlike DNA replication, which involves copying the entire chromosome, transcription is selective. Only particular genes or groups of genes are transcribed at any given time, and some portions of the DNA genome may be transcribed rarely or not at all. The cell directs the transcription machinery to express genetic information as it is needed. Specific regulatory sequences mark the beginning and end of the DNA segments to be transcribed and designate which strand of the double-
As one of the central cellular processes studied by molecular biologists, transcription—