Chapter Introduction

301

CHAPTER 8

Genes, Genomics, and Chromosomes

image
These brightly colored FISH-painted chromosomes are not only beautiful, but also useful in revealing chromosome anomalies and in comparing karyotypes of different species.
[L. Willatt/Science Source.]

OUTLINE

8.1 Eukaryotic Gene Structure

8.2 Chromosomal Organization of Genes and Noncoding DNA

8.3 Transposable (Mobile) DNA Elements

8.4 Genomics: Genome-Wide Analysis of Gene Structure and Function

8.5 Structural Organization of Eukaryotic Chromosomes

8.6 Morphology and Functional Elements of Eukaryotic Chromosomes

In previous chapters, we learned how the structure and composition of proteins allow them to perform a wide variety of cellular functions. We also examined another vital component of cells, the nucleic acids, and the process by which information encoded in the sequence of DNA is translated into protein. In this chapter, our focus again is on DNA and proteins as we consider the characteristics of eukaryotic nuclear genomes: the features of genes and the other DNA sequences that constitute the genome, and how this DNA is structured and organized by proteins within the cell.

By the beginning of the twenty-first century, molecular biologists had completed the sequencing of the entire genomes of hundreds of viruses, scores of bacteria, and one unicellular eukaryote, the budding yeast S. cerevisiae. By now, the vast majority of the genomic sequence is also known for the fission yeast S. pombe, the simple plant A. thaliana, rice, and multiple multicellular animals (metazoans), including the roundworm C. elegans, the fruit fly D. melanogaster, mice, humans, and at least one representative each of the 35 or so metazoan phyla. Detailed analysis of these sequencing data has revealed insights into evolution, genome organization, and gene function. It has also allowed researchers to identify previously unknown genes and to estimate the total number of protein-coding genes in each genome. Comparisons between gene sequences often provide insight into possible functions of newly identified genes. Comparisons of genomic sequence and organization between species also help us understand the evolution of organisms.

Surprisingly, DNA sequencing revealed that large portions of the genomes of metazoans and plants do not encode mRNAs or any other RNAs required by the organism. Remarkably, such noncoding DNA constitutes about 98.5 percent of human chromosomal DNA! The noncoding DNA in multicellular organisms contains many regions that are similar, but not identical, to one another. Variations within some stretches of this repetitious DNA between individuals are so great that every person can be distinguished by a DNA “fingerprint” based on these sequence variations. Moreover, some repetitious DNA sequences are not found in the same positions in the genomes of different individuals of the same species. At one time, all noncoding DNA was collectively termed “junk DNA” and was considered to serve no purpose. We now understand the evolutionary basis of all this extra DNA, and of the variation in location of certain sequences between individuals. Cellular genomes harbor transposable (mobile) DNA elements that can copy themselves and move throughout the genome. Although transposable DNA elements seem to have little function in the life cycle of an individual organism, over evolutionary time they have shaped our genomes and contributed to the rapid evolution of multicellular organisms. Moreover, we now understand that much of the DNA that does not encode proteins or stable RNAs functions as binding sites for protein complexes that regulate gene transcription.

302

The sheer length of cellular DNA is a significant problem with which cells must contend. The DNA in a single human cell, which measures about 2 m in total length, must be contained within nuclei with diameters of less than 10 µm, a compaction ratio of greater than 105 to 1. In relative terms, if a cell were 1 cm in diameter (about the size of a pea), the length of DNA packed into its nucleus would be about 2 km (1.2 miles)! Specialized eukaryotic proteins associated with nuclear DNA exquisitely fold and organize the DNA so that it fits into nuclei. And yet, at the same time, any given portion of this highly compacted DNA can be accessed readily for transcription, replication, and repair of damage without the long DNA molecules becoming tangled or broken. Furthermore, the integrity of DNA must be maintained during the process of cell division when it is partitioned into daughter cells. In eukaryotes, the complex of DNA and the proteins that organize it, called chromatin, can be visualized as individual chromosomes during mitosis. As we will see in this and the following chapter, the organization of DNA into chromatin allows a mechanism for regulation of gene expression that is not available in bacteria.

In the first four sections of this chapter, we provide an overview of the landscape of eukaryotic genes and genomes. First we discuss the structure of eukaryotic genes and the complexities that arise in higher organisms from the processing of mRNA precursors into alternatively spliced mRNAs. Next we discuss the main classes of eukaryotic DNA, including the special properties of transposable DNA elements and how they have shaped contemporary genomes. This background prepares us to discuss genomics, computer-based methods for analyzing and interpreting vast amounts of sequence data. The final two sections of the chapter address how DNA is physically organized in eukaryotic cells. We consider the packaging of DNA and histone proteins into the compact complexes called nucleosomes that are the fundamental building blocks of chromatin, the large-scale structure of chromosomes, and the functional elements required for chromosome duplication and segregation. Figure 8-1 provides an overview of these interrelated subjects. The understanding of genes, genomics, and chromosomes gained in this chapter will prepare us to explore current knowledge about how the synthesis and concentration of each protein and functional RNA in a cell is regulated in the following two chapters.

image
FIGURE 8-1 Overview of the structure of genes and chromosomes. DNA of higher eukaryotes consists of both unique and repeated sequences. Only about 1.5 percent of human DNA encodes proteins and functional RNAs. The remainder includes regulatory sequences that control gene expression, which are scattered through intergenic DNA between genes and in introns within genes. Much of this intergenic DNA, about 45 percent in humans, is derived from mobile DNA elements, genetic symbionts that have contributed to the evolution of contemporary genomes. Each chromosome consists of a single long molecule of DNA (as long as 280 Mb in humans), organized into increasing levels of condensation by the histone and nonhistone proteins with which it is intricately complexed. Each chromosome occupies its own “territory” in the nucleus.