13.2 The Genetic Toolkit for Drosophila Development
Animal genomes typically contain about 13,000 to 22,000 genes. Many of these genes encode proteins that function in essential processes in all cells of the body (for example, in cellular metabolism or the biosynthesis of macromolecules). Such genes are often referred to as housekeeping genes. Other genes encode proteins that carry out the specialized tasks of various organ systems, tissues, and cells of the body such as the globin proteins in oxygen transport or antibody proteins that mediate immunity. Here, we are interested in a different set of genes, those concerned with the building of organs and tissues and the specification of cell types—the genetic toolkit for development that determines the overall body plan and the number, identity, and pattern of body parts.
Toolkit genes of the fruit fly have generally been identified through the monstrosities or catastrophes that arise when they are mutated. Toolkit-gene mutations from two sources have yielded most of our knowledge. The first source consists of spontaneous mutations that arise in laboratory populations. The second source comprises mutations induced at random by treatment with mutagens (such as chemicals or radiation) that greatly increase the frequency of damaged genes throughout the genome. Elegant refinements of the latter approach have made possible systematic searches for mutants that have identified many members of the fly’s genetic toolkit. The members of this toolkit constitute only a small fraction, perhaps several hundred genes, of the roughly 14,000 genes in the fly genome.
KEY CONCEPT
The genetic toolkit for animal development is composed of a small fraction of all genes. Only a small subset of the entire complement of genes in the genome affect development in discrete ways.
Classification of genes by developmental function
One of the first tasks following the execution of a genetic screen for mutations is to sort out those of interest. Many mutations are lethal when hemi- or homozygous because cells cannot survive without products affected by these mutations. The more interesting mutations are those that cause some discrete defect in either the embryonic or the adult body pattern or both. It has proved useful to group the genes affected by mutations into several categories based on the nature of their mutant phenotypes. Many toolkit genes can be classified according to their function in controlling the identity of body parts (for example, of different segments or appendages), the formation of body parts (for example, of organs and appendages), the number of body parts, the formation of cell types, and the organization of the primary body axes (the anteroposterior, or A–P, and dorsoventral, or D–V, axes).
We will begin our inventory of the Drosophila toolkit by examining the genes that control the identity of segments and appendages. We do so for both historical and conceptual purposes. The genes controlling segmental and appendage identity were among the very first toolkit genes identified. Subsequent discoveries about their nature were sources of profound insights into not just how their products work, but also the content and workings of the toolkits of most animals. Furthermore, their spectacular mutant phenotypes indicate that they are among the most globally acting genes that affect animal form. Learning about these genes should whet our appetites for learning more about the whole toolkit that controls the development of animal form.
Homeotic genes and segmental identity
Among the most fascinating abnormalities to be described in animals are those in which one normal body part is replaced by another. Such homeotic transformations have been observed in many species in nature, including sawflies in which a leg forms in place of an antenna and frogs in which a thoracic vertebra forms in place of a cervical vertebra (Figure 13-3). Whereas only one member of a bilateral pair of structures is commonly altered in many naturally occurring variants, both members of a bilateral pair of structures are altered in homeotic mutants of fruit flies. In the former case, the alteration is not heritable, but homeotic mutants breed true from generation to generation.
Figure 13-3: In a homeotic transformation, one body part is replaced by another
Figure 13-3: A late-nineteenth-century drawing from one of the first studies of homeotic transformations in nature. (a) Homeosis in a sawfly, with the left antenna transformed into a leg. (b) Homeosis in a frog. The middle specimen is normal. The specimen on the left has extra structures growing out of the top of the vertebral column. The specimen on the right has an extra set of vertebrae.
[From W. Bateson, Material for the Study of Variation. Macmillan, 1894.]
The scientific fascination with homeotic mutants stems from three properties. First, it is amazing that a single gene mutation can alter a developmental pathway so dramatically. Second, it is striking that the structure formed in the mutant is a well-developed likeness of another body part. And, third, it is important to note that homeotic mutations transform the identity of serially reiterated structures. Insect and many animal bodies are made of repeating parts of similar structure, like building blocks, arranged in a series. The forewings and hind wings, the segments, and the antennae, legs, and mouthparts of insects are sets of serially reiterated body parts. Homeotic mutations transform identities within these sets.
A mutation may cause a loss of homeotic gene function where the gene normally acts or it may cause a gain of homeotic function where the homeotic gene does not normally act. For example, the Ultrabithorax (Ubx) gene acts in the developing hind wing to promote hind-wing development and to repress forewing development. Loss-of-function mutations in Ubx transform the hind wing into a forewing. Dominant gain-of-function mutations in Ubx transform the forewing into a hind wing. Similarly, the antenna-to-leg transformations of Antennapedia (Antp) mutants are caused by the dominant gain of Antp function in the antenna. In addition to these transformations in appendage identity, homeotic mutations can transform segment identity, causing one body segment of the adult or larva to resemble another.
Although homeotic genes were first identified through spontaneous mutations affecting adult flies, they are required throughout most of a fly’s development. Systematic searches for homeotic genes have led to the identification of eight loci, now referred to as Hox genes, that affect the identity of segments and their associated appendages in Drosophila. Generally, the complete loss of any Hox-gene function is lethal in early development. The dominant mutations that transform adults are viable in heterozygotes because the wild-type allele provides normal gene function to the developing animal.
Organization and expression of Hox genes
A most intriguing feature of Hox genes is that they are clustered together in two gene complexes that are located on the third chromosome of Drosophila. The Bithorax complex contains three Hox genes, and the Antennapedia complex contains five Hox genes. Moreover, the order of the genes in the complexes and on the chromosome corresponds to the order of body regions, from head to tail, that are influenced by each Hox gene (Figure 13-4).
Figure 13-4: Hox genes regulate the identity of body parts
Figure 13-4: The Hox genes of Drosophila. Eight Hox genes regulate the identity of regions within the adult. The color coding identifies the segments and structures that are affected by mutations in the various Hox genes.
[S. B. Carroll, J. K. Grenier, and S. D. Weatherbee, From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design, 2nd ed. Blackwell, 2005.]
The relation between the structure of the Hox-gene complexes and the phenotypes of Hox-gene mutants was illuminated by the molecular characterization of the genes. Molecular cloning of the sequences encompassing each Hox locus provided the means to analyze where in the developing animal each gene is expressed. These spatial aspects of gene expression and gene regulation are crucial to understanding the logic of the genetic control of development. In regard to the Hox genes and other toolkit genes, the development of technology that made possible the visualization of gene and protein expression was crucial to understanding the relation among gene organization, gene function, and mutant phenotypes.
Two principal technologies for the visualization of gene expression in embryos or other tissues are (1) the expression of RNA transcripts visualized by in situ hybridization and (2) the expression of proteins visualized by immunological methods. Each technology depends on the isolation of cDNA clones representing the mature mRNA transcript and protein (Figure 13-5).
Figure 13-5: Methods for visualizing gene expression in developing animals
Figure 13-5: The two principal technologies for visualizing where a gene is transcribed or where the protein that it encodes is expressed are (left) in situ hybridization of complementary RNA probe to mRNA and (right) immunolocalization of protein expression. The procedures for each method are outlined. Expression patterns may be visualized as the product of an enzymatic reaction or of a chromogenic substrate or with fluorescently labeled compounds.
In the developing embryo, the Hox genes are expressed in spatially restricted, sometimes overlapping domains within the embryo (Figure 13-6). The genes are also expressed in the larval and pupal tissues that will give rise to the adult body parts.
The patterns of Hox-gene expression (and other toolkit genes) generally correlate with the regions of the animal affected by gene mutations. For example, the dark blue shading in Figure 13-6 indicates where the Ubx gene is expressed. This Hox gene is expressed in the posterior thoracic and most of the abdominal segments of the embryo. The development of these segments is altered in Ubx mutants. Ubx is also expressed in the developing hind wing but not in the developing forewing (Figure 13-7), as one would expect knowing that Ubx promotes hind-wing development and represses forewing development in this appendage.
Figure 13-6: Hox genes are expressed in spatially restricted domains
Figure 13-6: Expression of Hox genes in the Drosophila embryo. (a) Schematic representation of Drosophila embryo showing regions where eight individual Hox genes are expressed. (b) Actual image of the expression of seven Hox genes visualized by in situ hybridization. Colors indicate expression of labial (turquoise), Deformed (lavender), Sex combs reduced (green), Antennapedia (orange), Ultrabithorax (dark blue), Abdominal-A (red), and Abdominal-B (yellow). The embryo is folded so that the posterior end (yellow) appears near the top center.
[(b) Dave Kosman, Ethan Bier, and Bill McGinnis.]
KEY CONCEPT
The spatial expression of toolkit genes is usually closely correlated with the regions of the animal affected by gene mutations.
It is crucial to distinguish the role of Hox genes in determining the identity of a structure from that governing its formation. In the absence of function of all Hox genes, segments form, but they all have the same identity; limbs also can form, but they have antennal identity; and, similarly, wings can form, but they have forewing identity. Other genes control the formation of segments, limbs, and wings and will be described later. First, we must understand how Hox genes exert their dramatic effects on fly development.
Figure 13-7: Hox genes are expressed in structures affected by Hox-gene mutations
Figure 13-7: An example of Hox-gene expression. (a) The adult wing of D. melanogaster. (b) Ubx protein is not expressed in cells of the developing imaginal disk that will form the forewing. Cells enriched in Hox proteins are stained green; in this image, the green-stained cells are cells that do not form the wing. (c) The adult hind wing (haltere). (d) The Ubx protein is expressed at high levels in all cells of the developing hind-wing imaginal disk.
[Scott Weatherbee.]
The homeobox
Because Hox genes have large effects on the identities of entire segments and other body structures, the nature and function of the proteins that they encode are of special interest. Edward Lewis, a pioneer in the study of homeotic genes, noted early on that the clustering of Bithorax complex genes suggested that the multiple loci had arisen by tandem duplication of an ancestral gene. This idea led researchers to search for similarities in the DNA sequences of Hox genes. They found that all eight Hox genes of the two complexes were similar enough to hybridize to each other. This hybridization was found to be due to a short region of sequence in each gene, 180 bp in length. This stretch of DNA sequence similarity, because of its presence in homeotic genes, was dubbed the homeobox. The homeobox encodes a protein domain, the homeodomain, containing 60 amino acids. The amino acid sequence of the homeodomain is very similar among the Hox proteins (Figure 13-8).
Figure 13-8: Hox proteins have a sequence in common
Figure 13-8: Sequences of fly homeodomains. All eight Drosophila Hox genes encode proteins containing a highly conserved 60 amino acid domain, the homeodomain, composed of three α helices. Helices 2 and 3 form a helix-turn-helix motif similarly to the Lac repressor, Cro, and other DNA-binding proteins. Residues common to the Hox genes are shaded in yellow; divergent residues are shaded in red; those common to subsets of proteins are shaded in blue or green.
[S. B. Carroll, J. K. Grenier, and S. D. Weatherbee, From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design, 2nd ed. Blackwell, 2005.]
Although the discovery of a common protein motif in each of the Hox proteins was very exciting, further analysis of the structure of the homeodomain revealed that it forms a helix-turn-helix motif—the structure common to the Lac repressor, the λ repressor, Cro, and the α2 and a1 regulatory proteins of the yeast mating-type loci! This similarity suggested immediately (and it was subsequently borne out) that Hox proteins are sequence-specific DNA-binding proteins and that they exert their effects by controlling the expression of genes within developing segments and appendages. Thus, the products of these remarkable genes function through principles that are already familiar from Chapters 11 and 12—by binding to regulatory elements of other genes to activate or repress their expression. We will see that it is also true of many other toolkit genes: a significant fraction of these genes encode transcription factors that control the expression of other genes.
KEY CONCEPT
Many toolkit genes encode transcription factors that regulate the expression of other genes.
We will examine how Hox proteins and other toolkit proteins orchestrate gene expression in development a little later. First, there is one more huge discovery to describe, which revealed that what we learn from fly Hox genes has very general implications for the animal kingdom.
Clusters of Hox genes control development in most animals
When the homeobox was discovered in fly Hox genes, it raised the question whether this feature was some peculiarity of these bizarre fly genes or was more widely distributed, in other insects or segmented animals, for example. To address this possibility, researchers searched for homeoboxes in the genomes of other insects, as well as earthworms, frogs, cows, and even humans. They found many homeoboxes in each of these animal genomes.
The similarities in the homeobox sequences from different species were astounding. Over the 60 amino acids of the homeodomain, some mouse and frog Hox proteins were identical with the fly sequences at as many as 59 of the 60 positions (Figure 13-9). In light of the vast evolutionary distances between these animals, more than 500 million years since their last common ancestor, the extent of sequence similarity indicates very strong pressure to maintain the sequence of the homeodomain.
Figure 13-9: Drosophila and vertebrate Hox protein show striking similarities
Figure 13-9: The sequences of the Drosophila Deformed protein homeodomain and of several members of the vertebrate Hox group 4 genes are strikingly similar. Residues in common are shaded in yellow; divergent residues are shaded in red; residues common to subsets of proteins are shaded in blue. The very similar C-terminal flanking regions outside of the homeodomain are shaded in green.
[S. B. Carroll, J. K. Grenier, and S. D. Weatherbee, From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design, 2nd ed., Blackwell, 2005.]
The existence of Hox genes with homeoboxes throughout the animal kingdom was entirely unexpected. Why different types of animals would possess the same regulatory genes was not obvious, which is why biologists were further surprised by the results when the organization and expression of Hox genes was examined in other animals. In vertebrates, such as the laboratory mouse, the Hox genes also are clustered together in four large gene complexes on four different chromosomes. Each cluster contains from 9 to 11 Hox genes, a total of 39 Hox genes altogether. Furthermore, the order of the genes in the mouse Hox complexes parallels the order of their most related counterparts in the fly Hox complexes, as well as in each of the other mouse Hox clusters (Figure 13-10a). This correspondence indicates that the Hox complexes of insects and vertebrates are related and that some form of Hox complex existed in their distant common ancestor. The four Hox complexes in the mouse arose by duplications of entire Hox complexes (perhaps of entire chromosomes) in vertebrate ancestors.
Figure 13-10: The order of Hox genes parallels the order of body parts in which they are expressed
Figure 13-10: Like those of the fruit fly, vertebrate Hox genes are organized in clusters and expressed along the anteroposterior axis. (a) In the mouse, four complexes of Hox genes, comprising 39 genes in all, are present on four different chromosomes. Not every gene is represented in each complex; some have been lost in the course of evolution. (b) The Hox genes are expressed in distinct domains along the anteroposterior axis of the mouse embryo. The color shading represents the different groups of genes shown in part a.
[S. B. Carroll, “Homeotic Genes and the Evolution of Arthropods and Chordates,” Nature 376, 1995, 479-485.]
The relationship between adult and embryonic body axes.
Why would such different animals have these sets of genes in common? Their deep, common ancestry indicates that Hox genes play some fundamental role in the development of most animals. That role is apparent from analyses of how the Hox genes are expressed in different animals. In vertebrate embryos, adjacent Hox genes also are expressed in adjacent or partly overlapping domains along the anteroposterior body axis. Furthermore, the order of the Hox genes in the complexes corresponds to the head-to-tail order of body regions in which the genes are expressed (Figure 13-10b).
The Hox-gene expression patterns of vertebrates suggested that they also specify the identity of body regions, and subsequent analyses of Hox-gene mutants have borne this suggestion out. For example, mutations in the Hoxa11 and Hoxd11 genes cause the homeotic transformation of sacral vertebrae to lumbar vertebrae (Figure 13-11). Thus, as in the fly, the loss or gain of function of Hox genes in vertebrates causes transformation of the identity of serially repeated structures. Such results have been obtained in several classes, including mammals, birds, amphibians, and fish. Furthermore, clusters of Hox genes have been shown to govern the patterning of other insects and to be deployed in regions along the anteroposterior axis in annelids, molluscs, nematodes, various arthropods, primitive chordates, flatworms, and other animals. Therefore, despite enormous differences in anatomy, the possession of one or more clusters of Hox genes that are deployed in regions along the main body axis is a common, fundamental feature of at least all bilateral animals. Indeed, the surprising lessons from the Hox genes portended what turned out to be a general trend among toolkit genes; that is, most toolkit genes are common to different animals.
Figure 13-11: Hox genes regulate the identity of serially repeated structures in vertebrates
Figure 13-11: The morphologies of different regions of the vertebral column are regulated by Hox genes. (a) In the mouse, six lumbar vertebrae form just anterior to the sacral vertebrae (numbers in red). (b) In mice lacking the function of the posteriorly acting Hoxd11 gene and possessing one functional copy of the Hoxa11 gene, seven lumbar vertebrae form and one sacral vertebra is lost. (c) In mice lacking both Hoxa11 and Hoxd11 function, eight lumbar vertebrae form and two sacral vertebrae are lost.
[Photographs courtesy of Dr. Anne Boulet, HHMI, University of Utah; from S. B. Carroll, J. K. Grenier, and S. D. Weatherbee, From DNA to Diversity: Molecular Genetics and the Evolution of Animal Design, 2nd ed. Blackwell, 2005.]
KEY CONCEPT
Despite great differences in anatomy, many toolkit genes are common to a broad array of different animal phyla.
Now let’s take an inventory of the rest of the toolkit to see what other general principles emerge.