Answers to Concept Checks

WORKED PROBLEMS

Problem 1

A molecule of double-stranded DNA that is 5 million base pairs long has a base composition that is 62% G + C. How many times, on average, are the following restriction sites likely to be present in this DNA molecule?

  • a. BamHI (recognition sequence is GGATCC)
  • b. HindIII (recognition sequence is AAGCTT)
  • c. HpaII (recognition sequence is CCGG)

574

Solution Strategy

What information is required in your answer to the problem?

The number of restriction sites likely to be present in the DNA molecule for each of the specified restriction enzymes.

What information is provided to solve the problem?

  • The size of the DNA molecule.
  • The G + C base composition of the DNA molecule.
  • The recognition sequences for each restriction enzyme.

For help with this problem, review:

Cutting and Joining DNA Fragments in Section 19.2.

Solution Steps

The percentages of G and C are equal in double-stranded DNA; so, if G + C = 62%, then %G = %C = 62%/2 = 31%. The percentage of A + T = (100% − G − C) = 38%, and %A = %T = 38%/2 = 19%. To determine the probability of finding a particular base sequence, we use the multiplication rule, multiplying together the probably of finding each base at a particular site.

Hint: If you know the percentage of any base in the DNA, you can determine the percentages of all the other bases because G = C and A = T.

Recall: The multiplication rule states that the probability of two or more independent events is calculated by multiplying their independent probabilities.

  • a. The probability of finding the sequence GGATCC = 0.31 × 0.31 × 0.19 × 0.19 × 0.31 × 0.31 = 0.0003333. To determine the average number of recognition sequences in a 5-million-base-pair piece of DNA, we multiply 5,000,000 bp × 0.00033 = 1666.5 recognition sequences.
  • b. The number of AAGCTT recognition sequences is 0.19 × 0.19 × 0.31 × 0.31 × 0.19 × 0.19 × 5,000,000 = 626 recognition sequences.
  • c. The number of CCGG recognition sequences is 0.31 × 0.31 × 0.31 × 0.31 × 5,000,000 = 46,176 recognition sequences.

Problem 2

You are given the following DNA fragment to sequence: 5′-GCTTAGCATC-3′. You first clone the fragment in bacterial cells to produce sufficient DNA for sequencing. You isolate the DNA from the bacterial cells and carry out the dideoxy-sequencing method. You then separate the products of the polymerization reactions by gel electrophoresis. Draw the bands that should appear on the gel from the four sequencing reactions.

Solution Strategy

What information is required in your answer to the problem?

The positions of the bands on the sequencing gel.

What information is provided to solve the problem?

The base sequence of the DNA fragment to be sequenced.

For help with this problem, review:

DNA Sequencing in Section 19.4.

Solution Steps

The first task is to write out the sequence of the newly synthesized fragment, which will be complementary and antiparallel to the original fragment. The original sequence is 5′–GCTTAGCATC–3′, so the newly synthesized sequence will be:

Original (template) sequence:  5′–GCTTAGCATC–3′

Newly synthesized sequence:  3′–CGAATCGTAG–5′

Recall: In dideoxy sequencing, a new DNA strand is synthesized and that strand is what is sequenced. Thus, the bands that appear on the gel represent the complement of the original sequence.

Thus, the sequence of the newly synthesized strand, written 5′→3′ is: 5′-GATGCTAAGC-3′. Bands representing this sequence will appear on the gel, with the bands representing nucleotides near the 5' end of the molecule at the bottom of the gel.

Hint: Small fragments, those nearer the 5' end of the newly synthesized strand, will migrate faster and will appear near the bottom of the gel.

575