DNA Structure and Replication - Complete Interactive Lesson
Part 1: DNA Structure
DNA Structure โ The Molecular Basis of Heredity
Part 1 of 7
Understanding DNA replication requires a thorough understanding of DNA structure. The double helix model, proposed by Watson and Crick in 1953 (based on X-ray crystallography data from Rosalind Franklin and Maurice Wilkins), remains one of the most important discoveries in biology.
Nucleotide Structure
DNA is a polymer of nucleotides. Each nucleotide has three components:
- Deoxyribose sugar (5-carbon sugar lacking an -OH group at the 2' carbon)
- Phosphate group (attached to the 5' carbon of the sugar)
- Nitrogenous base (attached to the 1' carbon of the sugar)
The four DNA bases:
| Base | Type | Rings | Pairs with |
|---|---|---|---|
| Adenine (A) | Purine | 2 rings | Thymine (T) โ 2 hydrogen bonds |
| Guanine (G) | Purine | 2 rings | Cytosine (C) โ 3 hydrogen bonds |
| Thymine (T) | Pyrimidine | 1 ring | Adenine (A) โ 2 hydrogen bonds |
| Cytosine (C) | Pyrimidine | 1 ring | Guanine (G) โ 3 hydrogen bonds |
Chargaff's Rules: In any DNA molecule, the amount of A equals the amount of T, and the amount of G equals the amount of C. This 1:1 ratio (A=T, G=C) was critical evidence for the base-pairing model.
Key structural features of the double helix:
- Two antiparallel strands (one runs 5' โ 3', the other 3' โ 5')
- Sugar-phosphate backbones on the outside; bases face inward
- Bases held together by hydrogen bonds (A-T: 2 H-bonds; G-C: 3 H-bonds)
- The helix has a major groove and a minor groove (important for protein-DNA interactions)
- One complete turn = 10 base pairs = 3.4 nm
- Diameter = 2 nm
Checkpoint โ DNA Structure
DNA Packaging in Eukaryotes
Human cells contain about 6.4 billion base pairs of DNA (~2 meters per cell if stretched out). This must be packaged into a nucleus only ~6 (\mu)m in diameter โ a compaction ratio of ~10,000:1.
Packaging hierarchy:
- Nucleosome โ 147 bp of DNA wraps ~1.65 times around a histone octamer (2 copies each of H2A, H2B, H3, H4); the fundamental unit of chromatin (11 nm "beads on a string")
- Linker histone H1 binds between nucleosomes, helping them pack into a 30 nm fiber
- Looped domains โ 30 nm fiber forms loops of 30,000-100,000 bp attached to a protein scaffold
- Heterochromatin โ maximally condensed, transcriptionally inactive
- Metaphase chromosome โ highest compaction (~10,000-fold), visible under a light microscope
Epigenetics and Histones: Chemical modifications to histones (acetylation, methylation, phosphorylation) alter chromatin structure and gene accessibility. Histone acetylation (by HATs) generally OPENS chromatin (euchromatin, active). Histone deacetylation (by HDACs) CLOSES it (heterochromatin, silenced).
Key Terms โ DNA Structure
Exit Ticket โ DNA Structure
Part 2: Semiconservative Replication
Semiconservative Replication
Part 2 of 7
Three models for DNA replication were proposed:
- Conservative โ the original double helix remains intact; a completely new copy is made
- Semiconservative โ each new double helix consists of one original (parental) strand and one new strand
- Dispersive โ parental and new DNA are interspersed throughout both strands
The Meselson-Stahl experiment (1958) elegantly determined which model is correct.
The Meselson-Stahl Experiment
Design:
- E. coli were grown for many generations in medium containing heavy nitrogen ((^{15})N) โ all DNA became uniformly "heavy"
- Cells were transferred to medium containing light nitrogen ((^{14})N)
- DNA was extracted after each generation and centrifuged in a CsCl density gradient
Results:
| Generation | DNA bands observed | Interpretation |
|---|---|---|
| 0 (all (^{15})N) | One heavy band | All DNA is heavy |
| 1 (one round of replication in (^{14})N) | One intermediate band | Each DNA molecule has one heavy and one light strand |
| 2 | Half intermediate, half light | Half the molecules retain a heavy strand; half are entirely light |
Part 3: Enzymes of Replication
The Replication Machinery
Part 3 of 7
DNA replication requires a team of enzymes and proteins working together at the replication fork โ the Y-shaped region where the parental double helix is unwound and new strands are synthesized.
Key Enzymes and Proteins
| Enzyme/Protein | Function |
|---|---|
| Helicase | Unwinds the double helix by breaking hydrogen bonds between bases; moves along the DNA using ATP hydrolysis |
| Single-strand binding proteins (SSB) | Bind to exposed single-stranded DNA to prevent reannealing and protect from nuclease degradation |
| Topoisomerase (Gyrase in prokaryotes) | Relieves torsional strain (supercoiling) ahead of the replication fork by cutting, swiveling, and re-joining the DNA backbone |
| Primase | RNA polymerase that synthesizes a short RNA primer (5-10 nucleotides) complementary to the template; provides the free 3'-OH needed by DNA polymerase |
| DNA Polymerase III (prokaryotes) | The main replicative polymerase; adds nucleotides to the 3' end of the primer/growing strand in the 5' โ 3' direction |
| DNA Polymerase I (prokaryotes) | Removes RNA primers (5' โ 3' exonuclease activity) and replaces them with DNA |
| DNA Ligase | Seals the nick (phosphodiester bond) between adjacent Okazaki fragments after primer removal |
Part 4: Leading vs Lagging Strand
Proofreading and DNA Repair
Part 4 of 7
DNA replication must be extraordinarily accurate โ the error rate is approximately 1 mistake per 10(^{10}) base pairs in E. coli. This remarkable fidelity is achieved through three layers of error correction.
Three Layers of Replication Fidelity
Layer 1: Base selection by DNA polymerase (~10(^5) accuracy)
- DNA polymerase has a tight active site that favors correct Watson-Crick base pairs (A-T, G-C)
- Incorrect bases fit poorly and are rejected before incorporation
- Error rate: ~1 in 100,000
Layer 2: Proofreading (3' โ 5' exonuclease activity, ~10(^2) improvement)
- DNA polymerase has a built-in editor: if a wrong nucleotide is incorporated, the polymerase detects the mismatch (distortion in the helix)
- The polymerase reverses direction and removes the incorrect nucleotide using 3' โ 5' exonuclease activity
- A correct nucleotide is then inserted
- Combined error rate: ~1 in 10(^7)
Layer 3: Mismatch repair (MMR, ~10(^3) improvement)
- After replication, mismatch repair proteins scan the newly synthesized DNA
- They detect and correct remaining mismatches
- The key challenge: distinguishing which strand has the error (old vs. new strand)
- In E. coli: the parental strand is methylated (GATC sites); the new strand is not yet methylated, so repair enzymes know to fix the new strand
- In eukaryotes: the new strand is identified by the presence of nicks (gaps not yet sealed)
- Combined final error rate: ~1 in 10(^{10})
Checkpoint โ Proofreading and Repair
DNA Damage and Additional Repair Mechanisms
Beyond replication errors, DNA is constantly damaged by environmental and metabolic factors:
Part 5: Proofreading & Repair
Telomeres and the End Replication Problem
Part 5 of 7
Linear chromosomes in eukaryotes face a unique challenge: the end replication problem. This problem does not exist in prokaryotes because their chromosomes are circular.
The End Replication Problem
The problem:
- On the lagging strand, an RNA primer must initiate each Okazaki fragment
- At the very end of the chromosome (3' end of the template), the last RNA primer is synthesized, and DNA polymerase extends from it
- When this primer is removed, there is a short gap at the 5' end of the new strand that CANNOT be filled โ there is no upstream 3'-OH for DNA polymerase to extend from
- Result: the daughter strand is slightly shorter than the parent
Consequence: With each round of replication, chromosomes get shorter at both ends. After many divisions, essential genes near the ends would be lost.
Telomeres โ The Protective Solution
Telomeres are repetitive, non-coding DNA sequences at the ends of linear chromosomes:
- Human telomere repeat: TTAGGG (repeated 1000-2000 times, totaling 5-15 kb)
- Telomeres provide a "buffer zone" of expendable sequence โ shortening removes repeats, not genes
- Telomeres also form a protective structure called a T-loop (the 3' overhang folds back and invades the double-stranded region) with a protein complex called shelterin that prevents the cell from recognizing chromosome ends as DNA breaks
Hayflick Limit: Normal somatic cells can divide approximately 50-70 times before telomeres become critically short. At this point, cells enter replicative senescence (a permanent G(_0) state) or undergo apoptosis. This is a tumor-suppression mechanism.
Checkpoint
Part 6: Problem-Solving Workshop
Problem-Solving Workshop โ DNA Replication
Part 6 of 7
This workshop applies DNA replication concepts to experimental scenarios and quantitative problems.
Scenario 1: Replication Fork Analysis
A researcher treats E. coli with radioactive thymidine ((^3)H-thymidine) for a brief pulse, then chases with unlabeled thymidine. After autoradiography of the replicating DNA:
- The label appears as a band along the newly synthesized DNA
- The leading strand shows a continuous band of label
- The lagging strand shows a series of short labeled segments (Okazaki fragments) with gaps where primers were located
If the pulse is very short (seconds), only the most recently synthesized DNA is labeled. The leading strand shows label near the fork, while the lagging strand shows label in the most recently completed Okazaki fragment.
If the chase is long enough, DNA Pol I replaces primers with DNA and ligase joins fragments, so the lagging strand eventually looks continuous.
Scenario 1 Questions
Scenario 2: Density Gradient Predictions
Starting with one double-stranded DNA molecule where BOTH strands are labeled with (^{15})N (heavy):
After 1 generation in (^{14})N:
- 2 molecules, each with one (^{15})N strand + one (^{14})N strand = 2 intermediate
After 2 generations in (^{14})N:
- 4 molecules total
- 2 have one (^{15})N + one (^{14})N = 2 intermediate
- 2 have both (^{14})N = 2 light
After n generations:
- Total molecules = (2^n)
- Intermediate molecules = (the two original parental strands + a new partner)
Part 7: AP Review
AP Review โ DNA Replication
Part 7 of 7
Comprehensive AP-exam-style questions integrating all DNA replication concepts.
Key Principles Summary
- DNA replication is semiconservative โ each daughter molecule has one parental and one new strand (Meselson-Stahl)
- Replication is bidirectional from origins and proceeds in the 5' โ 3' direction only
- The leading strand is continuous; the lagging strand is discontinuous (Okazaki fragments)
- Primase provides RNA primers; DNA Polymerase III extends; Pol I removes primers; Ligase seals nicks
- Three layers of fidelity: base selection, proofreading (3'โ5' exonuclease), and mismatch repair
- Multiple DNA repair pathways (BER, NER, HR, NHEJ) protect against different types of damage
- Telomeres protect chromosome ends; telomerase counteracts the end replication problem