The Molecular Copier
Every time a human cell divides, it must copy 3.2 billion base pairs of DNA with near-perfect accuracy — a task equivalent to copying the entire Encyclopedia Britannica with fewer than one typo. This is accomplished by an elegant molecular machine centered on the replication fork, where the double helix is unwound and both strands are simultaneously copied by a coordinated team of enzymes.
Anatomy of the Replication Fork
Helicase (yellow in the simulation) unwinds the double helix by breaking hydrogen bonds between complementary bases. Single-strand binding proteins coat the exposed single strands to prevent reannealing. Primase (green) synthesizes short RNA primers that provide the 3'-OH group DNA polymerase needs to begin synthesis. DNA polymerase III (cyan on the leading strand, red segments on the lagging strand) then extends these primers by adding complementary deoxynucleotides.
Leading vs Lagging Strand
Because DNA polymerase can only add nucleotides in the 5' to 3' direction, the two strands of the fork are replicated differently. The leading strand — oriented toward the fork — is synthesized continuously as one long polymer. The lagging strand — oriented away from the fork — must be synthesized discontinuously as short Okazaki fragments (1000–2000 bp in prokaryotes, 100–200 bp in eukaryotes), each requiring its own RNA primer. DNA ligase then seals the gaps between fragments.
Error Correction and Fidelity
DNA polymerase achieves remarkable accuracy through a three-tier system. First, the geometric selection of the active site favors correct base pairing (error rate ~10⁻⁴). Second, the 3' to 5' exonuclease proofreading domain immediately removes mismatched bases (improving to ~10⁻⁷). Third, post-replication mismatch repair proteins scan newly synthesized DNA and correct remaining errors (final rate ~10⁻⁹ to 10⁻¹⁰). Adjust the error rate in this simulation to see how each level of correction impacts genome-wide mutation counts.