Skip to content

DNA Replication and Mutation

DNA replication follows a semi-conservative mechanism in which each new double helix retains one original strand, with DNA polymerase synthesising the complementary strand at high fidelity. Errors that escape proofreading, together with environmentally induced base changes, generate mutations that underlie the heritable variation exploited for individual identification in forensic biology.

Last updated:

Share

DNA replication is the molecular process by which a cell copies its entire genome before dividing, ensuring that each daughter cell receives a complete and accurate set of genetic instructions. The mechanism is semi-conservative: the two strands of the parental double helix are separated, and each strand serves as a template for the synthesis of a complementary new strand. The resulting two daughter molecules each contain one parental strand and one newly made strand. DNA polymerase catalyses chain extension, but it can only add nucleotides to a pre-existing primer and only in the 5-prime to 3-prime direction, so the two template strands are copied by slightly different mechanisms, producing a leading strand synthesised continuously and a lagging strand assembled in short Okazaki fragments.

Fidelity during replication is high but not perfect. DNA polymerase selects the correct incoming nucleotide by Watson-Crick base-pairing and rejects mismatches through a conformational change before phosphodiester bond formation. When a wrong nucleotide is incorporated, the 3-prime to 5-prime exonuclease proofreading domain of the polymerase excises it before synthesis continues. Errors that survive proofreading are addressed by post-replication mismatch repair. The combined effect is a final error rate of approximately one mutation per billion base pairs per cell division in human cells. Mutations that do occur are heritable if they arise in germline cells, or they are confined to a single tissue lineage if they arise in somatic cells.

Mutation is the source of the genetic variation that makes individual identification possible. The short tandem repeat (STR) loci used in forensic profiling are polymorphic because generations of replication errors, particularly slippage of DNA polymerase across repetitive sequences, have created a population with many different allele lengths at each locus. Point mutations that change a single base are the raw material of single nucleotide polymorphism (SNP) panels used for ancestry inference and kinship analysis. Understanding how mutations arise, what types exist, and how they propagate through populations is foundational to interpreting forensic DNA evidence.

By the end of this topic you will be able to:

  • Describe the semi-conservative mechanism of DNA replication and the roles of helicase, primase, DNA polymerase, and ligase.
  • Explain how DNA polymerase proofreading and mismatch repair together reduce the final mutation rate in human cells.
  • Classify the main categories of mutation (point substitution, insertion, deletion, and tandem repeat slippage) and predict their functional consequences.
  • Identify the molecular mechanisms by which UV radiation, alkylating agents, and ionising radiation induce DNA damage and mutation.
  • Explain how accumulated mutations in STR loci and SNPs generate the heritable variation that forensic DNA profiling detects.
Key terms
Semi-conservative replication
The mode of DNA copying in which each daughter double helix retains one parental strand and acquires one newly synthesised strand, confirmed by Meselson and Stahl in 1958 using nitrogen-15 density labelling.
DNA polymerase
The enzyme that synthesises new DNA by adding deoxyribonucleoside triphosphates complementary to the template strand. In humans, polymerase delta copies the lagging strand and polymerase epsilon copies the leading strand. Both carry a 3-prime to 5-prime exonuclease proofreading domain.
Okazaki fragment
A short segment of DNA (100 to 200 nucleotides in eukaryotes) synthesised on the lagging template strand in the 5-prime to 3-prime direction away from the replication fork. Fragments are joined by DNA ligase after their RNA primers are replaced.
Point mutation
A change in a single nucleotide in the DNA sequence. A transition substitutes one purine for another or one pyrimidine for another (A-T to G-C or vice versa). A transversion substitutes a purine for a pyrimidine or vice versa.
Frameshift mutation
A change in the reading frame of a coding sequence caused by an insertion or deletion of a number of nucleotides that is not a multiple of three. All codons downstream of the change are altered, usually producing a truncated or non-functional protein.
Short tandem repeat (STR) slippage
Replication slippage at a repetitive locus where the template and nascent strands temporarily misalign. If the nascent strand slips forward, one repeat unit is added; if it slips back, a repeat is deleted. Accumulated slippage events over generations create the allelic diversity at STR loci used in forensic profiling.

The semi-conservative replication mechanism

Replication begins at defined chromosomal sites called origins of replication. In bacteria a single origin suffices; the human genome has thousands distributed across all chromosomes, allowing simultaneous initiation at many points and completion of the 6.4 billion base pairs within hours rather than years. At each origin, initiator proteins recruit helicase, which unwinds the double helix by breaking the hydrogen bonds between complementary base pairs. The unwinding creates a replication bubble with two diverging replication forks moving outward in opposite directions.

DNA polymerase cannot start a new chain from scratch. It requires a free 3-prime hydroxyl group on which to add the next nucleotide. Primase, an RNA polymerase, synthesises a short RNA primer (8 to 12 nucleotides) complementary to the template strand. DNA polymerase then extends the primer, adding deoxyribonucleotides one at a time. The leading strand template runs 3-prime to 5-prime relative to the direction of fork movement, so the leading strand is synthesised continuously from a single primer. The lagging strand template runs 5-prime to 3-prime, opposite to fork movement. It is copied in short Okazaki fragments, each requiring its own primer, each extended 5-prime to 3-prime away from the fork. The RNA primers are subsequently removed, the gaps are filled by DNA polymerase, and the nicks are sealed by DNA ligase.

In eukaryotes, the replicated strands must also be assembled into chromatin. Histone chaperones deposit histone octamers onto the new DNA almost immediately after synthesis. Parental histones carrying epigenetic marks are distributed between the two daughter strands, but the precise mechanism of epigenetic inheritance through replication remains an active area of research. Forensic applications that exploit epigenetic marks (for example, tissue-type prediction from methylation patterns) depend on understanding how faithfully those marks are copied.

Replication fidelity: proofreading and mismatch repair

DNA polymerase achieves high fidelity through two sequential mechanisms. First, nucleotide selection: the polymerase undergoes a conformational change around the incoming nucleotide before catalysis. The correct nucleotide, forming a standard Watson-Crick pair with the template base, induces a closed active-site conformation that positions the 3-prime hydroxyl for attack. A mismatch holds the active site open, delaying catalysis and giving the polymerase time to reject the incorrect nucleotide. This selectivity reduces the raw error rate to approximately one mistake per 100,000 insertions.

Second, proofreading: if a mismatched nucleotide is incorporated despite selectivity, the 3-prime to 5-prime exonuclease domain of the polymerase detects the distortion in the growing chain and excises the incorrect nucleotide before synthesis resumes. Proofreading reduces the error rate by roughly another 100-fold, to approximately one error per 10 million bases. Post-replication mismatch repair (MMR) catches the remaining errors: the MMR proteins scan newly synthesised DNA, recognise mispaired bases, excise a segment of the new strand containing the mismatch, and fill in the gap correctly. Together, these layers bring the final mutation rate in human somatic cells to approximately one base change per billion bases per division.

Fidelity layerError rate after this layerMechanism
Nucleotide selection1 in 100,000Conformational proofreading before bond formation
3-prime to 5-prime exonuclease1 in 10,000,000Excision of mismatched 3-prime terminus
Mismatch repair (MMR)1 in 1,000,000,000Post-synthesis excision and re-synthesis of new strand

Inherited defects in MMR genes (MLH1, MSH2, MSH6, PMS2) cause Lynch syndrome, the most common hereditary colorectal cancer predisposition syndrome. Tumours from Lynch syndrome patients show microsatellite instability: the STR loci that are stable in normal tissue accumulate new alleles at high frequency. This is directly relevant to forensic biology: tumour DNA from a Lynch syndrome patient may yield STR profiles at some loci that differ from the constitutional profile obtained from blood.

Categories of mutation

Mutations are classified by the nature and scale of the sequence change. Point mutations affect a single base pair. A transition replaces one purine with the other purine (A for G or G for A) or one pyrimidine with the other (C for T or T for C). A transversion replaces a purine with a pyrimidine or vice versa. Transitions occur more frequently than transversions in spontaneous mutation, because the incorrect nucleotide more readily adopts a geometry that mimics a correct Watson-Crick pair when the change is within the same chemical class.

In a protein-coding context, a point mutation may be synonymous (the altered codon specifies the same amino acid, because the genetic code is degenerate), missense (the codon now specifies a different amino acid), or nonsense (the codon becomes a stop codon, truncating the protein). Synonymous mutations are largely invisible to natural selection and accumulate over time, contributing to the population-level sequence diversity captured by SNP panels.

Insertions and deletions (indels) add or remove one or more base pairs. In a coding region, an indel that is not a multiple of three shifts the reading frame, altering every codon downstream of the change. Frameshift mutations in essential genes are usually lethal when homozygous and are strongly selected against. In non-coding regions, indels can alter regulatory sequences or splice sites without directly changing any protein sequence. STR loci are non-coding, and indels at these loci through replication slippage are the primary source of allele-length variation exploited in forensic DNA profiling.

Spontaneous mutation: sources and mechanisms

Spontaneous mutations arise without external chemical or physical insult. The two main sources are replication errors that escape repair and spontaneous chemical changes in DNA bases. Tautomeric shifts are transient alterations in the electronic structure of a base that change its hydrogen-bonding properties. Adenine normally pairs with thymine, but its imino tautomeric form pairs with cytosine. If the imino form is present during replication, the polymerase incorporates cytosine opposite an adenine template, creating an A-T to G-C transition after one further round of replication.

Depurination is the spontaneous hydrolysis of the N-glycosidic bond linking a purine base (adenine or guanine) to the deoxyribose sugar. Human cells lose an estimated 10,000 purines per cell per day through depurination. The resulting abasic (AP) site has no coding information and stalls DNA polymerase. Translesion synthesis polymerases can insert a nucleotide opposite the AP site, but they do so with low fidelity, usually incorporating adenine. Because the missing base was a purine, this insertion may be incorrect, generating a mutation.

Deamination converts cytosine to uracil by removing the amino group. Uracil pairs with adenine, not guanine, so an uncorrected deamination leads to a C-G to T-A transition. Uracil-DNA glycosylase normally removes uracil from DNA before replication, but if it fails, the mutation is fixed. In forensically relevant aged or degraded biological samples, depurination and deamination are significant sources of miscoding lesions that can generate artifactual base calls during PCR-based sequencing.

Induced mutation: physical and chemical mutagens

Environmental agents increase the mutation rate above the spontaneous background. They are classified by mechanism: agents that chemically modify DNA bases, agents that intercalate between base pairs, and agents that break the phosphodiester backbone.

Ultraviolet (UV) radiation at wavelengths around 260 nanometres is absorbed efficiently by pyrimidine bases. The absorbed energy drives covalent bond formation between adjacent pyrimidines on the same strand. The predominant product is a cyclobutane pyrimidine dimer (CPD), most often thymine-thymine, though cytosine-thymine and cytosine-cytosine dimers also form. CPDs distort the double helix and stall the replicative polymerase. Nucleotide excision repair (NER) is the primary removal pathway. When a CPD is replicated before repair, translesion synthesis polymerases typically insert adenine opposite both thymine residues of the dimer, which is correct for thymine but incorrect if a cytosine was involved, producing a characteristic C to T or CC to TT transition. These UV-signature mutations are used in cancer genomics to assign mutational aetiology, and they are relevant to the degradation of DNA in biological samples left in sunlight.

Alkylating agents transfer alkyl groups to DNA bases. Methyl methanesulphonate (MMS) and similar agents methylate guanine at the O6 position, producing O6-methylguanine. This lesion pairs with thymine instead of cytosine, generating a G-C to A-T transition. N-alkyl adducts on adenine block replication. Many alkylating agents found in tobacco smoke and industrial chemicals are potent mutagens. Ionising radiation (gamma rays, X-rays) deposits energy that generates reactive oxygen species within cells, which attack all four bases and the phosphodiester backbone. Double-strand breaks produced by ionising radiation are particularly mutagenic, because repair by non-homologous end joining is error-prone, and may join the wrong chromosome ends together.

Mutagen classExample agentDNA lesionResulting mutation type
UV radiationSunlight (UVB 290-320 nm)Cyclobutane pyrimidine dimer (CPD)C to T or CC to TT transitions
Alkylating agentN-methyl-N-nitrosourea (MNU)O6-methylguanineG-C to A-T transition
Oxidative damageIonising radiation (gamma)8-oxoguanineG-C to T-A transversion
Intercalating agentAcridine orange, ethidium bromideInsertion into base stackFrameshift (insertion or deletion)
Base analogue5-bromouracil (5-BU)BU incorporated in place of thymineA-T to G-C transition

Mutation, heritable variation, and forensic DNA identification

Forensic DNA profiling rests on the existence of heritable variation at specific genomic loci. That variation is the accumulated product of mutations across many generations. The CODIS STR system used in the United States profiles 20 core loci; equivalent systems operate in the United Kingdom (National DNA Database, expanded from 10 to 16 STR loci in 2014), the European Union (ESS 7-plus-additional loci), and India's National DNA Data Bank, established under the DNA Technology (Use and Application) Regulation Act 2019. Each locus is polymorphic because replication slippage has created many alleles of different repeat-unit counts in the human population.

The probability that two unrelated individuals share the same genotype at all 20 CODIS loci is vanishingly small, typically less than one in a quadrillion, because each locus contributes an independent probability factor that compounds multiplicatively. This statistical power depends on the genetic independence of the loci (the loci are selected from different chromosomes or from well-separated positions on the same chromosome) and on accurate population allele frequency databases for each ethnic group. Mutation rates at STR loci are low enough (approximately 0.1 to 0.2 per cent per locus per generation) that parent-child pairs share the same allele at most loci, but that rate is high enough that a mutation is occasionally seen in paternity cases.

Single nucleotide polymorphisms (SNPs) are point mutations that have become fixed in at least one per cent of the population. The human genome contains more than 10 million common SNPs. SNP panels are used in forensic genetics for ancestry inference (informative SNPs differ systematically in frequency across continental population groups), for kinship analysis when STR typing fails or when samples are highly degraded, and for phenotypic prediction (hair colour, eye colour, skin tone). The Forensic Genetics Policy Initiative and regulatory frameworks in the UK (Protection of Freedoms Act 2012), EU, and the United States (Combined DNA Index System regulations) set limits on which applications of SNP data are permissible in law enforcement contexts. More detail on evidence collection and tissue types is at Scope of Biological Evidence; biotechnology applications including next-generation sequencing of SNP panels are covered at Forensic Biotechnology.

Check your understanding
Question 1 of 4· 0 answered

What is the defining feature of semi-conservative replication that distinguishes it from conservative and dispersive models?

Key Takeaways

  • DNA replication is semi-conservative: each daughter molecule retains one original strand and gains one newly synthesised strand, allowing strand-by-strand verification at every copying cycle.
  • Three fidelity layers, nucleotide selection, 3-prime to 5-prime proofreading, and mismatch repair, reduce the final mutation rate to approximately one error per billion bases per cell division, but they do not eliminate mutation entirely.
  • Mutations range from single-base point substitutions (transitions and transversions) to insertions and deletions (indels and frameshift mutations); replication slippage at repetitive STR loci generates the allele-length variation that forensic profiling exploits.
  • UV radiation creates pyrimidine dimers, alkylating agents create miscoding base adducts, and ionising radiation causes double-strand breaks, each leaving a characteristic mutational signature that can be detected in tumour or aged-sample DNA.
  • The statistical power of STR profiling and the utility of SNP panels for ancestry and kinship analysis depend entirely on population-level variation created by accumulated mutations across human generations.
What does semi-conservative replication mean?
Semi-conservative replication means that when a DNA double helix is copied, each of the two resulting daughter molecules retains one original (parental) strand and one newly synthesised strand. This was confirmed by Meselson and Stahl in 1958 using density-labelling with nitrogen-15. The two strands are separated, and each acts as a template for a complementary new strand built by DNA polymerase.
How accurate is DNA polymerase during replication?
DNA polymerase makes approximately one base-incorporation error per 100,000 nucleotides added before proofreading. Its 3-prime to 5-prime exonuclease proofreading activity then corrects most of these, reducing the raw error to roughly one error per 10 million bases. Mismatch repair after replication brings the final mutation rate down to approximately one error per billion bases per cell division in humans.
What is the difference between a point mutation and an insertion or deletion?
A point mutation is the substitution of one base for another at a single position in the DNA sequence. An insertion adds one or more extra bases, and a deletion removes one or more bases. Insertions and deletions in a coding region shift the reading frame of the codons downstream of the change, which usually disrupts protein function far more severely than most single-base substitutions.
How do environmental mutagens cause DNA mutations?
Environmental mutagens damage DNA by chemically altering bases or blocking replication. Ultraviolet radiation causes adjacent thymine bases to form covalent thymine dimers that stall DNA polymerase and lead to misincorporation during repair. Alkylating agents add chemical groups to bases, changing their base-pairing properties. Ionising radiation breaks the phosphodiester backbone, creating double-strand breaks that are repaired by error-prone pathways.
Why do mutations matter for forensic DNA profiling?
Forensic DNA profiling relies on the heritable variation in short tandem repeat (STR) loci and, for ancestry and kinship analysis, in single nucleotide polymorphisms (SNPs). This variation arose through mutations accumulated over many generations. Identical twins are the only individuals with indistinguishable STR profiles, because their genomes derive from a single zygote before somatic mutations diverge them.

Test yourself on Forensic Biology with free, timed mocks.

Practice Forensic Biology questions

Found this useful? Pass it along.

Share

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.

Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.