Mitochondrial DNA Sequencing on HV1/HV2 and the rCRS Reference

Why mtDNA wins when nuclear DNA is exhausted: the 16,569-base circular mitochondrial genome, the hypervariable regions HV1 and HV2, the revised Cambridge Reference Sequence (rCRS) and the Reconstructed Sapiens Reference Sequence (RSRS), heteroplasmy interpretation, and casework anchors (Romanov identification 1998, Anna Anderson, the Mengele identification 1985).

Last updated: 18 Jun 2026

When a femur recovered from a mass grave has been in the soil for decades and every STR amplification returns nothing, the mitochondrial genome is often the last option. A somatic cell may carry a few hundred to several thousand mitochondria, each with two to ten genome copies, giving mtDNA a copy-number advantage that can exceed 1,000-fold over nuclear DNA. That advantage is why mtDNA recovers profiles from shed hairs, degraded bone, and ancient specimens where nuclear STR is undetectable.

Key takeaways

The forensic mtDNA target is the D-loop control region, specifically HV1 (positions 16024-16365) and HV2 (positions 73-340), which together span approximately 610 base pairs of the 16,569-bp mitochondrial genome.
All differences are reported relative to the revised Cambridge Reference Sequence (rCRS, GenBank NC_012920, Andrews et al. 1999), which is mandatory in EMPOP, ISFG, SWGDAM, and ENFSI protocols.
EMPOP (Institute of Legal Medicine, Innsbruck) contains over 50,000 curated sequences and uses phylogenetic frequency estimation rather than simple counting, making it robust to small database sample sizes.
Heteroplasmy (two coexisting mtDNA sequences within one individual) is tissue-specific; a single-position difference attributable to heteroplasmy is not an exclusion under SWGDAM (2019) and ENFSI guidelines.
mtDNA cannot individualise: all maternally related individuals share the same haplotype, so the forensic statistic is a population frequency, not a random-match probability.

The forensic utility of the mitochondrial genome concentrates in a 1,122-base non-coding stretch called the control region, or D-loop. Within that region, two hypervariable segments, HV1 (approximately nucleotide positions 16024 to 16365) and HV2 (approximately positions 73 to 340), accumulate sequence variation at a rate roughly ten times higher than the nuclear genome average. Sequencing both segments and comparing the result to a reference sequence is the analytical core of every forensic mtDNA examination. The reference in universal use is the revised Cambridge Reference Sequence, published in 1999 and deposited in GenBank as accession NC_012920.

The technique entered forensic practice with landmark casework in the 1990s, but its scientific foundations connect to cell biology, population genetics, and clinical medicine in ways that bear directly on how a profile is interpreted in court. Understanding what an mtDNA sequence can and cannot prove, how heteroplasmy complicates exclusion, and why the copy-number advantage comes with a statistical ceiling are all prerequisites for responsible casework interpretation across jurisdictions, from the AFDIL laboratory at Dover Air Force Base in the United States to the ENFSI mtDNA Working Group member laboratories across Europe.

The Mitochondrial Genome: Architecture and Forensic Advantages

The molecule that solves cases nuclear STR cannot is smaller, circular, maternally inherited, and present in thousands of copies per cell, and each of those properties has a direct forensic consequence.

The human mitochondrial genome is 16,569 base pairs long. It is double-stranded and circular, encoding 37 genes: 13 proteins of the oxidative phosphorylation pathway, 22 transfer RNAs, and 2 ribosomal RNAs. The entire genome, by nuclear standards, is tiny. By forensic standards, its architecture is almost purpose-built for the task.

The first forensic advantage is copy number. A single nucleated cell contains two copies of each nuclear chromosome, but it may contain hundreds to thousands of mitochondria. Under electron microscopy, skeletal muscle cells carry over 1,000 mitochondria; a hair-shaft keratinocyte carries fewer but still in the hundreds. Because each mitochondrion contains multiple genome copies, the total mtDNA molecules per cell can exceed 5,000 even in a hair-shaft that has been shed (losing nuclear DNA) and left exposed to ultraviolet radiation and humidity for months. This is why hair, which lacks a root and therefore has no nuclear DNA, can still yield an mtDNA sequence, as demonstrated in the early FBI casework of the 1990s and confirmed in validation studies by the Armed Forces DNA Identification Laboratory (AFDIL), the US military's central identification laboratory at Dover, Delaware.

The second forensic advantage is maternal inheritance. Mitochondria are transmitted almost exclusively through the maternal egg cytoplasm. Sperm mitochondria are selectively degraded after fertilisation via a ubiquitin-mediated pathway. The consequence is uniparental, maternal inheritance: a child receives their mtDNA haplotype from their mother, who received it from her mother, extending back in time along the matrilineal line without the reshuffling that recombination imposes on nuclear alleles. This makes mtDNA an ideal lineage marker: all maternally related individuals (siblings, half-siblings sharing the same mother, maternal cousins, and so on) carry an identical or near-identical mtDNA sequence. In identification casework, a reference sample from a maternal relative can substitute for a direct reference from the individual.

The third forensic advantage is degradation resistance relative to nuclear DNA. The circular topology of the mitochondrial genome and its location inside the mitochondrial double-membrane provide some physical protection. More importantly, the high copy number means that extensive degradation can reduce template to levels that still permit PCR amplification. In bones recovered from wet soil environments in Southeast Asia (a setting directly relevant to US Vietnam War MIA recovery operations through the Joint POW/MIA Accounting Command, later JPAC and now DPAA), nuclear STR typing routinely fails while mtDNA sequencing succeeds.

The mitochondrial D-loop control region showing the positions of HV1 (positions 16024-16365) and HV2 (positions 73-340) within the 16,569-bp circular genome; the tRNA gene cluster separates the two hypervariable segments. Forensic sequencing targets both regions and compares the result to the rCRS reference.

HV1 and HV2: Sequencing the Hypervariable Regions

A stretch of 610 base pairs within a 16,569-base genome carries most of the forensic discriminating power, and understanding why those positions are hypervariable explains both the technique's value and its limits.

The control region accumulates mutations faster than the coding portions of the mitochondrial genome because it does not encode a functional product and therefore faces less purifying selection. Within the control region, HV1 and HV2 are designated hypervariable because population surveys have found that a disproportionate share of sequence differences between individuals cluster in these two segments. A comparison of the two reference standards, the rCRS and the RSRS, reveals that HV1 and HV2 together carry the majority of the sequence differences between the Cambridge sequence and the reconstructed ancestral human sequence.

Forensic sequencing of HV1 and HV2 proceeds by PCR amplification of the two regions followed by Sanger-method dideoxy chain-termination sequencing. The amplicons are designed to overlap slightly to confirm that the HV1 and HV2 sequences are from the same molecule. Each sequencing read is compared nucleotide-by-nucleotide to the rCRS, and differences are noted as substitutions, insertions, or deletions relative to the reference position numbers. A result might read: 16126C, 16294T, 73G, 263G, 315.1C, meaning that at position 16126 the sample has cytosine where the rCRS has thymine, and so on. This notation system, standardised by the ISFG (International Society for Forensic Genetics) and the ENFSI mtDNA Working Group, is what allows profiles from different laboratories and different countries to be compared in the EMPOP database.

EMPOP (the EMPOP mtDNA Population Database), maintained at the Institute of Legal Medicine at Innsbruck, Austria, is the forensic community's central reference population resource. As of 2024 it contains over 50,000 sequences from populations worldwide, curated to remove artefactual sequences and phylogenetically classified. In the US, the FBI's mtDNA Population Database (a predecessor to EMPOP integration) has been used in casework since the late 1990s. In India, CFSL laboratories performing mtDNA analysis use reference frequency estimates from published Indian population datasets, with the ENFSI mtDNA Working Group guidelines providing the methodological framework. Across the European Union, ENFSI member laboratories contribute sequences to EMPOP under a data-sharing agreement that ensures consistent quality and nomenclature.

A third hypervariable region, HV3 (approximately positions 438 to 576), is sometimes sequenced in cases where the HV1/HV2 profile is insufficiently discriminating, but this is not standard in routine casework. Whole-mitochondrial-genome sequencing, now available via MPS (massively parallel sequencing) as covered in NGS data analysis: allele calling and variant callers, provides the highest resolution: instead of 610 base pairs of HV1/HV2, the analyst sequences all 16,569 positions, dramatically improving discrimination. The Verogen ForenSeq mtDNA Whole Genome Solution is the commercial kit used for this purpose in high-resolution casework in the US, UK, and EU.

The rCRS and RSRS: Navigating the Reference Standards

Every mtDNA forensic report names a reference sequence, and which reference is named, and whether the analyst understands the relationship between the two major standards, determines whether the profile notation is internationally comparable.

The original Cambridge Reference Sequence (CRS) was published by Anderson et al. in Nature in 1981 and was the first complete human mitochondrial genome sequence. It was derived from HeLa cells and from a placental sample of European origin. Because it was published first, the entire forensic community adopted it as the comparator for position numbering, and that number scheme is embedded in decades of published casework, population databases, and laboratory protocols.

Two problems emerged. First, the 1981 sequence contained several errors that were identified when the material was re-sequenced. In 1999, Andrews et al. published the corrected sequence, designated the revised Cambridge Reference Sequence (rCRS), deposited in GenBank as NC_012920. The rCRS is now the mandatory reference in EMPOP and in ISFG guidelines, and all modern forensic reports express differences relative to the rCRS. An analyst using the original 1981 CRS rather than the rCRS will produce a notation that differs at 11 positions, which could cause a false apparent discrepancy in a cross-laboratory comparison.

Second, the rCRS is not a universal human ancestor: it is a modern European sequence, and variants common in Asian or African populations appear as substitutions relative to the rCRS even though they represent ancestral states. Behar et al. (2012) proposed the Reconstructed Sapiens Reference Sequence (RSRS), derived from phylogenetic analysis to represent the most probable ancestral sequence of all living humans. The RSRS is used in academic phylogenetic research and in some population-genetics analyses, but it has not displaced the rCRS in forensic practice because the entire legacy database and nomenclature system is built around rCRS coordinates. ISFG guidelines maintain rCRS as the forensic standard.

In operational casework the implications are practical. A UK laboratory using EMPOP must report results as differences from the rCRS. A US FBI laboratory must use the same convention. An Indian CFSL laboratory reporting under NABL accreditation cross-referencing ENFSI guidelines uses the rCRS. A result from a Vietnamese identification laboratory submitted to DPAA (Defense POW/MIA Accounting Agency) for family comparison must be in rCRS notation to be matched against the AFDIL reference database. Any laboratory that deviates from rCRS notation must include explicit conversion information in its report, or the cross-laboratory comparison is compromised.

Reference	Published	Origin	GenBank accession	Current forensic status
CRS (original)	1981 (Anderson et al., Nature)	HeLa cells + European placenta	J01415 (historical)	Superseded; 11 error positions corrected
rCRS (revised)	1999 (Andrews et al., Nat Genet)	Re-sequenced original material	NC_012920	Mandatory, ISFG, ENFSI, EMPOP, SWGDAM
RSRS	2012 (Behar et al., AJHG)	Phylogenetic ancestral reconstruction	Not a primary accession	Academic research; not used in casework nomenclature

Heteroplasmy: When a Sample Contains Two Different mtDNA Sequences

The assumption of a single mtDNA sequence per individual is almost always correct, but the exceptions, where two sequences coexist in the same sample, can turn an apparent exclusion into an inclusion and require careful interpretive handling.

Heteroplasmy refers to the coexistence of two or more different mtDNA sequences within a single individual. It arises because somatic mutations in the mitochondrial genome can become clonally expanded in a tissue. Heteroplasmy can be present at low frequency (a minor variant that appears as a shoulder on a Sanger sequencing peak), at high frequency (two peaks of comparable height), or anywhere in between. It can be tissue-specific: an individual may be heteroplasmic in hair follicles but homoplasmic in blood, because the two tissues had different lineage histories during development.

The forensic consequence is significant. If a questioned hair and a blood reference sample from the same individual are compared, and the hair carries a heteroplasmic position that the blood sample does not, a naive analyst might call an exclusion. The correct interpretation is that the hair evidence and reference are consistent: the heteroplasmic variant in the hair represents a somatic mutation that has not been clonally expanded to detectable levels in blood. The SWGDAM Interpretation Guidelines for Mitochondrial DNA (2019 revision) and the ENFSI mtDNA Working Group Best Practice Manual both address this: an apparent single-nucleotide difference between a questioned sample and a reference can be attributed to heteroplasmy if the variant has been observed in the relevant tissue type in the published literature, and the analyst must document the reasoning.

Point heteroplasmy (a single position where two nucleotide variants are observed) is more common and easier to interpret than length heteroplasmy, which occurs in the C-stretch at positions approximately 303-315 and 16184-16193. These cytosine-stretch regions are prone to replication slippage, generating length variants (extra or missing cytosine residues) that produce a complex overlapping sequencing pattern. Length heteroplasmy in these regions can be documented as a common variant rather than an artefact, but it requires particular care in Sanger sequencing and is one reason some laboratories prefer MPS for its ability to resolve length variants as distinct reads.

In the identification of Tsar Nicholas II and the Romanov family (1998 confirmation, following the initial 1994 identification work led by Dr Peter Gill and the UK Forensic Science Service), a critical challenge was exactly this: Tsar Nicholas II's mtDNA carried a point heteroplasmy at position 16169 (a mixture of T and C), observed in both his skeletal remains and in a blood sample from his living maternal-line relative Prince Philip, Duke of Edinburgh. The initial 1994 analysis by Dr Gill's team at the Forensic Science Service in Aldermaston treated the result cautiously; by 1998, independent analysis by Russian and US teams confirmed that the heteroplasmy was genuine and consistent with a shared maternal lineage, confirming the identification.

Point heteroplasmy at position 16169: the hair sample shows a mixed T/C peak in Sanger sequencing while the blood reference from the same individual shows only C. A naive comparison would call an exclusion; SWGDAM 2019 and ENFSI guidelines require this to be reported as consistent with the same source.

Landmark Casework: Romanov, Anna Anderson and Mengele

Three high-profile identifications in the 1990s established mtDNA sequencing as a court-grade forensic tool, each requiring a different combination of reference sample strategy, heteroplasmy interpretation, and statistical framing.

The Romanov identification is the textbook example of forensic mtDNA applied to historical remains under genuine scientific uncertainty. The skeletal remains of nine individuals recovered from a shallow grave near Yekaterinburg, Russia, in 1991 were subjected to STR and mtDNA analysis. Dr Peter Gill and colleagues at the UK Forensic Science Service published the primary identification results in Nature Genetics in 1994. The mtDNA from bones attributed to Alexandra Feodorovna, the Tsarina, and her three daughters matched a reference sequence from Prince Philip, Duke of Edinburgh, a maternal-line descendant of Alexandra's mother Princess Alice of Hesse. The Tsar's bones showed a heteroplasmic position 16169 matching both the skeletal remains and a reference blood sample from Prince Philip. The 1998 confirmation by an independent Russian-US team resolved the last doubts and led to the official state reinterment.

Anna Anderson had claimed since 1920 to be the Grand Duchess Anastasia, the youngest Romanov daughter presumed to have escaped the execution. She died in 1984. In 1994, a post-mortem tissue sample from Anna Anderson's intestinal biopsy, preserved at the University of Virginia Medical Center, was available for testing. Dr Mark Stoneking and colleagues at Pennsylvania State University sequenced her mtDNA and compared it to the reference sequence from Prince Philip and to the Romanov bones. The sequences did not match. Instead, Anna Anderson's mtDNA matched that of a Karl Maucher, a living maternal-line descendant of Franziska Schanzkowska, a Polish factory worker who had disappeared around 1920. The identification confirmed that Anna Anderson was Franziska Schanzkowska, not Anastasia. The case is now a standard reference in discussions of the statistical and interpretive value of mtDNA exclusion in historical claims.

The Mengele identification presented a different challenge: remains recovered in Brazil in 1985 without any living direct relative available as a reference. The initial 1985 examination by a team including Dr Clyde Snow and American forensic anthropologists provided anthropological and odontological evidence consistent with Josef Mengele, but the identification was contested. In 1992, Dr Alec Jeffreys (inventor of DNA fingerprinting) and Brazilian scientists conducted mtDNA sequencing on the 1985 bones and compared the result against a blood reference from Rolf Mengele, Josef's son. A son inherits his mitochondrial genome from his mother, who would have transmitted the same sequence to Josef Mengele as well (assuming the same maternal lineage). The mtDNA sequences matched, providing the molecular confirmation that resolved the case. The 1992 Mengele confirmation became an early demonstration that forensic mtDNA analysis with a single indirect reference sample can provide legally meaningful evidence.

Across the Atlantic, the AFDIL laboratory at Dover Air Force Base has processed over 1,800 mtDNA identifications since the late 1990s as part of the US government's Vietnam War MIA recovery programme. Where the case involves male-lineage questions alongside the matrilineal mtDNA evidence, laboratories also deploy Y-STR profiling to cover the paternal-lineage dimension. Recovery missions in Vietnam, Laos, and Cambodia retrieve skeletal fragments from crash sites and graves; nuclear STR is routinely unavailable because the remains have been in tropical soil for decades. AFDIL's mtDNA casework protocol, which uses whole-mitochondrial-genome sequencing for higher resolution, is the largest systematic application of forensic mtDNA sequencing in history.

Population Databases, Statistical Reporting and Admissibility

An mtDNA match without a frequency estimate is not forensic evidence; the number that gives the match its legal weight comes from a curated, peer-reviewed population database, and how that number is calculated and reported determines whether the expert survives cross-examination.

The forensic statistic for an mtDNA match is the profile frequency: the proportion of individuals in a relevant reference population who carry the same sequence (or, more precisely, the proportion who cannot be excluded as the source). This is fundamentally different from the random-match probability in autosomal STR analysis. Because all maternally related individuals share the same haplotype, the mtDNA result cannot individualise; it can only include or exclude. The frequency estimate bounds the inclusion: if the observed haplotype occurs in 1 in 1,000 individuals of the relevant population, the examiner can say only that 1 in 1,000 unrelated individuals could not be excluded, not that the sample came from a specific person.

Population frequency estimates come from EMPOP, the primary curated forensic mtDNA database maintained at the Institute of Legal Medicine, Innsbruck. EMPOP uses a phylogenetic method for frequency estimation rather than simple counting: sequences are placed on the human mtDNA phylogenetic tree, and frequency estimates are derived from the population distribution of the haplogroup and sub-haplogroup to which the profile belongs. This approach reduces the sensitivity to small reference sample sizes and is the standard endorsed by ISFG and ENFSI.

In the United States, the FBI's population database preceded EMPOP and has been used in federal and state courts under Daubert and state equivalents. Expert testimony on mtDNA frequency has been challenged in multiple US cases; the most significant was United States v. Beverly (2003) and a series of state-level Daubert hearings in the early 2000s. In the UK, ENFSI member laboratories contribute to EMPOP and report frequency estimates from the European dataset stratified by haplogroup. In India, CFSL Hyderabad and CFSL New Delhi perform mtDNA analysis, and published Indian population datasets provide the reference frequencies, though the absence of a national mtDNA database analogous to NDNAD or EMPOP remains a gap in the quality framework.

The minimum reporting standard, required by SWGDAM (US), ENFSI (EU), and the Forensic Science Regulator (UK), is that any mtDNA report that results in an inclusion must include a frequency estimate from a relevant population database, the number of sequences in the reference database, and the method of frequency calculation. A qualitative inclusion statement without a frequency number does not meet any of these standards. In Indian NABL-accredited labs, the reporting requirement follows ISO/IEC 17025 section 7.8, which mandates that results be reported with their associated uncertainty, which in the mtDNA context is the frequency estimate and its database provenance.

Key terms

Hypervariable region (HV1, HV2): Two segments within the mitochondrial D-loop that accumulate sequence variation faster than the coding mitochondrial genome. HV1 spans positions 16024-16365; HV2 spans positions 73-340. Both are routinely sequenced in forensic casework.
rCRS: Revised Cambridge Reference Sequence, the corrected 1999 version of the first complete human mtDNA sequence (GenBank NC_012920). All forensic mtDNA profiles are reported as differences from rCRS positions. Mandatory in ISFG, ENFSI, and EMPOP.
RSRS: Reconstructed Sapiens Reference Sequence (2012). A phylogenetically derived ancestral human mtDNA sequence used in academic population genetics but not in forensic casework nomenclature, where rCRS remains the standard.
Heteroplasmy: The presence of two or more different mtDNA sequences within a single individual or tissue. Point heteroplasmy appears as a mixed peak in Sanger sequencing; length heteroplasmy produces overlapping patterns in the cytosine-stretch regions. Must be distinguished from false exclusion.
EMPOP: EMPOP mtDNA Population Database, maintained at the Institute of Legal Medicine, Innsbruck, Austria. Contains over 50,000 curated mtDNA sequences and uses phylogenetic frequency estimation. Primary forensic reference for ISFG and ENFSI member laboratories.
AFDIL: Armed Forces DNA Identification Laboratory, at Dover Air Force Base, Delaware. The US military's central identification laboratory, responsible for mtDNA-based identification of MIA remains from Vietnam, Korea, and World War II.

Worked example

Shed Hair in a Cold Case, mtDNA Typing When No Root Is Present

A 1987 UK murder case is reopened. The only biological evidence is a single shed hair found on the victim's clothing. There is no nuclear DNA. Can the hair still identify the contributor?

Scene: A cold-case review unit examines a single unpigmented hair preserved in a crimped paper packet since 1987. There is no visible root bulb. The hair is from a Caucasian individual, approximately 5 cm in length. STR profiling using GlobalFiler returns no allele calls at any of the 21 loci. The analyst submits the hair for mtDNA analysis.

Step 1 (Extraction): Decontamination of the hair shaft surface by sequential washes in bleach (5%), ethanol (70%), and UV irradiation for 15 minutes per face to destroy surface contaminant DNA. The interior of the hair shaft is pulverised in liquid nitrogen and extracted using PrepFiler BTA with a 48-hour lysis at 56°C.

Step 2 (Sanger sequencing): PCR amplification of HV1 (positions 16024-16365) and HV2 (positions 73-340) using 4-6 overlapping primer pairs covering 100 bp amplicons. Products are Sanger-sequenced on the ABI 3500 and assembled using Sequencher. Comparison to rCRS (NC_012920) reveals 7 differences: 16093T, 16147C, 16223T, 16259C, 16278T, 16311C, and 263G. EMPOP placement confirms haplogroup H1c.

Step 3 (Database query): The EMPOP phylogenetic frequency estimation for this 7-position profile in the Northwest European population database gives a frequency of approximately 1 in 450 (0.22%). The haplotype is not rare but is not common either: roughly 1 in 450 unrelated individuals of Northwest European ancestry would have this profile and cannot be excluded as the source.

Step 4 (Suspect comparison): A reference buccal swab from the suspect, now 68 years old, is submitted. His HV1/HV2 sequence differs from the hair profile at position 16311: the reference shows a mixture of C and T (point heteroplasmy). SWGDAM heteroplasmy guidelines confirm that the hair (16311C homoplasmic) is consistent with the suspect carrying the same lineage with tissue-specific heteroplasmy. The report records an inclusion with a 1 in 450 frequency.

Conclusion: The case demonstrates the key features of forensic mtDNA: copy-number advantage enabling profiling from shed hair, rCRS-based nomenclature, EMPOP frequency estimation, and the heteroplasmy interpretation rule that prevents false exclusions. The result is inclusion, not individualisation, and the frequency estimate is the essential number that gives it evidential weight.

Frequently asked questions

Why can mtDNA not individualise a contributor the way autosomal STR profiling can?

Mitochondrial DNA is maternally inherited as a single non-recombining unit. Every individual in a maternal lineage, mother, children, maternal aunts, and cousins going back generations, carries an identical or near-identical haplotype. An mtDNA match cannot distinguish between any two people in the same maternal line. The forensic statistic is a population frequency from EMPOP (what proportion of the relevant population shares this haplotype), not a random-match probability of the kind calculated for autosomal STR profiles. Compare this with the equivalent limitation of [Y-STR profiling](/topics/forensic-biotechnology/y-str-and-x-str-profiling-for-lineage-and-pedigrees) for the paternal line.

What is heteroplasmy and when does it cause a false exclusion in mtDNA casework?

Heteroplasmy is the coexistence of two or more different mtDNA sequences within one individual, arising from somatic mutations clonally expanded in some tissues but not others. In Sanger sequencing, point heteroplasmy appears as a double peak at one position. The forensic challenge: a hair may be homoplasmic for one variant while a blood reference from the same person is homoplasmic for the other, or shows a mixture. SWGDAM Mitochondrial DNA Interpretation Guidelines (2019) and ENFSI best practice specify that a single-position difference attributable to heteroplasmy is not an exclusion; the profiles are treated as consistent with the same source.

How does EMPOP estimate mtDNA haplotype frequency and why is phylogenetic estimation better than simple counting?

EMPOP (over 50,000 curated sequences, Institute of Legal Medicine, Innsbruck) places a profile on the human mtDNA phylogenetic tree and derives the frequency from the haplogroup and sub-haplogroup population distribution. Simple counting divides exact matches by database size; this is sensitive to small samples and rare haplotypes seen only once. The phylogenetic method is more robust because even a haplotype never before observed still belongs to a haplogroup with a known distribution. ISFG, ENFSI, and SWGDAM all endorse the phylogenetic method as the standard for forensic reports.

Which reference sequence is mandatory for forensic mtDNA reporting and what is the difference between rCRS and RSRS?

The rCRS (revised Cambridge Reference Sequence, GenBank NC_012920, Andrews et al. 1999) is mandatory in EMPOP, ISFG, SWGDAM, ENFSI, and the UK Forensic Science Regulator's Codes of Practice. All mtDNA differences are reported relative to rCRS positions regardless of haplogroup. The RSRS (Reconstructed Sapiens Reference Sequence, Behar et al. 2012) is a phylogenetically derived ancestral sequence used in academic population genetics; it has not replaced rCRS in any forensic jurisdiction. Using the original 1981 CRS rather than rCRS introduces errors at 11 positions and would compromise cross-laboratory comparisons.

Practice

Question 1 of 5· 0 answered

A shed hair without a root yields no result on a nuclear STR multiplex but produces a readable mtDNA sequence. The most likely reason for this difference in recovery is:

Test yourself on Forensic Biotechnology with free, timed mocks.

Practice Forensic Biotechnology questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Frequently asked questions

Your journey to becoming a forensic professional starts here.