Capillary Electrophoresis and Electropherogram Interpretation

How a multiplex PCR product becomes a numbered allele call: capillary electrophoresis on 3500 and 3130 platforms, the spectral matrix and dye channels, allelic ladder calibration, and the artefacts (stutter, pull-up, off-ladder alleles, drop-out, drop-in) that an examiner reads off the electropherogram.

Last updated: 18 Jun 2026

After STR amplification, the labelled fragment mixture enters capillary electrophoresis, which separates fragments by length at single-base resolution, measures their fluorescence, and writes the result as a coloured trace on screen. That trace, the electropherogram, is the primary record every examiner reads before calling a genotype.

Key takeaways

The ABI 3500xL uses POP-7 polymer and a 505 nm solid-state laser to separate STR fragments at single-base resolution across up to 24 capillaries simultaneously.
A spectral matrix must be recomputed after every capillary replacement; an outdated matrix is the direct cause of pull-up (bleed-through) artefacts and false allele calls.
Allele numbers are assigned by matching sample peaks to the co-injected allelic ladder, making designations instrument-independent and comparable across laboratories.
Stutter peaks appear one repeat unit below each genuine allele and are filtered using validated locus-specific stutter ratio thresholds (typically 5-15% for tetranucleotide loci).
Peaks above the analytical threshold but below the stochastic threshold are in an interpretive grey zone requiring probabilistic genotyping rather than binary allele calling.

Capillary electrophoresis replaced the silver-stained polyacrylamide gel in the late 1990s. The platforms used worldwide today are the Applied Biosystems (ABI) 3500, 3500xL, 3130, and 3130xl genetic analysers. The 3500 and 3500xL are the current-generation instruments, running eight or twenty-four capillaries in parallel; the 3130 and 3130xl remain in operational use across many laboratories in Australia, India, Pakistan, South Africa, and parts of South America that have not yet completed instrument refresh cycles. Analysis software, typically GeneMapper ID-X (US, AU, UK), OSIRIS (developed by NIST), or FaSTR DNA (a more recent probabilistic-aware front end), converts raw fluorescence data into allele calls indexed against an allelic ladder.

Reading an electropherogram correctly is a core forensic DNA competency. The trace is never clean. Stutter peaks, pull-up artefacts, off-ladder alleles, and sporadic drop-in peaks share real estate with genuine alleles. An examiner who misreads stutter as a second contributor, or who ignores drop-out on a low-template sample, can report either an inflated contributor count or a misleadingly clean single-source profile. Courts in the United States, the United Kingdom, and Australia have challenged DNA evidence on exactly this interpretive step, not on the chemistry behind it. Complex traces with multiple contributors require mixture deconvolution and probabilistic genotyping, and the resulting profile enters a CODIS 20 or ESS17 database.

The Capillary Electrophoresis Instrument

Inside a glass capillary no wider than a human hair, a forensic identity is untangled one base pair at a time.

A capillary electrophoresis genetic analyser resolves DNA fragments through a polymer-filled glass capillary, typically 36 cm or 50 cm in length, under a high-voltage electric field. The polymer, POP-4 on the 3130 and POP-7 on the 3500 series (the 3500 also accepts POP-6 for certain fragment-analysis applications), acts as the sieving matrix. Smaller fragments migrate faster and reach the detection window first; larger fragments arrive later. The separation resolution is sufficient to distinguish fragments differing by a single base pair, which is the minimum discriminating unit for STR alleles that share a repeat unit but differ in flanking sequence.

The 3500 and 3500xL instruments carry eight and twenty-four capillaries respectively, running simultaneously in a single electrophoresis run. This parallelism is why a busy forensic laboratory processing case-work can run twenty-four samples with an allelic ladder and size standard in a single injection rather than reinjecting after every eight. The 3130xl runs sixteen capillaries; the legacy 3130 runs four. Indian Central Forensic Science Laboratories (CFSLs) and many State FSLs in India operate a mix of 3130xl and 3500 instruments; the UK Forensic Science Service successor laboratories (Eurofins Forensics UK, LGC Forensics) have largely standardised on 3500xL; the FBI's Combined DNA Index System (CODIS) laboratories in the United States report validation data against 3500xL with GlobalFiler.

Detection is by laser-induced fluorescence. The 3500 series carries a 22 mW, 505 nm solid-state laser. Fluorescently labelled PCR products pass the detection window and emit light at wavelengths characteristic of their dye. The CCD (charge-coupled device) camera captures the emission spectrum. Because multiple dye channels are active simultaneously, a critical pre-run step is spectral calibration to separate the overlapping emission spectra of the different dyes.

Capillary electrophoresis signal flow: polymer-filled capillary separates labelled fragments by size; laser excites fluorescence at the detection window; CCD captures emission for all dye channels simultaneously.

Spectral Matrix and the Dye Channels

Five dyes, five colours, one capillary, the spectral matrix is the arithmetic that keeps them from bleeding into each other.

Modern STR multiplex kits use five or six fluorescent dyes to label primer sets across the panel. Applied Biosystems chemistry uses the FAM (blue), VIC (green), NED (yellow), PET (red), and LIZ (orange) dye system for five-dye kits such as Identifiler Plus and GlobalFiler. Promega kits (PowerPlex Fusion, PowerPlex 18D) use FL (blue), JOE (green), TMR (yellow), CXR (red), and WEN (orange). Each dye emits over a slightly different emission spectrum, but the spectra overlap significantly.

The spectral matrix (also called the spectral calibration matrix or colour separation matrix) is a correction factor that decouples these overlapping signals. Before a new capillary array is used, or after a routine maintenance event, the laboratory runs a matrix standard that contains all dyes at equal concentration and no DNA. The instrument software (Data Collection on the 3500 series) computes the overlap correction coefficients from that standard run. Every subsequent electropherogram is mathematically corrected by matrix multiplication to yield five separate, idealised dye channels.

If the matrix is outdated, misapplied, or computed on a poorly prepared standard, the correction is imperfect. The result is spectral pull-up (or bleed-through): a tall peak in one dye channel generates a spurious smaller peak in an adjacent channel at the same size position. Pull-up peaks are a major source of false allele calls in inexperienced hands and are the mechanism behind the Adam Scott laboratory contamination case in the UK in 2011, though that case also involved a physical cross-contamination event at sample preparation.

In the United Kingdom, the Forensic Science Regulator's Codes of Practice and Conduct (version 6, 2021) require that matrix correction is validated as part of the laboratory's accreditation package under the Forensic Science Regulator Act 2021. In the United States, SWGDAM's 2017 guidelines require that the matrix be rerun after every capillary replacement or polymer change. In India, FSL laboratories operating under NABL accreditation (ISO/IEC 17025:2017) are expected to follow instrument manufacturer validation protocols, which carry the same requirement.

Allelic Ladder and Size Calibration

Every allele number a jury hears was assigned by comparison with a ladder that contains the entire known allele range for that locus.

An allelic ladder is a mixture of commonly observed alleles at each locus in the STR panel, all amplified and pooled at near-equal concentrations. The ladder is co-injected in each run alongside case samples. The analysis software matches each sample peak to the corresponding ladder peak by size, and from that size match assigns the allele designation: 14, 15, 16.2, 17, and so on.

This two-step process (size by internal lane standard, then allele call by ladder comparison) is what makes the system robust to minor run-to-run variation in migration time, which changes slightly with polymer age, temperature, and capillary condition. The internal lane size standard, typically GeneScan 500 LIZ, GeneScan 600 LIZ v2.0, or WEN ILS 500 (Promega), is a set of fragments of known size that is co-electrophoresed in every capillary in every run. The software fits a local size-calling curve from the size standard peaks, which converts migration time to fragment size in base pairs. Allele assignment then matches sample peaks to the ladder's size-calibrated positions.

Alleles that fall outside the ladder range are termed off-ladder (OL) alleles. Off-ladder calls must be verified: the peak may be a true rare allele (a variant allele), a stutter artefact on the low-size side of a genuine allele, or a pull-up artefact from an adjacent channel. Microvariants (e.g. D21S11 allele 29.2, which has three full TCTA repeats and one partial TCTA repeat giving a non-integer allele size) are legitimate off-ladder calls and are reported with their decimal designation. The SWGDAM guidelines and the European DNA Profiling Group (EDNAP) have published population data for the most common microvariants at panels including D21S11, TH01, and FGA.

1. Run internal size standard
GeneScan 500 LIZ or WEN ILS 500 is co-injected in each capillary. Software fits a local size-calling curve mapping migration time to fragment length in base pairs.
2. Co-inject allelic ladder
The kit-supplied ladder (e.g. GlobalFiler Allelic Ladder) is run in at least one capillary per plate. Ladder peaks are sized against the same internal standard.
3. Match sample peaks to ladder
GeneMapper ID-X or OSIRIS assigns an allele number to each sample peak by matching its sized position to the nearest ladder peak within a defined bin window (typically ±0.5 bp).
4. Review off-ladder calls
Peaks outside bin windows are flagged OL. The examiner checks raw data for microvariant, stutter, or artefact origin before reporting.
5. Apply analytical threshold
Peaks below the analytical threshold (AT), validated for each kit and instrument combination, are not called as alleles and are not reported.

Stutter: The Artefact Every Examiner Must Know

Stutter is a shadow of a real allele, always one repeat unit smaller, and it is the single most common cause of misinterpreted electropherograms.

Stutter peaks are a byproduct of PCR amplification. The DNA polymerase occasionally slips on the repeat unit during extension, producing a product one repeat unit shorter (n-4 stutter at a TPOX tetranucleotide locus, n-3 at a trinucleotide) or, less commonly, one unit longer (plus-stutter). Stutter peaks consistently appear immediately below (and occasionally above) each genuine allele peak in the electropherogram.

Stutter ratios, the ratio of the stutter peak height to the parent allele peak height, are locus-specific and have been characterised for every locus in every major STR kit. Typical stutter ratios are in the range of 5 to 15 percent for most CODIS core loci. A peak below the stutter ratio threshold for that locus and that allele size is treated as a stutter artefact rather than a genuine allele. SWGDAM's 2017 guidelines and the ENFSI DNA Working Group's 2016 document both require that stutter thresholds be validated empirically during kit and platform validation, not taken from the manufacturer's documentation alone.

The interpretive complication arises in mixture samples: if a genuine allele from a minor contributor happens to fall at the stutter position of a major contributor's allele at the same locus, the minor contributor's allele may be masked or misclassified. This is the central problem that probabilistic genotyping software was designed to address, and is covered in depth in mixture deconvolution and probabilistic genotyping.

In the UK, R v. Adams (No 2) (1998) involved, among other issues, the question of how statistical weight was assigned to DNA evidence when the raw data required interpretive judgment. The court was not dealing with stutter per se, but the case established the principle that the examiner's interpretive decisions are part of the evidence and must be disclosed. In Australia, the Forensic Science Regulator equivalent, NATA (National Association of Testing Authorities), audits stutter threshold validation records as a standard accreditation item.

Drop-Out, Drop-In and the Stochastic Threshold

When a template is scarce enough that individual alleles fail to amplify or random fragments amplify by chance, the trace can no longer be taken at face value.

Drop-out is the failure of an allele to amplify to a detectable level. At low template amounts, the Poisson statistics of PCR starting material mean that a given allele may be present in only one or two initial copies. If those copies fail to enter the reaction at the denaturation step, that allele produces no signal above the analytical threshold. The result is an apparent homozygote at a locus where the true genotype is heterozygous, or a locus where only one of two genuine alleles is reported.

Drop-in is the complement: the sporadic amplification of a low-level contaminant allele that was not present in the original evidence. Drop-in peaks are characteristically small (typically below 200 relative fluorescence units, RFU, in most validated workflows), appear at only one or two loci across the profile, and do not recur in re-amplifications. The distinction between a minor contributor's genuine allele and a drop-in peak is a practical challenge in every high-sensitivity forensic workflow.

The stochastic threshold (ST) is the minimum peak height above which an allele is considered reproducibly amplified and therefore interpretable. Peaks between the analytical threshold and the stochastic threshold are visible but uncertain: they may represent genuine alleles that are dropping towards zero, or they may be artefacts. The SWGDAM guidelines require that the ST be empirically determined from the laboratory's own data and documented in the laboratory's validated interpretation guidelines. UK laboratories working under the Forensic Science Regulator's codes apply a similar concept under the term probabilistic threshold, and Australian NATA-accredited labs document it as part of the validation package for each kit.

Artefact	Appearance on Electropherogram	Common Cause	Interpretive Action
Stutter (n-1)	Peak 4 bp below genuine allele, typically 5-15% of parent height	Polymerase slippage during PCR	Apply validated stutter ratio filter; flag if near minor contributor height
Pull-up / bleed-through	Spurious peak in adjacent dye channel at same size as a tall peak	Imperfect spectral matrix correction	Check raw spectral data; rerun matrix calibration if recurrent
Off-ladder allele	Peak outside allele bin window	Microvariant, degradation artefact, or pull-up	Check raw data; consult population microvariant data; consult lab SOP
Drop-out	Missing allele at a heterozygous locus	Insufficient template; inhibition; degradation	Report as potential drop-out; consider increasing DNA input or re-extraction
Drop-in	Low-height isolated peak at one or two loci only	Sporadic low-level contamination	Note in report; consider probabilistic genotyping; re-amplify to test reproducibility

Four electropherogram artefacts at a glance: stutter sits 4 bp below its parent and is filtered by ratio; pull-up mirrors a tall peak in the wrong dye channel and flags a stale spectral matrix; drop-out leaves a locus apparently homozygous from template shortage; drop-in delivers a lone, non-reproducible low-height peak from trace contamination.

The Interpretive Record and Chain of Custody

The electropherogram is not an output file, it is evidence, and every decision the examiner makes on it is part of the court record.

Every interpretation decision the examiner makes on the electropherogram must be documented. This includes: which peaks were called as alleles and why, which peaks were treated as stutter or artefact and why, which loci were treated as potential drop-out loci, and whether the stochastic threshold was triggered. In accredited laboratories, this documentation typically lives in a case notes file that is retained alongside the raw data files (FSA files on ABI instruments) and is subject to discovery in any criminal or civil proceeding.

In the United States, the FBI's Quality Assurance Standards (QAS) for Forensic DNA Testing Laboratories require that raw data be retained in a recoverable format and that the case file document all interpretive decisions. CODIS upload requires a declaration that the profile was generated by a validated and QAS-compliant process. The allele calls from the electropherogram feed directly into the random match probability and likelihood ratio calculation reported to the court. In England and Wales, the Criminal Procedure and Investigations Act 1996 (CPIA) requires disclosure of all material that could assist the defence, including the raw electropherogram data and any notes on artefacts encountered. The 2021 Forensic Science Regulator Act put that disclosure obligation on a statutory footing for the first time.

In India, the Bharatiya Sakshya Adhiniyam 2023 (BSA § 39, replacing IEA § 45) governs the admissibility of expert opinion, and forensic DNA testimony routinely involves the production of printed electropherograms and instrument output files as supporting exhibits. Indian FSL laboratories operating under CBI or CFSL authority maintain case files that include the raw electropherogram printouts. The challenge, documented in Lok Sabha committee reports on the DNA Technology Bill, is that chain-of-custody documentation for instrument outputs is not yet uniformly standardised across State FSLs, a gap the proposed DNA profiling rules would address.

Key terms

Capillary electrophoresis (CE): A separation technique that resolves DNA fragments by size through a polymer-filled glass capillary under an applied electric field. Single-base resolution is achievable under optimised conditions.
Allelic ladder: A multi-allele reference standard containing the full range of known alleles at each STR locus in the panel, co-injected with samples to allow allele designation by size comparison.
Spectral matrix: A mathematical correction factor that decouples the overlapping emission spectra of the fluorescent dyes used in a multiplex STR kit. Must be computed fresh for each capillary array.
Stutter peak: A PCR artefact appearing one repeat unit below (and occasionally above) a genuine allele peak, caused by polymerase slippage. Typically 5-15% of the parent allele height.
Pull-up (bleed-through): A spectral artefact where a tall peak in one dye channel generates a spurious smaller peak in an adjacent dye channel at the same size, due to incomplete spectral matrix correction.
Drop-out: Failure of a genuine allele to amplify above the analytical threshold, typically due to low or degraded template. Creates apparent homozygosity at a heterozygous locus.
Drop-in: Sporadic appearance of a low-level peak not reproducibly present in re-amplifications, typically caused by trace contamination. Distinguished from genuine minor-contributor alleles by peak height and locus distribution.
Stochastic threshold (ST): A validated minimum peak height below which allele calls are considered uncertain due to stochastic amplification effects. Profiles with peaks between the analytical threshold and the ST require probabilistic interpretation.
GeneMapper ID-X: Applied Biosystems analysis software for STR genotyping. Current-generation tool used in the majority of US, UK, and Australian forensic DNA laboratories.
OSIRIS: Open-source STR analysis software developed by NIST (US National Institute of Standards and Technology), used as an independent or supplementary analysis tool in several accredited laboratories.

Practice

Question 1 of 5· 0 answered

An examiner observes a peak at 162 bp in the blue (FAM) dye channel of an electropherogram that also shows a peak of equal height at 162 bp in the green (VIC) channel. The most likely cause is:

Worked example

Low-Template Touch DNA, Interpreting a Degraded Electropherogram from a Knife Handle

A kitchen knife recovered at a stabbing scene yields an electropherogram with stutter above threshold, two suspected minor contributors, and an off-ladder allele at D21S11. How does the examiner read the trace?

Scene: A UK casework laboratory receives a kitchen knife seized from a suspect's flat. The item is submitted for touch DNA recovery from the handle. The examination unit applies a tape-lift, which is then extracted using PrepFiler BTA. Quantification shows 0.06 ng/µL of human-specific DNA.

Step 1 (Amplification): GlobalFiler kit, 29 cycles, 0.5 ng input. The 3500xL produces an electropherogram with 5-colour data across all 21 autosomal loci plus Amelogenin and SE33.

Step 2 (Artefact interpretation): At D8S1179, a peak at 12 RFU is noted 4 bp below the 14 allele. The analyst calculates 12/480 = 2.5% stutter ratio, below the validated 15% stutter threshold, and calls only the 14 allele. At D21S11, a peak falls at a position 0.2 bp above the 29 bin. Cross-referencing the D21S11 microvariant table identifies this as 29.2, a known partial-repeat microvariant with full documentation in NIST STRBase and EMPOP literature.

Step 3 (Contributor assessment): Peak height imbalance across multiple loci and the presence of three and four peaks at several heterozygous loci suggest a mixture with at least two contributors. The analyst documents the electropherogram as unsuitable for binary interpretation and submits the data to STRmix v2.6 for probabilistic deconvolution.

Step 4 (Reporting): STRmix returns a two-contributor model (probability ratio of two versus three contributors is 99:1). The primary contributor profile matches the reference sample of the first suspect. The LR of 4.5 × 10^8 is reported under the UK Forensic Science Regulator's Codes of Practice, with full documentation of the STRmix validation version, input parameters, and electropherogram observations.

Conclusion: The electropherogram reading, artefact recognition, and the decision pathway from binary to probabilistic interpretation directly reflect the capillary electrophoresis principles in this topic. Stutter thresholds, microvariant databases, and mixture indicator recognition all converge before a single allele call reaches the report.

Frequently asked questions

What is stutter in an STR electropherogram and how is it distinguished from a real allele?

Stutter is a PCR artefact from polymerase slippage on the repeat tract, producing a product one repeat unit shorter (4 bp for tetranucleotide loci) than the main allele. Laboratories validate a stutter threshold, typically 10-15% of the parent peak height; any peak below that threshold at the expected stutter position is called as artefact. A peak exceeding the threshold or falling at a non-stutter position requires a separate explanation and may indicate a minor contributor, which is where probabilistic genotyping takes over.

What causes pull-up artefacts on an electropherogram and how are they fixed?

Pull-up (bleed-through) occurs when the spectral matrix is inaccurate, causing a tall peak in one dye channel to generate a ghost peak in the adjacent channel at the same size position. It is most common after capillary replacement if the matrix standard was not rerun. Fix: recompute the matrix with the current capillary array, then reanalyse the data. The diagnostic signature is a spurious peak at exactly the same base-pair position as a real high-intensity peak in an adjacent channel.

Why is an allelic ladder needed in STR typing if an internal size standard is already used?

The internal size standard (e.g. GeneScan 500 LIZ) converts migration time to fragment length in base pairs, but that conversion varies slightly between runs, instruments, and polymer batches. The allelic ladder, containing all common alleles for every locus, is co-injected in the same run and provides a size-calibrated reference that translates a corrected fragment size into an allele name. This two-step process makes allele designations instrument-independent and directly comparable across laboratories worldwide.

Can a profile from the ABI 3500xL be uploaded directly to CODIS?

Yes, provided the profile covers all 20 core CODIS loci and the laboratory holds NDIS enrolment with FBI QAS accreditation. GlobalFiler types all 20 CODIS core loci in a single run. The profile is exported from GeneMapper ID-X in CODIS-compatible XML format. Partial profiles (fewer than 20 loci) can be uploaded as partial hits; minimum match thresholds for partial profiles are defined in the FBI NDIS Operating Procedures. For detail on locus standards, see the [CODIS 20, ESS17, and India NDIS locus standards](/topics/forensic-biotechnology/codis-20-ess17-and-india-ndis-locus-standards) topic.

Test yourself on Forensic Biotechnology with free, timed mocks.

Practice Forensic Biotechnology questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Frequently asked questions

Your journey to becoming a forensic professional starts here.