Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
The documented sources of error in forensic odontology, including contextual bias in comparison tasks, well-publicised casework failures, and the mitigation strategies of blind verification, linear-up testing, and proficiency programmes.
Last updated:
Forensic comparison disciplines assume that a trained examiner looking at the same evidence will reach the same conclusion. That assumption turns out to be much harder to defend than it looks. When researchers set up controlled studies, giving different examiners the same bite-mark photographs with different contextual stories attached, the conclusions can flip. The mark that was a 'positive identification' when the examiner was told the suspect had already confessed becomes 'inconclusive' when the examiner is told the suspect has a strong alibi.
This is not a matter of dishonesty. Cognitive bias is a structural feature of human perception, not a character flaw. The problem is that forensic science spent decades building its reputation on the claim of objectivity, which made it slow to acknowledge that its human practitioners were subject to the same perceptual pressures that psychology had been documenting in eyewitness memory, medical diagnosis, and financial forecasting for a generation.
This topic covers the main bias types that have been demonstrated in forensic odontology contexts, the error cases that drew the most serious professional and judicial attention, and the operational countermeasures, blind verification, linear-up comparison, and proficiency testing, that represent the field's current best practice for managing these risks.
Every comparison is also a hypothesis-test, and humans are biased hypothesis-testers.
Itiel Dror, a cognitive neuroscientist at University College London, has spent two decades studying how context corrupts forensic comparison judgments. His 2006 experiment with latent fingerprint examiners showed that the same pairs of prints, presented under different contextual frames, elicited contradictory conclusions from the same examiners. Variants of that design have since been replicated in fingerprints, footwear marks, and bite-mark analysis.
Several bias types recur across the odontology literature. They are worth naming precisely because the mitigation strategy for each is different.
The failures that changed how the field looks at itself.
The errors that attracted the most scrutiny came in two categories: false identifications in disaster victim identification (DVI) settings, and false bite-mark attributions leading to wrongful convictions. Both categories produced formal investigations and drove reforms.
In DVI work, the most studied failure involved misidentifications in the aftermath of the 2004 Indian Ocean tsunami. The sheer scale (an estimated 220,000 dead across 14 countries) and the pressure to return remains to families quickly created conditions in which confirmation bias could operate powerfully. Post-event reviews found cases where dental comparison was completed too quickly with poor-quality antemortem records and without proper reconciliation. Some families received the wrong remains.
In criminal contexts, the most consequential errors involved bite-mark testimony in homicide cases. Beyond Ray Krone (Arizona, 1992, exonerated 2002 by DNA), the cases of Jimmy Ray Bromgard (Montana, exonerated 2002), Steven Avery's first wrongful conviction (Wisconsin, 1985, exonerated 2003), and several cases from Mississippi in the 1990s linked to a single forensic odontologist, Michael West, who developed an idiosyncratic technique called ABFO No. 2 Scale that was never subjected to peer review or proficiency testing.
The simplest reform: don't tell the second examiner what the first one said.
Blind verification is the quality control procedure with the clearest evidence base in forensic science. The principle is straightforward: a second examiner reviews the same evidence independently, without being told the first examiner's conclusion. If both reach the same conclusion, confidence is higher. If they disagree, the disagreement itself is informative data about the evidence, not an error to be suppressed.
In a non-blind review, the second examiner is told the first conclusion before starting. Studies consistently show that this produces high agreement not because the conclusions are reliable but because the second examiner anchors on the first. The apparent inter-examiner agreement is artefactual. Blind review breaks this anchoring by design.
| Procedure | What examiner 2 knows | Risk | Use in accredited labs |
|---|---|---|---|
| Open review | First examiner's conclusion and reasoning | High anchoring; agreement is inflated | Declining; flagged by FSR and ABFO |
| Blind review | Evidence only; no prior conclusion | Low anchoring; disagreement surfaces genuine uncertainty | Recommended; required by ISO 17025 in practice |
| Linear-up | Set of candidates; target unknown | Reduces target-present bias in bite-mark work | Recommended for bite-mark comparisons post-2016 |
Implementing blind review requires a case management procedure that withholds the first conclusion until the second examiner has submitted their independent finding. In busy operational settings this can feel cumbersome. The evidence from fingerprint and DNA laboratories that have introduced it suggests the disruption is manageable and the quality improvement is real.
A profession that never tests itself cannot claim to know its own error rate.
Proficiency testing presents examiners with test problems that have known correct answers, concealed from the examiner. Results are scored and reported, usually with aggregate statistics across the group. This is standard practice in medical laboratory accreditation (under ISO 15189) and in forensic DNA typing (under ISO 17025 and FBI QAS). It has been applied more unevenly in comparison disciplines.
The ABFO ran formal proficiency exercises in the late 1980s and 1990s. The results were not reassuring. In one widely cited 1999 exercise, 63 ABFO diplomates assessed four bite-mark cases. Agreement on whether the injury was a human bite mark at all was below 85%. Agreement on specific comparisons was lower. These results were used as evidence in subsequent Daubert hearings to argue that the discipline did not have a known or acceptable error rate.
Post-2016, both the ABFO and INTERPOL DVI have moved toward requiring documented proficiency and competency testing as a condition of participation in casework. ISO 17025-accredited dental comparison laboratories now include proficiency exercises in their quality management systems. The shift is from self-assessed expertise to externally validated performance.
The fix is structural, not motivational.
Telling examiners to 'try harder to be objective' does not reduce contextual bias. The cognitive mechanisms that produce it are automatic and largely unconscious. The effective interventions are structural: changing who gets what information, and when, rather than relying on individual willpower.
Individual good practice matters less than systematic accountability.
The move toward quality management systems in forensic laboratories, driven by ISO 17025 accreditation requirements, creates an institutional framework for managing bias risk rather than relying on individuals. A quality management system defines procedures, requires their documentation, mandates proficiency testing, and creates an audit trail that external assessors can review.
For forensic odontology specifically, the Texas Forensic Science Commission in 2016 issued a report recommending that bite-mark evidence not be admitted in Texas courts until its reliability had been better established. This was significant because it came from a state oversight body, not a defence attorney, and it applied the same framework the commission uses to assess laboratory error in DNA cases. Several other states have since restricted or excluded bite-mark evidence on similar reasoning.
In DVI contexts, INTERPOL's DVI guidelines require an independent second confirmation before a positive identification is recorded on the DVI Reconciliation Form. The confirmation must be performed by a different expert, blinded to the first finding. This is operationally embedded quality control, not an add-on.
An investigator tells the forensic odontologist that the suspect has already confessed before the comparison is done. What bias risk does this create?
Test yourself on Forensic Odontology with free, timed mocks.
Practice Forensic Odontology questionsSpotted an error in this page? Report a correction or read our editorial standards.