The Defence Fallacy
The defence fallacy is the argument that a one-in-a-million DNA or trace-evidence match probability is unimpressive because a large population could produce that many matches, ignoring all other evidence in the case. This topic explains the mathematical error in that reasoning and shows how match probabilities should be correctly contextualised within the full body of evidence.
Last updated:
The defence fallacy is a statistical argument made in forensic evidence disputes: because one person in a million matches a DNA or trace-evidence profile, and a country of 60 million people would therefore contain 60 potential matches, the evidence against the suspect proves nothing. The argument sounds superficially compelling but is logically wrong. It treats the random match probability as the only relevant fact while discarding every other piece of evidence in the case, including where the suspect was, what they were doing, and any corroborating physical evidence. Correctly interpreted, a match probability of one in a million is a likelihood ratio component that must be combined with a prior probability built from all the other evidence, not evaluated as a standalone figure against raw population size.
The fallacy pairs with its mirror image, the prosecutor's fallacy, which claims that a one-in-a-million match probability means there is only a one-in-a-million chance the suspect is innocent. Both errors arise from the same root mistake: treating a conditional probability as if it were something it is not. The defence version inflates doubt by citing population size. The prosecution version inflates certainty by conflating match probability with guilt probability. Neither is an honest representation of what the statistical evidence actually says.
Courts in England and Wales, the United States, Australia, and elsewhere have encountered both fallacies in reported cases. The Law Commission of England and Wales identified them as the two principal statistical errors in forensic evidence presentation in its 2011 report. The correct framework, recommended by forensic statistics bodies internationally, is Bayesian: express the evidential value of a forensic match as a likelihood ratio, then combine that ratio with a prior probability derived from all the other evidence. The defence fallacy cannot survive this analysis because the prior probability already reflects the context that the fallacy ignores.
By the end of this topic you will be able to:
- State the defence fallacy precisely and identify it in courtroom arguments or expert testimony.
- Explain why citing population size to dismiss a match probability is a misuse of conditional probability.
- Distinguish the defence fallacy from the prosecutor's fallacy and describe the direction of error in each.
- Apply the likelihood ratio framework to combine a match probability with a prior probability drawn from other evidence.
- Identify the relevant population for a random match probability and explain why restricting that population changes the weight of the evidence.
- Defence fallacy
- The erroneous argument that a match probability of one in N is evidentially worthless because a large population of size M would contain M/N potential matches. It discards all evidence other than the match probability itself.
- Random match probability (RMP)
- The probability that a randomly selected person from the relevant population would by chance produce the same forensic profile as the sample recovered from the scene. A small RMP means a rare profile, not that the suspect is guilty.
- Likelihood ratio (LR)
- The ratio of the probability of the evidence given one hypothesis to the probability of the evidence given an alternative hypothesis. In forensic matching, LR = P(evidence | suspect left sample) / P(evidence | a random person left sample). The LR is the reciprocal of the RMP when the match is otherwise certain.
- Prior probability
- The probability of a hypothesis before the forensic evidence is considered, derived from all other evidence in the case: witness accounts, CCTV, alibi, motive, and location. The defence fallacy error is to treat the prior as if it were 1/population size.
- Prosecutor's fallacy
- The mirror error to the defence fallacy: treating the random match probability as equivalent to the probability the suspect is innocent (i.e., equating P(evidence | innocent) with P(innocent | evidence)). This inflates the apparent certainty of guilt.
- Relevant population
- The population of individuals who could plausibly have left the forensic sample, constrained by geography, time, and other case facts. Choosing the right relevant population is critical: using the entire national population when the suspect pool is much smaller deflates the weight of the evidence.
The structure of the argument
The defence fallacy appears in a recognisable form. A forensic scientist testifies that the probability of a random person matching the DNA profile found at the crime scene is one in a million. Defence counsel responds: a country of 60 million people would therefore contain approximately 60 people who match the profile. The suspect is just one of 60 innocent matches. The evidence therefore identifies no one.
The arithmetic is correct. In a population of 60 million, a profile present in one person in a million is expected in about 60 people. The logical error is what follows: the claim that this arithmetic renders the DNA evidence valueless ignores every other piece of evidence in the case. The probability that the suspect is guilty is not derived from the match probability alone. It is derived from the match probability combined with the prior probability of guilt, which already incorporates the location of the crime, the suspect's proximity, any witness evidence, CCTV footage, alibis, and all other facts established by the investigation.
Suppose that before the DNA evidence is introduced, the other evidence in the case establishes a prior probability of guilt of 0.5 for the suspect. The DNA match with an RMP of one in a million then updates that probability. The posterior probability of guilt, using Bayes' theorem, is approximately 0.999999. The existence of 59 other people in the country who also match the profile does not change this calculation, because those other 59 people had a near-zero prior probability of being at the crime scene.
Conditional probability and the direction of error
Both the prosecutor's fallacy and the defence fallacy are errors in handling conditional probability. The precise statement of the two errors is:
| Prosecutor's fallacy | Defence fallacy | |
|---|---|---|
| Claim made | P(innocent | match) = P(match | innocent) | Match probability = 1/population, so evidence is worthless |
| Direction of error | Overstates guilt: conflates RMP with probability of innocence | Understates evidence: ignores prior probability from other facts |
| Conditional probability confused | P(E | H0) treated as P(H0 | E) | P(E | H0) evaluated against population size, ignoring P(H1) |
| Real-world effect | Jury thinks 1-in-1M match = near-certain guilt | Jury thinks 60 matches in population = evidence proves nothing |
| Correct framing | Use LR to update prior; posterior is not 1 minus RMP | Use LR to update prior; prior already reflects case context |
The key insight is that P(E | H0), the probability of observing the DNA match given that an innocent random person left the sample, is not the same as P(H0 | E), the probability that the suspect is innocent given the observed match. Bayes' theorem governs the relationship between these quantities, and it requires knowledge of the prior probability. The defence fallacy error is to use P(E | H0) as if it were the only relevant calculation, and to derive P(H0 | E) directly from population arithmetic.
See Role of Statistics in Evidence Evaluation for a fuller treatment of how the likelihood ratio framework is applied in evaluative reporting.
The relevant population problem
The defence fallacy typically invokes the entire national population as the pool of potential matches. This is rarely the correct population for evaluating a forensic match. The relevant population is the set of people who could plausibly have left the sample, given all the other evidence. A burglary that occurred in a specific neighbourhood in Mumbai, committed between 2 a.m. and 4 a.m. on a Tuesday, is not plausibly committed by a random draw from India's 1.4 billion people. The relevant population is far smaller: people known to be in that area at that time, people without alibi, people with prior connection to the premises.
The same logic applies in the United States, the United Kingdom, Australia, or any other jurisdiction. A crime scene DNA profile matched against a national database returns a random match probability relative to the database population. But if the investigation has already placed the suspect at the scene through other evidence, the effective prior probability of guilt is much higher than 1/N where N is the national population. Using the national population in the denominator of a defence argument is a rhetorical move, not a statistical one.
The Forensic Science Regulator in England and Wales, in guidance issued from 2012 onward, requires forensic scientists to avoid presenting match probabilities in a way that invites either the prosecutor's fallacy or the defence fallacy. The guidance recommends expressing DNA evidence as a likelihood ratio and being explicit about the population assumptions underlying the calculation. Similar recommendations appear in guidance from the European Network of Forensic Science Institutes (ENFSI) and the Scientific Working Group for DNA Analysis Methods (SWGDAM) in the United States.
Applying Bayes' theorem: a worked illustration
Bayes' theorem in the odds form is the clearest tool for showing why the defence fallacy fails. The posterior odds of guilt equal the prior odds multiplied by the likelihood ratio. Written as an equation: Posterior odds = Prior odds x LR, where LR = P(E | Hp) / P(E | Hd), Hp is the prosecution hypothesis (suspect left the sample), and Hd is the defence hypothesis (a random person left the sample).
If the RMP is one in a million, then P(E | Hd) = 0.000001. If the suspect left the sample, the evidence is certain, so P(E | Hp) = 1. The LR is therefore 1,000,000. Now suppose the prior odds of guilt, derived from non-DNA evidence, are 1:1 (an even chance the suspect is the perpetrator). The posterior odds are 1,000,000:1, corresponding to a posterior probability of guilt of approximately 0.9999990. Adding 59 other people in the national population who also match the profile does not change this calculation. Those 59 people each had near-zero prior odds; their posterior odds after multiplying by the same LR of 1,000,000 are still negligible.
The defence fallacy effectively sets the prior odds at 1/(national population), which would be correct only if the suspect were identified by a cold database trawl from the entire population with no other evidence. In any case where non-DNA evidence already implicates the suspect, this prior is wrong, and the fallacy follows from using it.
For a deeper treatment of how prior probabilities are constructed from non-scientific evidence, see Numbers in Forensic Conclusions.
Case law and regulatory guidance
The defence fallacy has been addressed directly in a number of reported cases. In England and Wales, R v Adams [1996] EWCA Crim 222 saw the Court of Appeal grapple with the use of Bayes' theorem by the defence expert. The court recognised both the prosecutor's fallacy and the defence fallacy as errors but expressed discomfort with directing juries to perform explicit Bayesian calculations. Subsequent cases, including R v Deen (1994) and R v Doheny and Adams [1997] EWCA Crim 222, refined the rules on how DNA match statistics may be presented.
In the United States, People v Collins (1968) 68 Cal 2d 319 is an early case addressing misuse of probability in court, where the court rejected a probabilistic argument partly because the underlying population assumptions were unverified. More recent DNA cases have addressed the population denominator problem. The National Academy of Sciences report Strengthening Forensic Science in the United States (2009) and the President's Council of Advisors on Science and Technology (PCAST) report (2016) both identify flawed statistical reasoning, including the defence fallacy structure, as a problem in forensic evidence presentation.
In India, the Bharatiya Sakshya Adhiniyam 2023 (which replaced the Indian Evidence Act 1872) governs the admissibility of expert evidence. Section 39 of the BSA 2023 (equivalent to former Section 45) allows courts to receive expert opinions but leaves weight to the court. Indian courts have not yet developed a body of case law as detailed as the English or US cases on forensic statistics, but the same underlying principles apply: match probability evidence must be presented with its correct context or it misleads the tribunal.
Australian courts addressed the defence fallacy structure in R v Karger [2002] SASC 294, where the Supreme Court of South Australia examined the proper way to present DNA evidence and considered the argument that population size dilutes the evidential value of a match. The court rejected the fallacy in that form. In the European Union, the ENFSI Guideline for Evaluative Reporting in Forensic Science (2015) requires likelihood ratio presentation for precisely this reason.
Preventing the fallacy in practice
Forensic scientists presenting match probabilities can prevent the defence fallacy by framing their evidence as a likelihood ratio and explicitly stating what the LR does and does not represent. The LR quantifies how much more probable the observed evidence is under one hypothesis than under the competing hypothesis. It does not specify the prior probability, and the expert should say so. Combining the LR with the prior is the task of the tribunal, informed by all the evidence in the case.
Judges and counsel can prevent the defence fallacy from misleading juries by requiring experts to state the population assumptions underlying any RMP figure and by directing juries that a match probability must be evaluated alongside the other evidence, not in isolation. Some jurisdictions now have standard jury directions on forensic statistics for this purpose.
- Express the match probability as a likelihood ratio, not as a bare number.
- State explicitly which population was used as the reference database and why.
- Remind the tribunal that the LR must be combined with prior odds derived from other case evidence, and that this combination is not the expert's role.
- In a cold-database-trawl case, advise that the prior probability is lower and the threshold for prosecution should reflect that.
- Avoid presenting match statistics alongside language that implies either near-certain guilt or near-worthless evidence.
Training for legal professionals in forensic statistics is the longer-term solution. The UK's Forensic Science Regulator, the US National Commission on Forensic Science, and equivalent bodies elsewhere have advocated for statistical literacy programmes for judges, prosecutors, and defence counsel. The defence fallacy persists partly because counsel who make the argument and tribunals who accept it lack the statistical grounding to identify the error at the moment it is made. See History of Statistical Evidence in Courts for how courts have evolved in their handling of these issues.
A forensic scientist states that the probability of a random DNA match is one in 500,000. Defence counsel responds: 'In a country of 50 million, that means 100 people match. This evidence identifies nobody.' What is the primary error in this argument?
Key Takeaways
- The defence fallacy argues that a small match probability is evidentially worthless because a large population generates many potential matches. The error is that it ignores all other evidence in the case by implicitly treating every person in the population as an equally probable suspect.
- Both the prosecutor's fallacy and the defence fallacy are errors in conditional probability. The prosecutor's fallacy overstates guilt by conflating P(evidence | innocent) with P(innocent | evidence). The defence fallacy understates the evidence by using population size instead of the correct prior probability.
- The correct framework is Bayesian: the likelihood ratio (the reciprocal of the RMP, when the match is certain under the prosecution hypothesis) updates the prior odds of guilt, which are derived from all the non-forensic evidence. The posterior odds cannot be derived from the RMP alone.
- The relevant population for an RMP calculation is not the entire national population but the set of people who could plausibly have left the sample, constrained by geography, time, and other case facts. Using too large a population artificially inflates the number of expected matches and strengthens the false defence argument.
- Courts in England and Wales, the United States, Australia, and other jurisdictions have recognised the defence fallacy as an error. Regulatory guidance from the Forensic Science Regulator, ENFSI, and SWGDAM requires likelihood ratio presentation precisely to prevent both fallacies from misleading tribunals.
What is the defence fallacy?
How does the defence fallacy differ from the prosecutor's fallacy?
What is the correct way to present match probability in court?
Why does specifying the relevant population matter?
Have courts recognised the defence fallacy as an error?
Test yourself on Forensic Statistics with free, timed mocks.
Practice Forensic Statistics questionsSpotted an error in this page? Report a correction or read our editorial standards.