Prior and Posterior Odds in Court
Prior odds capture the probability of a proposition before forensic evidence is considered, and the likelihood ratio converts those prior odds into posterior odds after the evidence is weighed. This topic explains the mathematics of that conversion and the strict division of roles it demands between the forensic scientist, the judge, and the jury.
Last updated:
Prior odds and posterior odds are the two endpoints of a Bayesian inference chain. The prior odds express how probable the competing propositions are before forensic evidence is considered. The likelihood ratio (LR) is a number the scientist calculates from the evidence. Multiplying the prior odds by the LR gives the posterior odds: the updated probability after the evidence has been weighed. Bayes' theorem, the mathematical statement of this relationship, is written as: posterior odds = prior odds x LR. In court, the scientist provides the LR; the fact-finder, whether judge or jury, supplies the prior odds and arrives at the posterior. These are distinct roles, and keeping them separate is not a technicality but a constitutional principle in most legal systems.
The division of roles matters because the prior odds are set by non-scientific evidence: eyewitness accounts, CCTV footage, alibi claims, circumstantial facts, and the background rate of the crime. A forensic scientist who has examined only a glass fragment or a DNA profile has no basis for evaluating that non-scientific evidence. If the scientist states a probability of guilt rather than a likelihood ratio, they are implicitly combining their own evidence with prior information they have not seen and cannot weigh. This error, known as the transposition fallacy or the prosecutor's fallacy, has led to convictions being quashed in appellate courts in England, the Netherlands, and elsewhere.
Bayesian thinking entered forensic science through DNA profiling in the late 1980s and has since been applied to fingerprints, footwear marks, glass fragments, firearms residue, and voice comparison. The debate is no longer about whether probability matters in forensic evidence, but about how the scientist should communicate the LR, what verbal scale (if any) should accompany a numerical LR, and whether jurors can reason correctly with probabilistic testimony. These debates are active in courts and forensic science regulators across the UK, US, continental Europe, Australia, and increasingly in India and East Asia.
By the end of this topic you will be able to:
- Define prior odds, posterior odds, and the likelihood ratio and state the Bayesian equation that connects them.
- Explain why setting the prior odds is the fact-finder's task and not the forensic scientist's task.
- Identify the transposition fallacy and describe its legal consequences with reference to at least one real appellate case.
- Apply the Bayesian updating equation to a simple numerical example and correctly interpret the result.
- Describe how different legal systems, including England and Wales, the United States, and India under the Bharatiya Sakshya Adhiniyam 2023, approach probabilistic expert testimony.
- Prior odds
- The ratio of the probability of the prosecution proposition (Hp) to the probability of the defence proposition (Hd) before any forensic evidence is considered. Set by the fact-finder from non-scientific case information. Written as P(Hp) / P(Hd).
- Posterior odds
- The updated ratio after the likelihood ratio has been applied: posterior odds = prior odds x LR. The fact-finder uses posterior odds (combined with all other evidence in the case) to reach a verdict. The scientist does not determine the posterior odds.
- Likelihood ratio (LR)
- The scientist's numerical contribution: the probability of the observed evidence if Hp is true, divided by the probability of the same evidence if Hd is true. LR = P(E|Hp) / P(E|Hd). An LR greater than 1 supports Hp; an LR less than 1 supports Hd.
- Transposition fallacy
- The error of treating P(E|Hd) as equal to P(Hd|E). The probability of finding this evidence if the defendant is innocent is not the same as the probability of innocence given this evidence. Also called the prosecutor's fallacy when the direction of the error favours the prosecution.
- Bayes' theorem
- The mathematical rule that relates prior odds, the likelihood ratio, and posterior odds. In the odds form: O(Hp|E) = O(Hp) x LR. This is the formal basis for Bayesian forensic inference and is equivalent to the probability form of the theorem but easier to apply in sequential evidence evaluation.
- Evaluative reporting
- The style of expert testimony in which the scientist reports the LR (or a verbal equivalent) rather than stating a probability of the source proposition being true. Recommended by the Association of Forensic Science Providers, the UK Forensic Science Regulator, and ENFSI guidelines.
The Bayesian equation and what each term means
Bayes' theorem in its odds form is: posterior odds = prior odds x likelihood ratio. Each term has a precise meaning and a clear owner.
| Term | Mathematical symbol | Who sets it | What it contains |
|---|---|---|---|
| Prior odds | O(Hp) / O(Hd) | Fact-finder (judge or jury) | Non-scientific case evidence: witnesses, CCTV, alibi, background rates |
| Likelihood ratio | P(E|Hp) / P(E|Hd) | Forensic scientist | Probability of the observed evidence under each proposition |
| Posterior odds | O(Hp|E) / O(Hd|E) | Fact-finder | Prior odds updated by the LR; the basis for the verdict |
The prior odds reflect everything the fact-finder knows before the scientist gives evidence. In a robbery case where a defendant was found near the scene carrying property matching the victim's description, the prior odds before the DNA evidence is heard might be relatively high. In a case resting entirely on a cold-case database hit with no other evidence, the prior might be lower. The scientist does not know which case they are in and should not try to guess.
The LR summarises what the scientist's examination adds. An LR of 1 means the evidence is equally probable under both propositions: the forensic examination has provided no discriminating power. An LR of 10 means the evidence is ten times more likely if Hp is true than if Hd is true. An LR of 0.1 means the evidence is ten times more likely under Hd: the scientist's evidence supports the defence proposition. The direction matters as much as the magnitude.
What information sets the prior odds
The prior odds are formed from everything that is known about the case before the forensic evidence is admitted. This includes eyewitness identification, surveillance footage, the defendant's location at the relevant time, the defendant's prior relationship to the scene or victim, and the background frequency of the offence in the relevant population. None of these are scientific measurements: they are factual and circumstantial materials that the fact-finder weighs according to legal rules of evidence.
In practice, neither a judge nor a juror assigns an explicit numerical value to the prior odds. They carry an implicit sense of how probable guilt seems before hearing the forensic evidence, and the scientist's LR updates that sense. This implicit approach is how Bayesian reasoning operates in real trials, even when the word 'Bayesian' is never used. The framework is nonetheless useful for understanding what the scientist's evidence can and cannot do: it can move the odds, but cannot set them.
A specific concern arises when the forensic evidence is the primary evidence against the defendant, such as in a cold-hit DNA case where the database match is the only connection between the defendant and the crime. Some statisticians have argued that the appropriate prior in such a case is 1 divided by the number of people in the database, making the prior odds very low. If the LR is large, the posterior odds may still support conviction, but the calculation makes clear that an LR of 1 in 10,000 from DNA evidence in a database of 10,000 people leaves the posterior odds at exactly 1:1, which is not enough for conviction.
See also: Random Match Probability for the related concept of how a single source probability is calculated without reference to prior odds.
The transposition fallacy and its legal consequences
The transposition fallacy is the error of treating P(E|Hd) as equivalent to P(Hd|E). In plain language: the probability of seeing this evidence if the defendant is innocent is not the same as the probability of the defendant being innocent given this evidence. The two quantities can differ by orders of magnitude depending on the prior odds.
The error appears in many forms. A forensic biologist who says 'the probability that someone other than the defendant is the source of this DNA is 1 in a million' has correctly described P(E|Hd) if the reference population is specified. If the biologist then says 'the probability that the defendant is innocent is 1 in a million', they have committed the transposition fallacy: that second statement is the posterior probability of innocence, which requires the prior odds the scientist does not have.
The Netherlands case of the Lucia de Berk prosecution (2003 to 2010) is an example of both the transposition fallacy and the base-rate problem operating together. The statistical expert calculated the probability of so many patient deaths occurring by chance on the shifts of one nurse as 1 in 342 million. The court treated this as the probability of innocence, which it is not. After a decade of litigation, her convictions were quashed. The case has since been studied in probability and law curricula in many countries as a demonstration of what goes wrong when the prior is not handled separately.
In the United States, the Daubert standard (Daubert v Merrell Dow Pharmaceuticals, 509 US 579, 1993) requires federal courts to screen expert testimony for scientific validity and relevance. Testimony that transposes probabilities may fail the reliability criterion. State courts apply varying standards but the same logical error is equally damaging. In India, expert testimony is governed by Section 45 of the Bharatiya Sakshya Adhiniyam 2023 (which replaced the Indian Evidence Act 1872), and while the statute does not specify probability formats, the same error of stating posterior probabilities as if they were LRs would be a logical defect that courts can weigh against the testimony.
The scientist's proper role: providing the likelihood ratio
The forensic scientist's task is to calculate and communicate the LR for the evidence they have examined, under clearly stated propositions. The propositions are usually set at source level (this mark came from the suspect's shoe vs this mark came from some other shoe) rather than offence level (the defendant committed the burglary vs the defendant did not). The scientist works at the level they have expertise in; moving to offence-level propositions requires access to all the case evidence, which the scientist typically does not have.
The European Network of Forensic Science Institutes (ENFSI) published a guideline on evaluative reporting in 2015, recommending the LR framework as the appropriate format for forensic conclusions. The UK Forensic Science Regulator's Codes of Practice require providers to use an evaluative framework consistent with Bayesian principles. The Association of Forensic Science Providers (AFSP) in the UK has issued similar guidance. These documents do not all require a numerical LR: verbal scales that map qualitative terms to LR ranges are permitted, provided the scale is defined and applied consistently.
A common verbal scale pairs terms with LR ranges: 'limited support' might correspond to LR 2 to 10, 'moderate support' to LR 10 to 100, 'strong support' to LR 100 to 1000, 'very strong support' to LR 1000 to 1,000,000, and 'extremely strong support' to LR above 1,000,000. The precise boundaries vary between providers and countries. The UK's Forensic Science Regulator has noted that different scales in use across providers are a source of inconsistency and has encouraged standardisation.
The judge, the jury, and the allocation of the prior
The allocation of the prior odds to the fact-finder is not merely a convention: it reflects the constitutional structure of criminal trials in most systems. In jury systems, the jury evaluates the whole of the evidence and reaches a verdict. No single piece of evidence, including forensic evidence, determines the verdict mechanically. The judge's role is to direct the jury on the legal standard of proof (beyond reasonable doubt in criminal cases) and to evaluate the admissibility of expert testimony, but not to substitute their own numerical prior for the jury's judgment.
In bench trial systems, where the judge is also the fact-finder, the same logic applies: the judge must distinguish between the LR supplied by the expert and their own prior assessment of the non-scientific evidence. The formal Bayesian framework helps to keep these separate, even if the judge does not use arithmetic explicitly. The alternative, where a judge or jury simply accepts the scientist's conclusion about the posterior, is a delegation of the fact-finding function that the legal system is designed to prevent.
Under the Bharatiya Nagarik Suraksha Sanhita 2023 (which replaced the CrPC in India), sessions courts and magistrate courts are the primary fact-finders in serious criminal cases. The principle that the court, not the expert, determines guilt is the same as in common law systems. The European Convention on Human Rights (Article 6, right to a fair trial) has been interpreted by the European Court of Human Rights to require that the fact-finder not simply adopt an expert conclusion without independent evaluation. The same principle appears in the US Constitution's Sixth Amendment right to jury trial.
For background on how statistical reasoning entered courts historically, see History of Statistical Evidence in Courts.
Communicating LRs to courts: numbers, words, and the jury problem
How the LR is communicated to a court is a live research and policy question. DNA evidence, where LRs of billions are common, is typically reported numerically: a random match probability of 1 in 10 billion is a direct statement of P(E|Hd) and is widely understood by fact-finders as very strong evidence. For disciplines where the LR is harder to calculate with precision, such as footwear comparison, fingerprint comparison, or handwriting analysis, a verbal scale is more common.
Jury research has shown that people have systematic difficulties with conditional probability statements. The confusion between P(E|Hd) and P(Hd|E) is not confined to lawyers and experts: jurors make the same error. Some researchers advocate presenting evidence as natural frequencies ('in a population of 10,000 people unrelated to the victim, about 1 would match this profile') rather than probabilities, arguing that frequency formats reduce the transposition error. Other researchers have found that numerical LRs of very large magnitude are treated by jurors as essentially equivalent to certainty, which may itself distort verdicts.
A specific issue arises with very low LRs from non-DNA evidence. If a footwear expert reports an LR of 10 (the mark is 10 times more likely to have come from the defendant's shoe than from a random shoe in the population), this is a meaningful evidential contribution, but jurors may regard it as weak and largely disregard it. Research on anchoring and numeracy suggests that the magnitude of the LR is not linearly perceived: the difference between LR = 10 and LR = 1000 looms larger than the difference between LR = 100,000 and LR = 10,000,000, even though both are differences of two orders of magnitude.
These communication challenges do not undermine the Bayesian framework: they are problems of presentation and comprehension that courts and scientists are still working through. They do underscore why the scientist's role is to provide the LR clearly and correctly, why the judge must direct the jury carefully on what an LR means and does not mean, and why the jury's independent evaluation of all evidence, including the prior information, remains essential. See also Numbers in Forensic Conclusions for more on the challenge of translating numerical evidence for non-specialist audiences.
A forensic biologist states: 'There is a 1 in 5 million chance that someone other than the defendant is the source of this blood.' What does this statement directly express?
Key Takeaways
- Bayesian updating in court follows the equation: posterior odds = prior odds x likelihood ratio. The scientist provides the LR; the fact-finder supplies the prior and reaches the posterior.
- Prior odds are formed from non-scientific case information, such as witness evidence, CCTV, and circumstantial facts. The forensic scientist has no mandate or basis to set the prior.
- The transposition fallacy treats P(E|Hd) as equivalent to P(Hd|E). This error has led to wrongful convictions and appellate reversals in multiple jurisdictions, including England, the Netherlands, and the United States.
- The scientist's proper output is an evaluative statement in LR form, at the source-level proposition they have expertise in. Verbal scales mapping LR ranges to qualitative terms are acceptable provided the scale is defined and applied consistently.
- The allocation of the prior to the fact-finder is not a technicality: it reflects the constitutional principle, shared across jury systems, bench trial systems, and both common law and civil law traditions, that the court, not the expert, determines guilt.
What are prior odds in a criminal case?
How does a likelihood ratio convert prior odds into posterior odds?
Why should a forensic scientist not state a probability of guilt?
What is the prosecutor's fallacy in the context of Bayesian reasoning?
How do different legal systems handle Bayesian evidence evaluation in practice?
Test yourself on Forensic Statistics with free, timed mocks.
Practice Forensic Statistics questionsSpotted an error in this page? Report a correction or read our editorial standards.