Skip to content

Statistical Interpretation of Serological Relationship Evidence

Before DNA profiling, courts relied on blood group typing and serological markers to assess biological relationship claims. This topic covers exclusion probability, paternity index, and combined paternity index calculations, explains how serologists reported findings to courts, and contextualises the continued use of these methods in jurisdictions with limited DNA infrastructure.

Last updated:

Share

Statistical interpretation of serological relationship evidence refers to the mathematical framework used to translate blood group typing results into a probabilistic statement about biological relatedness. From the 1960s through the 1990s, laboratories used ABO, Rh, MNS, Kell, Duffy, Kidd, and protein marker systems to calculate three core statistics: the exclusion probability, the paternity index for each marker, and the combined paternity index (CPI) across all markers tested. The CPI was then converted to a probability of paternity using Bayes' theorem. Courts in the United States, the United Kingdom, Germany, Australia, India, and elsewhere used these calculations to assess disputed paternity and other relationship claims. DNA profiling has replaced serological typing in most high-resource settings, but the statistical logic is identical to what underlies modern STR-based likelihood ratio reporting.

The core problem in relationship testing is the same regardless of which genetic markers are used: a child carries two alleles at each locus, one from each parent. When the mother's alleles are known, the allele the child must have received from the biological father can be identified. This is the obligatory paternal allele. The question then becomes how common that allele is in the relevant male population, and whether the alleged father could have contributed it. If the alleged father cannot have contributed the obligatory allele, he is excluded. If he could have contributed it, a likelihood ratio is calculated comparing the probability of the observed result under the hypothesis that he is the father versus the hypothesis that a random unrelated man from the same population is the father.

Serological relationship testing carried specific limitations that courts and practitioners needed to understand. Blood group typing could exclude, but could never prove, paternity with certainty. The strength of a non-exclusion result depended on the informativeness of the marker systems tested and the accuracy of the population allele frequency data used. Population databases assembled in the 1970s and 1980s for reference populations in one country were sometimes applied to cases involving individuals from different ethnic backgrounds, introducing error. These limitations, and the legal frameworks built around them, remain instructive for understanding how probabilistic evidence is handled in courts today.

Step 1: Identify obligatory paternal allele(OPA)Child allele minusmaternal contributionStep 2: Paternity index per locus (PI)\nPI =P(AF transmits OPA) / allele freq pHomozygous: 1/pHeterozygous: 0.5/pNo OPA: PI = 0(exclusion)Step 3: Combined paternity index (CPI)\nCPI =PI1 x PI2 x ... x PInMultiply across allindependent systemsCPI = likelihood ratioStep 4: State prior probability (usually 0.5neutral)Legal, not scientificchoice; varies by courtStep 5: Probability of paternity(W%)\nEssen-Moller: W = CPI / (CPI + 1)Posterior probabilityReported to court as %Input / identificationCalculation stepsPrior assumptionCourt output
From blood types to court probability: the obligatory paternal allele feeds each locus paternity index (PI = probability AF transmits OPA, divided by population allele frequency); PIs multiply into the combined paternity index; the Essen-Moller formula then converts that likelihood ratio into the posterior probability of paternity using the stated prior.

By the end of this topic you will be able to:

  • Identify the obligatory paternal allele from a mother-child pair and determine whether an alleged father can or cannot have contributed it.
  • Calculate exclusion probability for a single marker system and explain what the value does and does not tell a court.
  • Calculate the paternity index for a single locus and combine indices from multiple systems into a combined paternity index.
  • Convert a combined paternity index to a probability of paternity using Bayes' theorem and explain the role of the prior probability.
  • Describe the limitations of serological paternity statistics and explain how these limitations were communicated to courts before DNA profiling.
Key terms
Obligatory paternal allele
The allele a child must have received from the biological father, identified by subtracting the mother's alleles from the child's genotype. The central object of analysis in serological relationship testing.
Exclusion probability (W)
The proportion of the male reference population that would be excluded as the biological father by the marker panel tested. A high W indicates a discriminating test panel; it does not directly state the probability that the alleged father is the true father.
Paternity index (PI)
A likelihood ratio for a single genetic marker: the probability of the child's genotype if the alleged father is the true father, divided by the probability if a random unrelated man from the population is the father. PI > 1 supports paternity; PI = 0 is an exclusion.
Combined paternity index (CPI)
The product of all individual paternity indices across all marker systems tested. Assumes independence of the marker systems. The CPI is a likelihood ratio; converting it to a probability requires a prior probability.
Probability of paternity (W%)
The posterior probability that the alleged father is the biological father, calculated from the CPI and a prior probability using Bayes' theorem. Most laboratories used a neutral prior of 0.5. Expressed as a percentage in most court reports.
Essen-Moller W
The formula for probability of paternity derived by Erik Essen-Moller in 1938 and widely adopted by forensic laboratories. W = CPI / (CPI + 1) when the prior probability is 0.5. Still the standard conversion used for serological and DNA paternity reports.

The obligatory paternal allele: the starting point

Every serological relationship analysis begins with the same step: identifying the allele the child must have inherited from the biological father. A child inherits one allele at each genetic locus from the mother and one from the father. If the mother's genotype is known, her contribution to the child can be subtracted, leaving the allele that had to come from the paternal side. This is the obligatory paternal allele (OPA).

For the ABO system, consider a child typed as blood group A (genotype AO) and a mother typed as group O (genotype OO). The mother can only have contributed O. The child's A allele must therefore have come from the father. Any man who cannot contribute an A allele is excluded. A man typed as blood group O (genotype OO) carries no A allele and is excluded. A man typed as blood group A or AB carries an A allele and cannot be excluded on the basis of ABO alone.

When the mother's blood group is not available, the analysis is less precise but not impossible. If the child is group AB, the father must have contributed either A or B (since no individual is AA or BB for ABO). A man typed as group O contributes neither A nor B and is excluded regardless of the mother's type. The absence of maternal typing reduces the strength of inclusions but rarely affects exclusions at systems where the alleles are well characterised.

Exclusion probability: what the test panel can discriminate

Exclusion probability (W, sometimes called the probability of exclusion or PE) is a population-level statistic. It answers the question: if a randomly chosen unrelated man from the reference population were tested, what is the probability he would be excluded by this marker panel? A panel with W = 0.99 would exclude 99% of unrelated men. The remaining 1% would share the obligatory allele(s) by chance and could not be excluded.

For a single biallelic system with alleles p and q (where p + q = 1), the probability that a random man carries the obligatory allele depends on allele frequencies and Hardy-Weinberg equilibrium. If the OPA is allele A with frequency p, a random man can have genotype AA (frequency p squared) or AO (frequency 2pq), so the probability of him carrying at least one copy of A is p squared + 2pq = p(p + 2q). The exclusion probability for this locus is 1 minus that value: 1 minus p(p + 2q).

Combined exclusion probability across multiple independent systems is calculated by multiplying the non-exclusion probabilities for each system and subtracting from 1. If system 1 has a non-exclusion probability of 0.10 and system 2 has a non-exclusion probability of 0.05, the combined non-exclusion probability is 0.10 times 0.05 = 0.005, and the combined exclusion probability is 0.995. This is the figure most often cited in court reports from the pre-DNA era: the test panel can exclude 99.5% of unrelated men.

Paternity index: the per-locus likelihood ratio

The paternity index (PI) for a single marker system is a likelihood ratio: the probability of the observed child genotype given that the alleged father is the biological father, divided by the probability of the observed child genotype given that a random unrelated man from the population is the biological father. When PI is greater than 1, the observed results are more consistent with the alleged father being the true father than with a random man. When PI equals 0, the alleged father is excluded.

ScenarioNumerator (Hp)Denominator (Hd)PI value
AF homozygous for OPA (AA), OPA = A1.0 (he must transmit A)p (frequency of A in population)1 / p
AF heterozygous, one allele is OPA (AO), OPA = A0.5 (50% chance he transmits A)p0.5 / p
AF does not carry OPA0 (he cannot transmit the required allele)p0 (exclusion)

In the table, AF denotes the alleged father and p is the population frequency of the obligatory paternal allele. The numerator is the probability that the alleged father transmitted the OPA given his typed genotype. The denominator is the probability that a random man from the population transmitted the same allele. If p is small (the allele is rare), the PI is high: finding a rare allele in both the child and the alleged father is strong evidence that he contributed it. If p is large (the allele is common), the PI is closer to 1: finding a common allele tells you less.

Population allele frequencies are the critical input. Laboratories maintained frequency tables for the populations they served, sometimes stratified by ethnic group or region. German courts in the 1970s relied on frequencies measured in German blood donor cohorts; US laboratories used frequencies measured in European-American, African-American, or Hispanic cohorts depending on the case. Using the wrong frequency table for the case population was recognised as a source of error, and challenged in several appellate decisions in the US during the 1980s and 1990s.

Combined paternity index and probability of paternity

The combined paternity index (CPI) is the product of the individual paternity indices across all marker systems tested, provided those systems are independent. Blood group systems at different chromosomal loci satisfy this requirement. Red cell enzyme systems and serum protein systems tested in the same era were also assumed to be independent of the major blood group loci. The CPI is still a likelihood ratio: it compares the probability of all observed results together under the two competing hypotheses.

Converting the CPI to a probability of paternity requires Bayes' theorem and a prior probability. Let H1 be the hypothesis that the alleged father is the biological father and H2 be the hypothesis that a random unrelated man is. The prior probability of H1 is set before considering the genetic evidence. Most forensic laboratories adopted the convention of a neutral prior of 0.5, treating the two hypotheses as equally likely before testing. The posterior probability of H1, called the probability of paternity (W%), is then CPI divided by (CPI + 1). This is the Essen-Moller formula.

Typical reporting thresholds varied by country. Germany required a probability of paternity of at least 99.73% before a court could order a declaration of paternity, a threshold sometimes called the 'practically proven' standard. US courts used no uniform threshold; some accepted results in the 90% range, others required values above 99%. UK courts took a similar non-prescriptive approach, treating the probability of paternity as one item of evidence among several. Indian courts under the Indian Evidence Act 1872 (now replaced by the Bharatiya Sakshya Adhiniyam 2023) treated blood group exclusions as strong negative evidence but were cautious about affirmative paternity probability claims from serological testing alone.

Reporting to courts: standards and challenges

Serological paternity reports typically followed a standard structure. The laboratory identified the marker systems used, the typed results for the mother, child, and alleged father, the obligatory paternal allele at each system, and whether the alleged father was excluded or included at each. If not excluded, the report gave the individual PI for each system, the CPI, and the probability of paternity calculated with a stated prior. Many reports also included the combined exclusion probability for the panel as a measure of the test's discriminating power.

Courts in common law jurisdictions heard challenges to three aspects of these reports. First, whether the population frequency data was appropriate for the individuals tested. Second, whether the prior probability of 0.5 was a legitimate scientific choice or an assumption that had improperly loaded the result. Third, whether the probability of paternity was a statement about the defendant in particular or a statement about a class of similar defendants. These were not merely theoretical objections; they were raised in appellate courts in the US (People v. Macedonio, 1992; State v. Frye issues around novel scientific evidence) and the UK, and they produced a body of case law that shaped how probabilistic evidence was presented in courts through the 1990s.

The transition to DNA profiling from the late 1980s onward did not eliminate these debates; it intensified them. STR profiling produces much higher likelihood ratios than serological systems, but the conceptual structure is the same: an obligatory allele, a population frequency, a per-locus likelihood ratio, a combined likelihood ratio across loci, and a prior-dependent probability statement. The debates over population databases, mixture interpretation, and prior probability that dominated forensic DNA cases in the 1990s were extensions of the same questions that arose in serological paternity testing two decades earlier.

Remaining uses and the transition to DNA

DNA short tandem repeat (STR) profiling surpassed serological typing for relationship testing in high-resource laboratories during the 1990s. STR profiles at 15 to 20 loci routinely produce CPIs in the billions, generating probabilities of paternity that approach certainty with a neutral prior. They are also more discriminating for exclusions: a single STR mismatch between child and alleged father at multiple loci is an unambiguous exclusion that does not depend on blood group inheritance assumptions.

Serological methods remain relevant in three settings. First, in jurisdictions where DNA infrastructure is not available or not affordable, ABO, Rh, and MNS typing continues to be used as a first-pass screen. A clear exclusion from blood group typing alone ends a case without the cost of DNA analysis. Second, in historical archive cases where blood group typing records exist but no biological material survives for DNA extraction, interpreting the statistical weight of those old records still requires understanding the original framework. Third, the conceptual framework (OPA, PI, CPI, Essen-Moller conversion) is the same framework used in DNA profiling, so serological relationship statistics serve as a pedagogically clear introduction to probabilistic forensic evidence.

Check your understanding
Question 1 of 4· 0 answered

A child is typed as blood group B (genotype BO). The mother is typed as blood group O (genotype OO). What is the obligatory paternal allele?

Key Takeaways

  • The obligatory paternal allele is identified by subtracting the known maternal contribution from the child's genotype; it is the starting point for all serological relationship calculations and determines whether an alleged father can be included or excluded.
  • Exclusion probability measures the discriminating power of the test panel over the reference population; it is not the probability that the specific alleged father is the biological father, a distinction that courts in multiple jurisdictions struggled with.
  • The paternity index at each locus is a likelihood ratio (probability of the result if the alleged father is the father, divided by the probability if a random population man is); the combined paternity index is the product across all independent systems tested.
  • The Essen-Moller formula (W = CPI / (CPI + 1)) converts the CPI to a probability of paternity using a neutral prior of 0.5; changing the prior changes the output, and courts in different jurisdictions adopted different prior conventions and reporting thresholds.
  • DNA STR profiling has superseded serological typing in most high-resource laboratories, but the conceptual framework is the same; serological paternity statistics remain relevant in low-resource settings, historical cases, and as the foundation for understanding DNA likelihood ratio reporting.
What is the paternity index in serological testing?
The paternity index (PI) is the likelihood ratio comparing two hypotheses: that the alleged father contributed the obligatory paternal allele versus that a random unrelated man did. A PI of 20 means the observed blood group results are 20 times more likely if the alleged father is the true father than if a random man from the population is. Each genetic marker system produces its own PI, and they are multiplied together to give the combined paternity index.
What does exclusion probability mean in a paternity case?
Exclusion probability (W) is the proportion of the relevant male population who would be excluded as the biological father by the marker systems tested, assuming the child and mother's types are correct. A W of 0.995 means 99.5% of unrelated men would be excluded by those tests. It does not measure the probability that the alleged father is the true father; it measures the power of the test panel to exclude unrelated men.
How was the combined paternity index calculated from multiple blood group systems?
Each marker system produces an individual paternity index by dividing the probability that the alleged father transmitted the obligatory allele by the allele frequency in the reference population. The combined paternity index (CPI) is the product of all individual paternity indices. For example, if ABO gives PI = 4, MNS gives PI = 3, and Rh gives PI = 5, the CPI is 60. The CPI was then converted to a probability of paternity using Bayes' theorem with a prior probability, most often 0.5.
Why is the probability of paternity not the same as the combined paternity index?
The combined paternity index (CPI) is a likelihood ratio, not a probability. Converting it to a probability of paternity requires a prior probability for the relationship. Using the standard neutral prior of 0.5, the probability of paternity = CPI / (CPI + 1). A CPI of 99 therefore gives a probability of paternity of 99 / 100 = 0.99 or 99%. The choice of prior is a legal and philosophical decision, not a scientific one, and courts in different jurisdictions handled it differently.
Are serological paternity methods still used today?
DNA short tandem repeat (STR) profiling has replaced serological blood group typing for relationship testing in most high-resource forensic laboratories. Serological markers remain relevant in jurisdictions where DNA infrastructure is unavailable or unaffordable, in archived historical cases where only blood group records exist, and as a conceptual framework that underpins DNA-based likelihood ratio reporting.

Test yourself on Forensic Immunology with free, timed mocks.

Practice Forensic Immunology questions

Found this useful? Pass it along.

Share

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.

Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.