Evaluative Opinion Standards and Frameworks

Evaluative opinion in forensic science is governed by a small set of international standards and accreditation requirements that define what a compliant report must say and what it must never claim. This topic covers the ENFSI Guideline for Evaluative Reporting, ILAC G19, and ISO 17025, and explains how they interact with accreditation bodies and admissibility rules across different legal systems.

Last updated: 24 Jun 2026

An evaluative opinion in forensic science is a formal statement expressing how much more or less probable the observed evidence is under one proposition compared to another. Three international documents define the standards that govern such opinions: the ENFSI Guideline for Evaluative Reporting, which sets out what a compliant evaluative report must say and must not claim; ILAC G19, which links evaluative reporting to measurement uncertainty under the accreditation framework; and ISO 17025, the laboratory standard that accreditation bodies use to verify that a laboratory's methods, procedures, and reporting practices are fit for purpose. Together they form an interlocking structure: ISO 17025 creates the organisational baseline, ILAC G19 applies it to forensic reporting, and the ENFSI Guideline provides the discipline-specific language rules.

Before these documents existed, evaluative language in forensic reports varied widely between laboratories, between disciplines, and between individual scientists. Some reports stated conclusions in probabilistic terms; others made categorical source attributions. Courts received expert opinions expressed in incompatible language and had no reliable way to compare the strength of evidence across cases or disciplines. The standards exist to fix that: they create a shared grammar for expressing the inferential weight of evidence, and they give accreditation bodies a mechanism to verify that the grammar is applied consistently.

The standards originated in Europe and have been most widely adopted there, but their influence is global. Laboratories in Australia, New Zealand, Canada, and parts of Asia have adopted ENFSI-aligned reporting practices. The United States has moved more slowly toward standardised evaluative language, in part because the decentralised structure of US forensic science means that different laboratories and different disciplines have different reporting cultures. The 2016 PCAST report from the US President's Council of Advisors on Science and Technology called for exactly the kind of validation and transparent uncertainty reporting that ILAC G19 and ISO 17025 require, pushing US practice toward the same destination from a different starting point.

By the end of this topic you will be able to:

Describe the purpose and scope of the ENFSI Guideline for Evaluative Reporting, including the verbal scale it prescribes and the conclusions it prohibits.
Explain what ILAC G19 requires of forensic laboratories in relation to measurement uncertainty and how that requirement connects to evaluative opinion.
State the ISO 17025 requirements that apply directly to the issuance of opinions and interpretations in forensic testing reports.
Identify the mandatory elements of a compliant evaluative opinion and the prohibited claims that a non-compliant opinion makes.
Explain how accreditation status under these standards interacts with, but does not determine, the admissibility of expert evidence in court.

Key terms

ENFSI Guideline for Evaluative Reporting: A consensus guidance document from the European Network of Forensic Science Institutes that defines how member laboratories should express evaluative opinions. It mandates proposition-based reasoning, likelihood ratio expression, and a standardised verbal scale, and prohibits source-attribution conclusions.
ILAC G19: A guidance document from the International Laboratory Accreditation Cooperation on the use of uncertainty of measurement in forensic science reporting. It requires laboratories to estimate and report measurement uncertainty alongside evaluative opinions, linking this to ISO 17025 accreditation.
ISO 17025: The international standard for testing and calibration laboratories, specifying requirements for competence, impartiality, and consistent operation. For forensic laboratories it mandates method validation, uncertainty estimation, and documented procedures for issuing opinions and interpretations.
Proposition pair: The two competing hypotheses that frame an evaluative opinion: typically a prosecution proposition (Hp) and a defence proposition (Hd). A compliant report must state both propositions explicitly before expressing the likelihood ratio.
Verbal scale: A standardised mapping from likelihood ratio values to descriptive phrases used in court-facing reports. The ENFSI scale runs from 'weak support' through 'moderate', 'moderately strong', 'strong', 'very strong', and 'extremely strong' support. Its purpose is to communicate the inferential weight of evidence without overstating it.
Accreditation body: An independent organisation that assesses whether a laboratory meets the requirements of ISO 17025 and, where relevant, ILAC G19. Examples include UKAS (UK), ANAB (USA), NATA (Australia), and DAkkS (Germany). Accreditation is a third-party endorsement of a laboratory's competence, not a court ruling on admissibility.

The ENFSI Guideline for Evaluative Reporting

The European Network of Forensic Science Institutes published its Guideline for Evaluative Reporting in Forensic Science in 2015, following a decade of discussion among member laboratories about how to standardise the expression of uncertainty and inferential weight. The guideline applies to all forensic disciplines that produce evaluative opinions: DNA, fingerprint, questioned documents, fire investigation, toxicology, and many others. Its core requirement is that evaluative opinions be expressed as likelihood ratios or, where a numerical LR is not available, as verbal equivalents drawn from a defined scale.

The ENFSI verbal scale has six levels of support for the prosecution proposition: weak (LR between 1 and 10), moderate (10 to 100), moderately strong (100 to 1000), strong (1000 to 10,000), very strong (10,000 to 1,000,000), and extremely strong (above 1,000,000). The scale is one-sided: support for the defence proposition is expressed symmetrically in the opposite direction. Every band has a corresponding verbal phrase that is used in the written report and, where the expert gives oral evidence, in testimony.

The guideline also specifies the structure of a compliant evaluative report. The report must: state the items examined and the methods used; set out the prosecution and defence propositions that frame the evaluation; describe the population model or reference database used to estimate the LR; report the LR value or verbal equivalent; state the key assumptions; and list limitations that could affect the conclusion. This structure makes the opinion auditable. A reviewer can check each step independently.

ILAC G19 and Measurement Uncertainty

ILAC G19, most recently updated in 2014, is titled 'Modules in a Forensic Science Process' and addresses the full chain from evidence collection through laboratory analysis to reporting. Its provisions on measurement uncertainty are the most directly relevant to evaluative opinions. The document requires that forensic laboratories identify and quantify the sources of uncertainty in any measurement or analytical result that feeds into an evaluative conclusion.

Measurement uncertainty in this context is not about philosophical doubt: it is the quantified range within which the true value of a measurement is expected to lie, given the method and conditions used. For a breath alcohol analyser, this might be expressed as a concentration value plus or minus a stated margin. For a questioned document examination relying on ink chromatography, it is the reproducibility of the chromatographic profile and the discriminating power of the comparison method. ILAC G19 requires that this uncertainty be propagated through to the evaluative conclusion.

ILAC G19 also addresses the scope of the laboratory's role. It distinguishes between the 'case assessment' stage, where the scientist helps investigators understand what questions the evidence can address, and the 'evaluation' stage, where the analysis and opinion are produced. This distinction matters for accreditation: a laboratory must have documented procedures for both stages, and the opinion produced at the evaluation stage must be traceable to the case assessment decisions made at the earlier stage.

ISO 17025 Requirements for Forensic Opinions

ISO 17025 (most recently revised in 2017) is the foundational standard for any laboratory that generates data used in decision-making. For forensic science, it creates specific requirements in the 'reporting of results' clause. Clause 7.8 of the 2017 standard addresses what a test report must contain. When a laboratory issues an opinion or interpretation, the report must clearly identify it as an opinion, state the basis on which it was formed, and be signed by the qualified person responsible for it.

Requirement	ISO 17025 location	Forensic application
Method validation	Clause 7.2	Every analytical method used to generate data supporting an LR must be validated for its intended forensic purpose
Measurement uncertainty	Clause 7.6	Uncertainty must be estimated for all relevant measurements; reported where applicable
Opinions and interpretations	Clause 7.8.7	Must be clearly identified as opinions, state the basis, and be traceable to the data
Impartiality	Clause 4.1	The laboratory must not allow commercial or other pressures to bias its opinions
Competence of personnel	Clause 6.2	Staff issuing evaluative opinions must have documented qualifications and authorisation

Accreditation to ISO 17025 is granted by national accreditation bodies. The UK Accreditation Service (UKAS) accredits UK forensic laboratories, the ANSI National Accreditation Board (ANAB) covers many US laboratories, and DAkkS covers German laboratories. The accreditation process includes a technical assessment of the methods used, a review of the laboratory's uncertainty estimation procedures, and inspection of sample reports. A laboratory that holds accreditation has had its evaluative opinion procedures verified by an external body against these requirements.

Elements of a Compliant Evaluative Opinion

Drawing together the requirements of the three documents, a compliant evaluative opinion has a defined structure. The following elements are mandatory:

Items examined: a precise description of every item received, its condition on receipt, and the sub-samples or traces analysed.
Methods: the validated analytical methods applied, with a reference to the validation data or accredited procedure.
Proposition pair: the prosecution and defence propositions (Hp and Hd) that frame the evaluation, stated in plain language before the LR is given.
Likelihood ratio or verbal equivalent: the numerical LR where the method supports it, or the appropriate verbal level from the ENFSI scale if a numerical value is not available.
Assumptions: the key assumptions on which the LR depends, including the reference population chosen and any assumptions about the transfer and persistence of trace material.
Limitations: conditions that could reduce the reliability of the conclusion, such as degraded samples, small reference databases, or conditions that fall outside the validated range of the method.

A non-compliant opinion typically fails in one of three ways. It makes a source attribution ('this blood is the victim's'). It conflates the likelihood of the evidence with the probability of guilt. Or it reports an LR without disclosing the assumptions on which it rests, making it impossible for the court, the opposing expert, or a reviewing scientist to evaluate whether the figure is reliable.

The prosecutor's fallacy is the most common form of non-compliance: inverting the conditional probability so that P(evidence | hypothesis) is presented as P(hypothesis | evidence). A compliant evaluative opinion, by requiring that the LR be separated from the posterior probability and that the jury be told explicitly that the prior is theirs to assess, structurally prevents this error.

Accreditation, Admissibility, and Legal Frameworks

Accreditation under ISO 17025 and compliance with ENFSI or ILAC G19 do not determine whether evidence is admissible. Admissibility is a legal question decided under the procedural rules of the relevant jurisdiction. In the United States, the Daubert standard (established in Daubert v. Merrell Dow Pharmaceuticals Inc., 1993) requires trial judges to assess whether scientific testimony is based on sufficient facts, a reliable methodology, and has been applied reliably to the facts of the case. Frye, still applied in some US states, asks whether the method is generally accepted in the relevant scientific community.

In England and Wales, the Criminal Procedure Rules Part 19 and the Criminal Practice Directions set out what expert reports must contain and require that experts understand their overriding duty to the court. The Law Commission's 2011 report on expert evidence recommended a reliability test modelled loosely on Daubert, and while Parliament did not enact the full proposal, courts have increasingly scrutinised whether forensic methods are adequately validated. In India, the Bharatiya Sakshya Adhiniyam 2023 governs the admissibility of expert opinion evidence, continuing the general principle from prior law that expert opinion is admissible where the court needs assistance on a matter of specialised knowledge; the statute does not specify validation requirements, but the quality of the underlying method is a factor courts may examine. The European Court of Human Rights has addressed the fairness of expert evidence under Article 6 of the European Convention on Human Rights in several cases, focusing on whether the defendant had an adequate opportunity to challenge the evidence.

The practical effect is that accreditation and standards compliance are powerful but not decisive. A laboratory with ISO 17025 accreditation and ENFSI-compliant reports is far better placed to withstand a Daubert challenge or a Criminal Procedure Rules objection than one without these credentials. But accreditation does not bind the court, and an admissible report from an accredited laboratory can still be rejected by a jury on the merits of the evidence.

Adoption Challenges and Current Debates

The transition to standards-compliant evaluative reporting has not been uniform across jurisdictions or disciplines. In fingerprint science, some practitioners and professional bodies in the United States have resisted the shift from categorical identification language ('this fingerprint belongs to the suspect') to likelihood ratio language. The argument is partly cultural, rooted in longstanding courtroom practice, and partly technical: validated population databases for fingerprint comparison are less developed than those for DNA, making numerical LRs harder to defend.

In DNA, the transition happened earlier and is more complete. Short tandem repeat (STR) profile matching has had validated population databases since the 1990s, and the random match probability has been a standard feature of DNA evidence for nearly as long. The debate in DNA has moved to more challenging evidence types: low template DNA, mixed profiles, and familial searching, where the uncertainty is greater and the LR calculation more contentious.

Pattern-based disciplines such as toolmark and bite-mark comparison face the most difficulty. The PCAST report found insufficient foundational validity for bite-mark analysis and limited validity for toolmark analysis, meaning that the validation requirement of ISO 17025 is a genuine barrier to compliance rather than a formality. Disciplines that cannot demonstrate foundational validity through published, reproducible studies cannot produce compliant evaluative opinions: they can report observations, but they cannot quantify the weight of those observations without the required validation data.

Worked example

Tracing a fibre comparison through the compliance checklist

A textile fibre found on a victim's clothing is compared with fibres from a suspect's jacket. The forensic scientist must produce an evaluative opinion that complies with ENFSI guidelines and ISO 17025.

Walking through the compliance checklist for a fibre comparison reveals both what a compliant report looks like and where non-compliance typically occurs.

Items examined. The scientist documents: one adhesive lift from the victim's coat (Item 1) and one sample of the suspect's grey wool jacket (Item 2). Both received in sealed packaging; seals intact on receipt. Thirty fibres recovered from Item 1 by targeted tape lift; twelve selected for comparison based on colour and morphology match with Item 2.
Methods. Fibre colour assessed by microspectrophotometry (MSP) using an accredited laboratory procedure (ref: LP-FIB-003, validated range: 400-700 nm, reproducibility data in validation file VF-003-2022). Fibre type assessed by FTIR. Both methods validated for the fibre types present.
Proposition pair. Hp (prosecution): the fibres on the victim's coat originated from the suspect's jacket. Hd (defence): the fibres on the victim's coat originated from an unknown jacket of the same type. Note that Hd is not 'the suspect is innocent': it is a source proposition that the LR can address.
Likelihood ratio. Twelve fibres recovered from Item 1 match Item 2 in colour (MSP), fibre type (FTIR), and diameter. The probability of finding this combination in a random grey wool jacket of this type, drawn from the reference database of 1,400 wool jackets surveyed in 2020, is estimated at 1 in 800. LR = 800. This falls in the 'moderate' band on the ENFSI verbal scale.
Assumptions. The LR assumes: (a) the reference database is representative of the population of grey wool jackets of this type available in the region; (b) the fibres were not transferred from a third garment; (c) no secondary transfer occurred. If assumption (b) or (c) is false, the LR cannot be assigned to this source level.
Limitations. The reference database contains 1,400 entries; a larger database might revise the frequency estimate. The MSP measurements have an uncertainty of plus or minus 5 nm, propagated through the LR calculation. The LR range accounting for this uncertainty is 400 to 1,600, which remains within the 'moderate' band.
What the report does not say. The report does not state 'these fibres came from the suspect's jacket.' It does not express a probability of guilt. It does not use evaluative language such as 'the evidence is consistent with the suspect being at the scene.' These conclusions go beyond what the LR supports and would render the opinion non-compliant.

Check your understanding

Question 1 of 4· 0 answered

Under the ENFSI verbal scale, which phrase corresponds to a likelihood ratio of 5,000?

Key Takeaways

Three interlocking documents govern evaluative reporting: ISO 17025 sets the laboratory baseline, ILAC G19 applies it to forensic reporting and measurement uncertainty, and the ENFSI Guideline provides the discipline-specific language rules including the mandatory verbal scale.
A compliant evaluative opinion must state the proposition pair, the LR or verbal equivalent, the assumptions, and the limitations. It must never make a source attribution or express a posterior probability of guilt.
ILAC G19 requires that measurement uncertainty be estimated and propagated through to the evaluative conclusion, so that the court understands the precision of the analysis, not just its central estimate.
Accreditation under ISO 17025 from bodies such as UKAS, ANAB, or NATA is a third-party verification that a laboratory's methods and reporting procedures meet these requirements, but it does not guarantee admissibility, which remains a legal question under the applicable procedural rules.
Adoption of these standards is uneven across disciplines and jurisdictions: DNA evidence largely complies, while pattern-based disciplines such as bite-mark comparison face foundational validity gaps that prevent compliant evaluative opinions from being issued at all.

What is the ENFSI Guideline for Evaluative Reporting?

The ENFSI Guideline for Evaluative Reporting is a consensus document published by the European Network of Forensic Science Institutes that sets out how forensic scientists in member laboratories should express evaluative opinions using likelihood ratios. It defines the verbal scale for communicating the strength of evidence, specifies mandatory report elements, and prohibits source-attribution conclusions. While it originates in Europe, many non-European laboratories have adopted its principles as a reporting standard.

What does ILAC G19 require for forensic evaluative reporting?

ILAC G19 is a guidance document from the International Laboratory Accreditation Cooperation that addresses the use of uncertainty of measurement in reporting forensic results. It requires that evaluative opinions be accompanied by a statement of the uncertainty associated with the measurement or analytical process, and it links this requirement to ISO 17025 accreditation. Laboratories seeking accreditation must demonstrate that their evaluative opinions are grounded in validated methods with documented uncertainty.

What does ISO 17025 require of forensic laboratories that issue evaluative opinions?

ISO 17025 is the international standard for testing and calibration laboratories. For forensic work, it requires that any method used to generate data supporting an evaluative opinion be validated, that measurement uncertainty be estimated and reported, and that the laboratory have a documented procedure for issuing opinions and interpretations. A laboratory accredited to ISO 17025 has had these requirements verified by an external accreditation body such as UKAS, ANAB, or DAkkS.

What must a compliant evaluative opinion include?

A compliant evaluative opinion must state the proposition pair being evaluated (prosecution proposition and defence proposition), the likelihood ratio or verbal equivalent expressing the strength of the evidence given those propositions, the key assumptions underpinning the analysis, and the limitations of the method used. It must not state a posterior probability of guilt, name the source of the trace with certainty, or claim that the evidence is conclusive without qualification.

How do evaluative opinion standards relate to court admissibility?

Compliance with ENFSI, ILAC G19, and ISO 17025 does not automatically make evidence admissible. Admissibility is decided by the court under the applicable procedural law: Daubert or Frye standards in the United States, the Criminal Procedure Rules in England and Wales, and equivalent rules elsewhere. However, a laboratory with ISO 17025 accreditation and reports that follow ENFSI guidelines is in a much stronger position when the opposing party challenges the reliability of the method or the adequacy of the opinion.

Test yourself on Forensic Statistics with free, timed mocks.

Practice Forensic Statistics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.