Descriptive Statistics for Forensic Data

Descriptive statistics summarise a dataset's central tendency and spread, giving scientists and courts a concise picture of what was measured and how much it varied. This topic covers mean, median, mode, variance, and standard deviation applied to physical forensic measurements such as glass refractive indices and fibre diameters, and explains why honest reporting of all summary statistics matters more than selecting the ones that best support a preferred conclusion.

Last updated: 24 Jun 2026

Descriptive statistics are the numerical tools that compress a set of measurements into a small number of values that capture its essential character. In forensic science, where analysts routinely measure dozens or hundreds of physical properties, the mean, median, mode, variance, and standard deviation form the language in which findings are first expressed before any probabilistic inference begins. A forensic glass analyst who measures the refractive index of twenty fragments cannot hand the court twenty numbers; they must reduce those numbers to a summary that is both honest and informative, and descriptive statistics are the means of doing so.

Measures of central tendency tell you where the data sit on a scale. Measures of spread tell you how much the data vary around that centre. Both are necessary. A report that gives only the mean refractive index of a glass sample, without any measure of spread, cannot support the conclusion that the sample is consistent or inconsistent with a reference population, because consistency depends on how tightly or loosely that population clusters around its own mean. Omitting spread is not a minor simplification; it removes the information that carries evidential weight.

The same tools apply across the physical evidence types that forensic laboratories routinely encounter: fibre diameter distributions, particle size distributions in soils, bloodstain measurement series, bullet lead composition datasets, and handwriting stroke-width measurements. The mathematics is the same in each case. What changes is the physical meaning of the measurement and the population database against which the case sample is compared. Guidelines from bodies such as the European Network of Forensic Science Institutes (ENFSI) and the US Scientific Working Group for Materials Analysis (SWGMAT) emphasise full and transparent reporting of descriptive statistics precisely because selective reporting of summaries is one of the commonest ways that forensic evidence misleads courts.

By the end of this topic you will be able to:

Calculate the mean, median, mode, variance, and standard deviation for a small forensic dataset and interpret what each value tells a court.
Explain when the median is a more honest summary of central tendency than the mean, using an example from glass or fibre evidence.
Distinguish variance from standard deviation and explain why standard deviation is preferred in reports intended for non-specialist audiences.
Identify cherry-picking of summary statistics in a forensic report and explain why it is problematic under current evidence standards.
Apply descriptive statistics to a refractive index dataset to assess whether a questioned sample is consistent with a reference population.

Key terms

Mean: The arithmetic average of a dataset, calculated as the sum of all values divided by the number of values. Sensitive to extreme values (outliers), which can pull it away from the bulk of the data.
Median: The middle value of a dataset sorted in ascending or descending order. For an even number of values, it is the average of the two middle values. Resistant to the influence of outliers, making it a useful complement to the mean.
Mode: The value that appears most frequently in a dataset. Most informative for discrete or categorical data; for continuous measurement data it is rarely the primary summary statistic but can indicate clustering.
Variance: The average of the squared deviations from the mean. Because it squares the original unit of measurement, variance is not directly interpretable in the original units, but it is the basis for the standard deviation.
Standard deviation: The square root of the variance. Expressed in the same units as the original measurement, making it interpretable as a typical distance from the mean. The most commonly reported measure of spread in forensic data.
Refractive index (RI): The ratio of the speed of light in a vacuum to its speed in a material. For glass evidence, RI is the primary continuous measurement used to compare fragments; modern instruments can measure to five decimal places.

Measures of central tendency: mean, median, and mode

Central tendency describes the location of a dataset on a measurement scale. The three classical measures are the mean, the median, and the mode, and each answers a slightly different question about the data.

The mean is calculated by summing all values and dividing by the count. If a forensic scientist measures the refractive index of ten glass fragments and records values 1.51832, 1.51840, 1.51835, 1.51828, 1.51841, 1.51833, 1.51839, 1.51836, 1.51830, and 1.51837, the mean is the sum divided by ten. The mean is the natural starting point for comparing a case sample against a reference population mean, but it has a well-known weakness: a single outlier, perhaps a contaminating fragment of quite different composition, can shift the mean substantially. The seven other fragments might all cluster near 1.51835, but one fragment at 1.52100 will pull the mean upward.

The median is the middle value when the data are sorted. For the ten values above, sorted in order, the median is the average of the fifth and sixth values. If a dataset contains that contaminating outlier at 1.52100, the median barely changes, because the middle of the sorted list is still dominated by the cluster of similar values. This resistance to outliers makes the median a useful complement to the mean. When mean and median differ substantially, the analyst should investigate why, because the difference usually signals either outliers or a skewed distribution.

The mode is the most frequently occurring value. In continuous measurement data such as refractive index measured to five decimal places, exact repeated values are uncommon, so the mode is rarely informative unless the data are rounded to fewer decimal places or are inherently discrete. Where the mode is informative is in categorical evidence: if twenty soil particles are classified into five mineralogical types and twelve of them are quartz, the mode is quartz. The mode can also reveal bimodality, which would indicate that the sample contains two distinct sub-populations, perhaps glass from two different sources mixed at the scene.

Measure	What it captures	Sensitivity to outliers	Best used for
Mean	Arithmetic centre	High	Symmetric distributions, reference population comparisons
Median	Middle rank value	Low	Skewed data, presence of outliers
Mode	Most common value	None (counts frequencies)	Categorical data, detecting bimodality

Measures of spread: variance and standard deviation

A measure of central tendency alone is insufficient for forensic comparison. Knowing that the mean refractive index of a reference glass population is 1.51835 tells you nothing about how tightly the population clusters around that value. Two populations can share the same mean but have completely different spreads: one might have all values within 0.00005 of the mean, while another has values ranging 0.0015 on either side. These populations are easily distinguishable despite sharing a mean, but only if spread is also reported.

Variance is calculated by finding the deviation of each data point from the mean, squaring each deviation, and averaging the squared deviations. Squaring serves two purposes: it makes all deviations positive regardless of direction, and it penalises large deviations more than small ones. However, squaring the deviations also squares the unit of measurement. If the original data are in micrometres (fibre diameter), the variance is in square micrometres, which is not a natural unit and is difficult to interpret directly.

Standard deviation solves this by taking the square root of the variance, returning the measure to the original units. A standard deviation of 0.3 micrometres means that a typical fibre measurement falls within about 0.3 micrometres of the mean. In a dataset that follows a normal distribution, approximately 68% of values fall within one standard deviation of the mean, and approximately 95% fall within two standard deviations. These properties underpin many of the comparison rules used in forensic laboratories.

In practice, both variance and standard deviation appear in forensic reports. Variance appears in formal statistical tests such as analysis of variance (ANOVA), which compares spread within groups to spread between groups. Standard deviation appears in descriptive summaries because it is expressed in the same units as the measurement and can be understood by non-specialist readers. A report that gives only the mean and standard deviation of a refractive index dataset, without the raw values or the count, is not complete; the underlying data should also be available for independent verification.

Applying descriptive statistics to glass refractive index data

Glass refractive index measurement is one of the most studied applications of descriptive statistics in forensic science. The primary technique, temperature-controlled immersion microscopy using oils of known refractive index, and later automated instruments such as the Mettler Toledo refractometer, produces a series of continuous measurements for each fragment. A case sample of twenty fragments from a suspect's clothing is compared against a reference sample of thirty fragments from the crime scene window.

The analyst calculates mean and standard deviation for both the case sample and the reference sample. A simple comparison rule used historically is the two-standard-deviation overlap rule: if the mean of the case sample falls within two standard deviations of the mean of the reference, the samples are said to be consistent. This rule has known limitations, particularly when the reference population database is large and the within-source variation is small, because many different source glasses can produce overlapping ranges. More rigorous approaches, covered in related topics on visualising forensic data and common distributions in forensic science, use likelihood ratios that account for both within-source and between-source variation. Descriptive statistics remain the foundation of these more sophisticated approaches.

A forensic analyst working in the UK follows guidelines from the Forensic Science Regulator's Codes of Practice, which require that the analytical method, the number of measurements, and the summary statistics be reported in full. In the United States, SWGMAT guidelines on glass analysis similarly require reporting the refractive index range, mean, and standard deviation for each set of measurements. In Europe, ENFSI guidelines on glass examination specify the same. These requirements exist because without complete reporting, independent experts instructed by the defence cannot check the work.

Descriptive statistics for fibre evidence

Fibre evidence involves both categorical attributes (fibre type, colour, weave construction) and continuous measurements (fibre diameter, twist per centimetre). Descriptive statistics apply to the continuous measurements, while frequency counts and proportions apply to the categorical attributes.

Fibre diameter is measured in micrometres using a calibrated microscope. A batch of fibres from a crime scene garment might yield measurements with a mean of 18.2 micrometres and a standard deviation of 0.8 micrometres. A reference sample from a suspect's garment might show a mean of 18.0 micrometres and a standard deviation of 0.9 micrometres. The overlap of these ranges does not itself constitute a definitive match; it means the measurements are consistent, which is one component of a broader comparison that also includes colour, fibre type, and construction.

When the distribution of fibre diameters is plotted as a histogram, the shape of the distribution often becomes visible. A symmetric, bell-shaped distribution suggests that the variation arises from random manufacturing variation around a target diameter. A bimodal distribution, with two distinct peaks, might indicate that the fibres came from two different batches or even two different garments. The mean and standard deviation alone would not reveal this bimodality; they would describe the overall spread of a dataset that actually contains two distinct sub-populations. This is one reason why descriptive statistics should be accompanied by graphical display whenever the dataset is large enough for a histogram.

The dangers of cherry-picking summary statistics

Cherry-picking in forensic statistics means selecting only the subset of descriptive statistics, or only the subset of data points, that supports a predetermined conclusion. It is a form of confirmation bias with direct consequences for the reliability of evidence presented in court. The same dataset can tell different stories depending on which summaries are reported.

Consider a forensic fibre analyst who measures fifty fibres from a crime scene and fifty from a suspect's garment. Forty-five of the crime scene fibres have diameters between 17 and 19 micrometres; five are outliers between 22 and 25 micrometres. If the analyst reports only the mean and standard deviation of the forty-five consistent fibres, excluding the five outliers without explanation, the report gives a misleadingly tight spread that makes the case sample appear more consistent with the reference than it actually is. Including the five outliers and explaining their significance, whether they suggest contamination, a secondary transfer, or genuine within-sample variation, is what complete honest reporting requires.

The remedy is straightforward: report the full set of measurements (or make them available for inspection), state the number of measurements taken, state any measurements that were excluded and the reason, and report mean, median, and standard deviation for the complete dataset unless there is a documented scientific rationale for excluding specific values. Under the Criminal Procedure Rules in England and Wales, the Bharatiya Sakshya Adhiniyam 2023 in India, the Federal Rules of Evidence in the United States, and the EU's framework for scientific evidence in criminal proceedings, an expert who omits material data from their report is exposed to serious procedural and professional consequences.

Reporting descriptive statistics to courts: presentation and context

The gap between computing a statistic and communicating it effectively to a judge or jury is significant. A statement that the mean refractive index of the case glass is 1.51837 with a standard deviation of 0.00004 conveys nothing to a non-specialist unless the analyst also explains what those numbers mean: that the measurements are tightly clustered, that the standard deviation represents the typical distance of any single measurement from the central value, and how that compares to the reference population.

The preferred presentation in modern forensic evidence is the verbal equivalence scale anchored to a numerical measure. After reporting the descriptive statistics, the analyst states whether the data are consistent or inconsistent, and, where a likelihood ratio framework is used, places that conclusion on a calibrated verbal scale. The descriptive statistics support the conclusion but do not replace it: they are the factual foundation from which the inferential conclusion is drawn. Separating the factual summary from the inferential conclusion is a core requirement of ENFSI guidance on evaluative reporting.

Graphical aids, such as box plots showing the range and interquartile spread, or dot plots showing individual measurements, can help courts understand how the case sample relates to the reference population. Courts in England and Wales have accepted expert graphics as part of evidence for several decades; the US Supreme Court ruling in Daubert v Merrell Dow (1993) and its progeny have reinforced the requirement that the scientific basis of any statistical presentation be clearly explained. Descriptive statistics are the most basic such basis, and presenting them clearly is the starting point for any evaluative report.

Worked example

Comparing glass refractive index datasets: a complete descriptive analysis

A forensic scientist receives twenty glass fragments from a suspect's jacket and twenty-three fragments from a broken window at the scene. Both sets are measured for refractive index. The task is to summarise both datasets and assess whether they are consistent.

Work through each step of the descriptive analysis in sequence, recording the results at each stage before drawing any conclusion.

Record all measurements and count them. List all twenty values for the suspect sample and all twenty-three values for the scene sample. Confirm the count. Note any measurements that look unusual before computing any summary statistic.
Calculate the mean for each dataset. Sum all values in the suspect sample and divide by twenty. Repeat for the scene sample. Suppose the suspect sample gives a mean of 1.51836 and the scene sample gives 1.51834.
Calculate the median for each dataset. Sort both datasets in ascending order. The median of twenty values is the average of the tenth and eleventh. The median of twenty-three values is the twelfth value. If the means and medians are close for each dataset, the distributions are approximately symmetric. If they differ, check for outliers.
Calculate variance and standard deviation. For each dataset, compute the deviation of every value from its mean, square the deviations, sum them, and divide by n minus 1. Take the square root to get the standard deviation. Suppose the suspect sample gives a standard deviation of 0.000038 and the scene sample gives 0.000042.
Assess consistency. The difference between the two means is 0.000020. The pooled standard deviation is approximately 0.000040. The two-standard-deviation range of the scene sample, centred on 1.51834, is 1.51826 to 1.51842. The suspect sample mean of 1.51836 falls within this range, so the datasets are consistent using this criterion.
Report completely. Record the count, mean, median, and standard deviation for both datasets, note whether any values were excluded and why, and state the comparison criterion applied. This information allows an independent expert to verify the conclusion using the same data.

Check your understanding

Question 1 of 4· 0 answered

A forensic fibre analyst measures ten fibre diameters (in micrometres): 18.1, 18.3, 18.0, 18.2, 18.4, 18.1, 18.3, 18.2, 18.0, 25.6. Which single measure of central tendency is least distorted by the value 25.6?

Key Takeaways

Mean, median, and mode each capture a different aspect of central tendency; in forensic data with outliers, the median is more resistant to distortion than the mean, so reporting both gives a more complete picture.
Variance and standard deviation quantify spread around the mean; standard deviation is preferred in reports because it is expressed in the same units as the original measurement, making it interpretable for non-specialist audiences.
Two samples can share the same mean while having completely different spreads, which is why measures of spread are as important as measures of central tendency when comparing forensic samples.
Cherry-picking summary statistics, reporting only those values that support a favoured conclusion while omitting others, is a documented form of confirmation bias in forensic science and is prohibited by professional standards in all major jurisdictions.
Complete reporting requires the count, mean, median, and standard deviation for the full dataset, a note on any excluded values and the reason for exclusion, and access to the underlying measurements for independent verification.

What is the difference between the mean and the median in forensic data?

The mean is the arithmetic average of all values in a dataset, calculated by summing them and dividing by the count. The median is the middle value when the data are sorted in order. In forensic datasets that contain outliers, such as a single contaminated glass fragment with an unusually high refractive index, the median is less affected than the mean. Reporting both gives a fuller picture of the data than either alone.

Why does standard deviation matter when comparing forensic samples?

Standard deviation quantifies how spread out the measurements in a dataset are around the mean. Two glass populations can share the same mean refractive index yet be easily distinguishable if one has a much smaller standard deviation than the other. Without the standard deviation, a summary that reports only the mean can create a false impression that two samples are similar when their spread is quite different.

What is cherry-picking in the context of forensic statistics?

Cherry-picking means selecting only the summary statistics, or only the data points, that support a particular conclusion while ignoring those that do not. In forensic science it is considered a form of confirmation bias and can mislead courts. International guidelines such as those from ENFSI and SWGMAT require analysts to report the full distribution of measurements, not a curated subset.

How is variance related to standard deviation?

Variance is the average of the squared differences between each data point and the mean. Standard deviation is the square root of the variance, which returns the measure to the same units as the original data. Because variance squares the unit (for example, square micrometres for fibre diameter), standard deviation is more intuitive for reporting to non-specialist audiences such as juries.

When should the mode be used in forensic data analysis?

The mode, the most frequently occurring value, is most useful for categorical or discrete data such as glass colour categories, soil types, or fibre type classifications. For continuous measurement data such as refractive index, the mode is rarely informative unless the data cluster into distinct groups, which itself is worth reporting as it may indicate mixed populations from different sources.

Test yourself on Forensic Statistics with free, timed mocks.

Practice Forensic Statistics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.