Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
The national and regional geochemical soil surveys that forensic geologists use as reference resources, their practical limitations at the crime-scene scale, and the gap that targeted forensic sampling must fill.
Last updated:
Before a forensic geologist can say that a soil sample is unusual for its area, or that it shares a geochemical fingerprint with one location rather than another, they need a baseline. What does soil look like across the region? Which elements are elevated here versus there? National geochemical surveys were built to answer those questions for agricultural, environmental, and mining purposes, but they have turned out to be genuinely useful tools for forensic work as well.
The limitation is scale. Surveys like the BGS G-BASE, the USGS national soil characterisation project, and the FOREGS European baseline sampled the landscape at a density of one point every few square kilometres. That gives a continental or national picture, but a clandestine burial site or a crime scene footprint might cover ten square metres. The jump from regional baseline to forensic comparison requires targeted sampling, and the databases and the targeted collection serve different but complementary roles.
This topic maps the key databases, describes what they actually contain and how they are structured, and then looks honestly at what they cannot do. It also covers the emerging application of machine-learning classification to large geochemical datasets, which is an active research direction with real promise and real caveats for court use.
Britain's best-documented soil geochemistry, and its forensic applications.
G-BASE is the most extensively used national geochemical dataset in UK forensic geology. Surveys began in the 1970s and expanded progressively to cover England, Scotland, and Wales, with separate phases for urban areas at higher sampling density. The standard rural sampling density is approximately one site per 2 km2; urban surveys in major cities are denser at around one per 0.25 km2. At each site, samples are collected from a standardised depth (typically 5-25 cm for soil, stream-bed sediment for drainage surveys) and analysed for a panel of elements that has grown over time and now routinely includes arsenic, lead, cadmium, copper, zinc, nickel, chromium, and a suite of lanthanides, among others.
For forensic purposes, G-BASE does two things well. First, it maps geochemical anomalies, patches of elevated arsenic near old mining areas, high chromium over ultramafic intrusions, elevated lead in urban centres from historical leaded petrol and industrial history. A soil sample with an unusual elemental profile can be provisionally attributed to a geological or industrial province by comparison with the G-BASE atlas. Second, it provides background context for a likelihood ratio calculation: the regional distribution of an element's concentration can be estimated from G-BASE to feed into the denominator of the LR.
National surveys exist across most developed countries, but their methods differ.
The United States Geological Survey has published several national-scale soil geochemical datasets. The North American Soil Geochemical Landscapes Project produced a continental-scale dataset of over 4,800 sites covering the conterminous US, Alaska, Canada, and Mexico in a joint programme. Each site was sampled at three depths (0-5 cm, A horizon, and C horizon) to distinguish surface contamination from parent-material chemistry, which is a useful distinction in forensic work where the analyst needs to know whether an anomaly reflects recent deposition or deep geological character.
Other national surveys with forensic relevance include the Geological Survey of Canada's national dataset, Geoscience Australia's National Geochemical Survey of Australia (NGSA), and national programmes in Germany (BZE), the Netherlands, Sweden (SGU), and India (GSI Geochemical Atlas). Each has different sampling strategies, extraction methods, and element panels, which means that before combining datasets across borders the analyst must check whether the methods are sufficiently comparable. Differences in digestion strength (partial acid digest versus total dissolution) systematically shift elemental concentrations and make direct cross-survey comparison unreliable without a harmonisation step.
Pan-European geochemical coverage in one harmonised dataset.
FOREGS assembled a geochemical baseline for Europe at 845 sampling sites, each characterised by analyses of stream water, stream sediment, floodplain sediment, residual soil (B horizon, representing parent material), and agricultural soil (A horizon, representing land-use influenced chemistry). The joint analytical programme used standardised methods across participating geological surveys, making it one of the most internally consistent continental-scale datasets available.
For forensic geologists, the FOREGS dataset is most useful in two contexts. The first is cases where material may have crossed national boundaries, for example soil or sediment associated with vehicle movements through multiple countries, or sediment adhering to international cargo containers. The FOREGS data can help determine whether a geochemical signature is consistent with one European geological province rather than another, narrowing a case with a continental geographic scope. The second use is identifying distinctive geochemical provinces, areas with anomalous concentrations of particular elements that are unusual at the European scale and therefore highly diagnostic.
A 2-km grid cannot distinguish adjacent fields.
The core limitation of all national surveys for forensic work is their sampling density. Two kilometres separates adjacent sample points in G-BASE rural areas. Soil chemistry can change radically over ten metres where a geological boundary crosses a field, where a drain carries material from one lithology to another, or where historical industrial use has contaminated one plot and not its neighbour. National survey data can tell you what province a sample belongs to, but it cannot distinguish the crime scene from the suspect's back garden 800 metres away if both sit within the same survey grid square.
This is the reason targeted forensic reference sampling is not optional. When an analyst is asked to assess whether a questioned soil sample could have come from a specific location, they must sample that location and its surroundings at a scale that reflects the true geochemical variability of the area. National survey data informs the design of that sampling, helping identify which elements or properties are likely to discriminate at the fine scale and which are uniform enough to be uninformative.
| Resource | Spatial resolution | Forensic use | Cannot do |
|---|---|---|---|
| BGS G-BASE | ~2 km rural, ~500 m urban | Province attribution, background context for LR | Distinguish adjacent locations within a grid square |
| FOREGS | ~200 km between sites | Cross-border geochemical province assignment | Regional or local comparison |
| USGS NASGL | ~50 km average | Continental context, depth-horizon data | State or county-level discrimination |
| Targeted case collection | Metres to tens of metres | LR denominator, fine-scale comparison | Coverage beyond the sampled area |
Chemistry is not the only database that matters.
Geochemical element concentrations are one type of soil fingerprint, but mineralogical composition is another, and in many cases it is the more discriminating of the two. Heavy-mineral assemblages, which are the dense, acid-resistant minerals that persist in soil after weathering of less resistant phases, vary with bedrock geology and distance from the source outcrop. Garnet, zircon, rutile, tourmaline, apatite, and many other heavy minerals have identifiable compositions that can be linked to specific geological formations.
Reference collections for heavy-mineral forensics are less centralised than geochemical databases. The University of Leicester's provenance research group and several geological surveys hold reference mineral suites, but there is no single comprehensive international database equivalent to G-BASE for heavy minerals. Kenneth Pye Associates and the BGS have assembled working reference collections for forensic casework. For a specific case, the analyst often needs to build a bespoke reference collection using material from the relevant geological formations, identified using published geological maps and the BGS or equivalent national survey's borehole and sample archives.
Big datasets meet pattern recognition, with caveats for court.
The size of national geochemical databases, tens of thousands of samples with dozens of elements per sample, makes them natural targets for machine-learning classification. Random forest classifiers, gradient boosting models, and neural networks have all been applied to geochemical datasets with the aim of predicting the geological formation, land use, or geographic region of origin of a test sample. Published results from academic studies are promising: classification accuracy of 85-95% at the geological-formation level has been reported in some datasets.
Two practical obstacles limit immediate forensic application. The first is interpretability. Courts have difficulty with opaque models that cannot explain in plain terms why a sample was classified a particular way. A random forest assigns a probability based on the aggregated output of hundreds of decision trees; the analyst cannot point to a single feature that explains the result the way they can point to an elevated arsenic concentration on a map. The second is validation on forensic-scale samples. Most machine-learning studies have used survey data as both training and test sets. Applying a model trained on 2-km grid samples to a forensic sample collected from a 10-cm2 context is an extrapolation whose error characteristics have not been fully characterised.
Why can the BGS G-BASE survey not replace targeted forensic reference sampling?
Test yourself on Forensic Geology and Geoforensics with free, timed mocks.
Practice Forensic Geology and Geoforensics questionsSpotted an error in this page? Report a correction or read our editorial standards.