Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
Scientific Content Analysis (SCAN) claims that deceptive statements have detectable linguistic patterns, but empirical testing consistently finds its accuracy at or below chance, raising serious questions about its continued use in investigations.
Last updated:
Avinoam Sapir had a simple premise: lying is cognitively harder than telling the truth, and that extra effort shows up in the text of a written statement. The words a person chooses when they are being deceptive, he argued, differ in systematic ways from the words they choose when they are being honest. If you knew what to look for you could read the statement, identify the deceptive sections, and tell the interviewer where to press. Sapir developed this into a training programme and a consultancy. Tens of thousands of investigators have since been trained in Scientific Content Analysis.
The appeal is obvious. Most of what investigators do before an arrest is work with language: witness statements, tip-offs, suspect accounts, denial letters. A systematic method for reading those documents and identifying deception would be extraordinarily valuable. The question forensic linguists ask is whether SCAN is that method or whether it is a structured way of imposing a narrative on a text that, under empirical scrutiny, turns out to be no more accurate than guessing.
This topic covers what SCAN claims, the specific indicators it proposes and the reasoning behind them, and then examines the empirical evidence that has accumulated since controlled testing began in the 1990s. The verdict from the research is clear and consistent. The harder question is why the technique survives in practice, and what that survival tells investigators about the difference between a method that feels like it works and a method that demonstrably does.
A checklist for reading lies off a page.
SCAN analysis starts by having the subject write a statement in their own words, without guidance or prompting, describing the event in question. The analyst then reads the statement looking for approximately fifteen proposed indicators. Sapir's reasoning is broadly cognitive: a person telling the truth produces a statement organised around what they actually remember. A person constructing a false account must manage what they claim to remember, what they claim not to remember, and how to make a fabricated account feel genuine, all at once. This extra cognitive load, Sapir argues, leaves traces in the text.
The pronoun shift indicator is central to the method. SCAN predicts that deceptive writers avoid first-person singular pronouns ('I') when describing the events they are lying about, and may shift to 'we' or drop the subject altogether ('Then went to the car'). The reasoning is that using 'I' claims personal agency and first-person experience: a liar supposedly avoids it at the precise moment of fabrication. Similarly, 'lack of conviction' language (hedges, memory disclaimers) is treated as suspicious because an honest person claiming a clear memory would not hedge.
When you test SCAN against ground truth, it does not work.
Aldert Vrij, professor of applied social psychology at the University of Portsmouth, led the most sustained empirical programme testing SCAN. In a series of studies in the 1990s and 2000s, Vrij and colleagues had participants produce truthful and deceptive accounts, then gave the statements to SCAN-trained analysts, untrained raters, and sometimes to automated analysis. Results across studies converged: SCAN analysts performed at chance (50 percent accuracy in a two-option true/false task) and sometimes below it.
A 2008 study by Bogaard, Meijer, Vrij and colleagues published in Psychology, Crime and Law directly tested whether the SCAN indicators appeared more often in deceptive statements than truthful ones. Most did not. Pronoun shifts and hedging language appeared with similar frequency in truthful accounts. The indicator that came closest to significance was 'missing time', but even this showed only a weak and unstable correlation. A 2020 systematic review by ten Brinke and colleagues found no peer-reviewed study demonstrating above-chance SCAN accuracy.
The theoretical foundations are equally weak. SCAN assumes that deceptive and truthful writers consistently differ in their use of specific features. But linguistic variation is driven by many factors: education, first language, topic familiarity, emotional state, cultural norms around assertion and hedging. A writer who habitually hedges for politeness, or who genuinely does not remember a peripheral detail, will produce 'lack of conviction' language that SCAN flags as deceptive. The signal SCAN is looking for, if it exists at all, is completely obscured by the noise of ordinary linguistic variation.
A method that does not work can persist if it is never tested fairly.
SCAN's continued use in investigative practice, particularly in US law enforcement and in Dutch and Israeli police contexts, represents a genuine puzzle given the strength of the empirical evidence against it. Several factors explain the persistence.
Describing what a text does and proving that description reveals lies are completely different claims.
The most important conceptual point about SCAN, and the one that separates forensic linguists from SCAN practitioners, is the distinction between linguistic description and validated deception detection. A linguist can describe a statement: this text uses fewer first-person pronouns than a typical account; this passage contains a higher-than-average density of hedging language; this section is organised differently from the surrounding narrative. All of those observations might be accurate.
What a linguist cannot do, on the basis of those observations alone, is conclude that the writer is lying. To move from description to detection requires a validated empirical relationship between the described feature and ground-truth deception, and that relationship has not been established for SCAN's indicators. This is not a theoretical objection. It is the same standard applied to fingerprint comparisons, DNA mixture interpretation, and bite-mark analysis: show your error rate, demonstrate that your conclusions are grounded in a reproducible, testable method.
What is the central empirical finding from controlled studies of SCAN accuracy?
Test yourself on Forensic Linguistics with free, timed mocks.
Practice Forensic Linguistics questionsSpotted an error in this page? Report a correction or read our editorial standards.