Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
Large language models impose their own statistical signature on text, disrupting traditional authorship attribution, while AI detection tools carry documented false-positive rates and no validated court-ready method yet exists for determining whether a specific human used an LLM to write or rewrite content.
Last updated:
For decades, forensic authorship attribution worked from a stable assumption: each writer has a distinctive style that persists across samples, measurable by statistical analysis of vocabulary, syntax, and function-word patterns. That assumption is now under pressure. Large language models can produce fluent, coherent text on demand, write in any requested register, and when used to edit or rewrite a human author's draft, replace the features a stylometrist would rely on.
This creates problems in several directions at once. Academic integrity systems flag student essays. Fraud investigators wonder whether a signed document was actually written by the person who signed it. Defamation cases hinge on authorship. Copyright claims require knowing how much of a creative work is human. And in all of these cases, the tools that law, courts, and institutions want to reach for : AI detection software : are producing false-positive rates that make them dangerous to use as primary evidence.
This topic covers how LLMs generate text and why that process disrupts stylometry, what the detection tools actually measure and what their documented limitations are, the legal questions that turn on human-AI authorship attribution, and what forensic linguistics can honestly say it can and cannot determine today.
The model does not write the way a person writes : and the difference is measurable, but not always in the right direction.
A large language model generates text token by token, at each step sampling from a probability distribution over the vocabulary conditioned on everything that came before. The model has no communicative intention, no memory of authoring previous documents, and no idiolect of its own. What it has is a statistical pattern learned from billions of words of training text, which it reproduces in a characteristically smooth and predictable way.
The result is text with several measurable properties. It tends to be low in perplexity: the word choices are statistically predictable given the context. It tends to be low in burstiness: the variation in sentence complexity across paragraphs is smaller than in typical human writing. It tends to avoid the kinds of slightly-off collocations and idiosyncratic constructions that human writers produce when their phrasing is slightly non-standard. And it is nearly free of the consistency over time that makes a person's writing recognisable as theirs : each generation is independent.
The complication is that these properties are not unique to LLMs. A human writer producing formal academic or professional prose makes similar choices: simple sentence structures, cautious vocabulary, avoidance of colloquialisms, consistent tone. A non-native English speaker constructing carefully correct text may produce low-perplexity, low-burstiness prose indistinguishable by a classifier from LLM output. This is where the false-positive problem originates.
When the model edits your prose, it replaces your fingerprint with its own.
Traditional stylometry assumes that a text reflects a single author's stable linguistic habits. Feed 10,000 words by a known author into Burrows's Delta or a similar method, and the software learns to recognise that author's function-word profile, sentence-length distribution, and vocabulary range. Then compare an unknown text, and the distance metric tells you whether it falls within that author's cluster.
This breaks in two distinct ways when an LLM is involved. The first is full generation: if a human provides a detailed prompt and the model generates all the prose, the output will cluster with the model's characteristic distribution, not the human's. The human's creative decisions : the ideas, the structure, the argument : are present, but the linguistic surface, which is what stylometry measures, belongs to the model.
The second failure mode is editing or paraphrasing. A human writes a rough draft, passes it to an LLM with the instruction "polish this", and accepts the output. The ideas and rough structure are human, but the function-word distribution, the sentence boundaries, and the vocabulary choices have been modified by the model's statistics. The resulting text will not cluster reliably with the human's known writing, and it will not cluster cleanly with pure LLM output either. It sits in an ambiguous middle ground that current tools cannot resolve.
| Scenario | Human idiolect preserved? | LLM signature present? | Stylometry useful? |
|---|---|---|---|
| Human writes without LLM | Yes | No | Yes, as baseline |
| LLM generates from detailed prompt | No | Yes | No : attributes to model |
| Human draft, LLM polishes | Partial | Partial | Unreliable |
| Human writes, LLM rewrites entirely | Minimal | Yes | No : model dominates |
| Human writes with light LLM suggestions | Mostly yes | Minor | Partially useful |
The tools courts and universities want to use are not ready for evidential weight.
AI text detection tools : GPTZero, Originality.ai, Turnitin's AI detection feature, and the classifiers built into academic integrity platforms : work by measuring properties of the text that correlate with LLM output in their training data. The most commonly cited are perplexity (computed against a reference language model) and burstiness (variance in per-sentence perplexity). Some tools also measure coherence, structural regularity, and the frequency of specific n-gram patterns associated with LLMs.
The underlying problem is that the classifiers are trained on a snapshot of LLM output at a specific moment. As models improve and as humans learn to prompt them to produce less stereotypically "AI" text, the classifiers' training distribution drifts from the current target. Furthermore, watermarking schemes : where text is generated with a statistical pattern embedded that a detector can verify : exist in research form but are not yet deployed consistently by major model providers in a way that allows third-party verification.
OpenAI released and then withdrew its own AI text classifier in 2023, citing accuracy concerns. The withdrawal itself is informative: a provider with access to its own model's internal probability distributions could not build a reliable public classifier even with that advantage. Third-party tools working only from the surface text face a harder problem still.
The law has not caught up with the technology, and forensic linguistics sits in the gap.
Three legal contexts are currently generating the most live questions about AI-assisted authorship: copyright, defamation, and fraud. Each asks a slightly different version of the same underlying question: to what extent is this text attributable to a specific human being?
Honesty about the limits of the field is its most important current contribution.
The current state of the field can be summarised plainly. A forensic linguist examining a questioned text can offer the following:
Professional bodies including the International Association of Forensic Linguists and the UK's Forensic Science Regulator have not yet produced specific guidance on AI-text evidence, reflecting how recently the issue has become practically urgent. The absence of a validated, peer-reviewed, court-ready methodology is not a gap that expert testimony can currently bridge by assertion.
The tools are catching up, slowly, with the problem they need to solve.
Several lines of work may change the picture. Cryptographic watermarking, where tokens are generated with an embedded statistical pattern that allows the model provider to verify later whether a text came from their system, is technically viable and has been prototyped by researchers at Google and elsewhere. If model providers adopt standardised watermarking, it could eventually provide a reliable mechanism for court-admissible AI attribution. The adoption problem is commercial: providers may be reluctant to enable easy identification of their outputs.
Authorship verification : rather than attribution : is a more tractable short-term problem. Rather than asking which of many possible authors wrote a text, verification asks: is the distribution of features in this text consistent with the known writing of person X, yes or no? Applied to the negative case (it is not consistent), this can reliably flag the possibility of AI assistance without claiming certainty about the mechanism. This framing is more defensible in court than a positive AI-generation finding.
The deeper question the field has not yet resolved is whether human-LLM collaborative writing, as it becomes a normal part of professional document production, remains a forensically tractable question at all. If everyone uses LLMs to polish their prose, the population of comparison texts will itself be LLM-influenced, and the baseline shifts. At that point, the question "did this person use an LLM?" may become as unanswerable as "did this person use a spell-checker?"
Why does LLM-assisted editing of a human draft make stylometric attribution unreliable?
Test yourself on Forensic Linguistics with free, timed mocks.
Practice Forensic Linguistics questionsSpotted an error in this page? Report a correction or read our editorial standards.