
Corpus

Definition

A principled, structured collection of texts or transcripts used as the basis for systematic frequency analysis. In forensic work, a comparison corpus provides the baseline against which the features of a disputed text are measured.
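As a rough illustration of measuring a disputed text against a comparison-corpus baseline, the sketch below counts a few function words per 1,000 tokens in both and reports the difference. The word list, the tokenizer, and the function names are assumptions made for this example, not a standard forensic procedure; a real analysis would use a justified feature set and a much larger corpus.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny feature list chosen only for illustration.
FUNCTION_WORDS = ("the", "of", "and", "to", "in")


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def relative_frequencies(texts, words=FUNCTION_WORDS):
    """Return occurrences per 1,000 tokens for each word across all texts."""
    tokens = [tok for text in texts for tok in tokenize(text)]
    total = len(tokens)
    if total == 0:
        return {w: 0.0 for w in words}
    counts = Counter(tokens)
    return {w: 1000 * counts[w] / total for w in words}


def compare(disputed_text, comparison_corpus, words=FUNCTION_WORDS):
    """Difference (disputed minus baseline) in per-1,000-token rates."""
    baseline = relative_frequencies(comparison_corpus, words)
    observed = relative_frequencies([disputed_text], words)
    return {w: observed[w] - baseline[w] for w in words}
```

Calling `compare(disputed_text, comparison_corpus)` with a string and a list of strings returns one signed difference per word. Interpreting those differences (for example, deciding whether one is large enough to matter) is a separate statistical question that this sketch does not address.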

Related terms

Dialect
A variety of language defined by a geographic region or social group, characterised by systematic differences in pronunciation, vocabulary, and grammar from...
Discourse structure
The way a text or conversation is organised above the sentence level: the sequence of moves in an argument, the turn-taking structure...
Function words
Grammatical words (prepositions, conjunctions, articles, pronouns) with little independent content meaning but high frequency in any text. Because they are used without...
Idiolect
The language variety specific to an individual, comprising their characteristic vocabulary, syntactic preferences, spelling habits, punctuation patterns, and discourse-level style. Authorship attribution...
Register
The variety of language associated with a particular situation, task, or relationship. Register varies along dimensions of formality, technicality, and interactional mode....
