Practice with national-level exam (FACT, FACT Plus, NET, CUET, etc.) mocks, learn from structured notes, and get your doubts solved in one place.
The instruments and decision frameworks for detecting feigned mental disorder and cognitive impairment: the TOMM (Test of Memory Malingering) two-trial forced-choice protocol; SIRS-2 (Structured Interview of Reported Symptoms) eight scales for over-reported psychopathology; M-FAST (Miller Forensic Assessment of Symptoms Test) brief screen; the Slick-Sherman-Iverson 1999 criteria for malingered neurocognitive dysfunction (4-level classification); the difference between malingering, factitious disorder, and the Rogers Detection of Deception model.
Last updated:
Malingering sits at the intersection of clinical psychology, forensic investigation, and legal philosophy in ways that make it one of the most contested areas in the entire field. The word carries a moral charge that the clinical term "feigning" does not: to malinger is to lie deliberately for gain, and labelling someone a malingerer in a legal proceeding is accusing them of a deliberate deception with potentially severe consequences. A false positive malingering determination can destroy a genuine claimant's compensation case, result in a genuinely ill defendant receiving inadequate treatment, or provide a prosecutor with a convenient narrative that short-circuits a serious mental-health defence. The instruments discussed in this topic were built to reduce those errors, not to eliminate human judgment.
The conceptual landscape is more complex than a simple true/false division between genuinely ill and feigning. Richard Rogers's taxonomy, developed in the 1980s and refined through successive editions of his text Clinical Assessment of Malingering and Deception, identifies five possible response styles: honest responding (the baseline we hope for), symptom amplification (exaggerating real symptoms), malingering proper (fabricating or dramatically amplifying symptoms for external gain), defensiveness (denying genuine symptoms), and irrelevant or random responding. Each response style leaves a different footprint on personality instruments, cognitive tests, and structured symptom interviews, and each calls for a different clinical and legal response.
The legal consequences of a malingering determination are jurisdiction-specific and sometimes asymmetric. In the United States, a malingering diagnosis can lead to dismissal of a personal-injury claim, affect parole decisions, and critically undermine a mental-state defence. In England and Wales, Crown Prosecution Service guidance recognises that apparent symptom exaggeration in a defendant with a genuine mental disorder may still not amount to malingering in the clinical sense, because genuine disorders amplify actual symptoms. In India, NIMHANS clinical guidance acknowledges that malingering assessment instruments standardised on Western populations may have altered sensitivity in Indian forensic populations, where cultural norms about expressing distress differ from those that shaped the instrument development. The assessment challenge is to get it right in the individual case, not to be right on average.
*The difference between malingering and factitious disorder is the motive for the deception, not the deception itself. Courts care about both, but for different reasons.*
DSM-5 describes malingering as the intentional production of false or grossly exaggerated physical or psychological symptoms motivated by external incentives such as avoiding military duty, avoiding work, obtaining financial compensation, evading criminal prosecution, or obtaining drugs. Two features distinguish malingering from other forms of symptom misrepresentation. First, the motivation is external: there is a concrete, identifiable gain the person is seeking. Second, the deception is intentional: the person knows they are misrepresenting their condition. DSM-5 lists malingering as a condition that may be a focus of clinical attention (a V-code) rather than a mental disorder, because the behaviour itself is not a symptom of psychopathology.
Factitious disorder (formerly Munchausen syndrome in the ego-syntonic form) differs from malingering in motivation: the person with factitious disorder produces symptoms to assume the sick role, with no obvious external incentive. The motivation is internal psychological need rather than external gain. From a legal standpoint, factitious disorder does not support a criminal defence of insanity, but it may be relevant to mitigation, because the person who produces false symptoms compulsively for psychological reasons is in a different moral and clinical position from the person who calculates that feigning will benefit them financially.
The Rogers response-style model. Rogers (1984, revised 2008) articulated the response-style taxonomy that has organised the field. Genuine responding produces a profile consistent with the person's actual psychological status. Symptom amplification involves exaggerating real symptoms; this is extremely common and does not necessarily meet the threshold for malingering. Fabrication involves creating entirely false symptoms without any genuine psychopathological substrate. In clinical practice, pure fabrication is uncommon; most individuals who malinger in forensic assessments have some genuine psychopathological features and exaggerate them substantially. This mixed picture is the most common finding in forensic assessments and the hardest to classify cleanly.
Partial malingering and the classification challenge. The Slick-Sherman-Iverson (1999) criteria for malingered neurocognitive dysfunction (discussed in Section 4) introduced a four-level classification that explicitly acknowledges partial malingering. The Rogers taxonomy similarly distinguishes between the amplifier (who exaggerates real symptoms) and the malingerer proper (who fabricates symptoms without genuine substrate). For legal purposes, courts and clinicians have sometimes treated any upward response-style distortion as malingering, but this conflation is clinically inaccurate. An expert who uses "malingering" where "symptom amplification" is the more accurate description will face a well-prepared cross-examiner who knows the literature.
*The TOMM's two-trial structure is specifically designed to distinguish genuine severe amnesia from deliberate underperformance on a forced-choice recognition task where chance is 50%.*
The Test of Memory Malingering (Tombaugh, 1996) is the most widely validated, most commonly administered performance validity test for memory in forensic neuropsychological practice. It takes approximately 15-20 minutes to administer and is designed to detect insufficient effort or deliberate underperformance on a visual recognition memory task.
TOMM structure. The TOMM consists of three trials. In Trial 1, the subject views 50 black-and-white drawings of common objects for 3 seconds each. A recognition trial immediately follows, presenting each studied object paired with a new object in a forced-choice format. Trial 2 repeats the same recognition procedure with the same 50 objects after a second study phase. The optional Retention Trial replicates Trial 2 format 15 minutes later. Because the task involves simple, distinctive visual stimuli, even patients with severe genuine amnesia (from Alzheimer's disease, post-anoxic amnesia, or severe TBI) typically score at or above the cut-off on Trial 2.
TOMM cut-off scores. A score of 45 or below on Trial 2 (out of 50) is the conservative cut-off typically cited in the validation literature for identifying performance invalidity. Tombaugh's original validation study demonstrated that 95% of genuine patients, including those with severe memory impairment, scored 45 or above on Trial 2. Scores below 45, and particularly scores approaching chance (25), indicate that the person is performing systematically worse than someone guessing randomly, which is impossible if the person is attending to the task normally. Several US federal courts, including in personal-injury and criminal-forensic contexts, have admitted TOMM evidence as meeting Daubert reliability standards.
Cross-jurisdictional TOMM validation. The TOMM has been validated in German (Merten et al., 2007), UK (Yanez et al.), Spanish (Alarcon et al.), and several other European and Latin American samples. The Australian ANZAPPL forensic psychology guidelines include the TOMM as a recommended performance validity measure. In India, a validation study using NIMHANS patient populations is not published as of 2025; however, NIMHANS forensic psychology practice has adopted the TOMM as a clinical tool with the acknowledgment that the cut-off scores derived from North American and European samples are applied provisionally, pending Indian normative data.
TOMM limitations. The TOMM assesses performance validity specifically for visual object recognition memory, not for overall cognitive performance validity or symptom validity in the personality domain. A defendant who performs validly on the TOMM may still be exaggerating psychiatric symptoms on a self-report instrument. Conversely, a low TOMM score does not establish malingering of psychiatric symptoms. The TOMM is one element in a comprehensive PVT battery and should be used alongside instruments from different validity domains.
*The SIRS was designed to detect feigned psychiatric symptoms, not feigned cognitive deficits. This domain specificity matters when interpreting discordant results across instruments.*
The Structured Interview of Reported Symptoms, Second Edition (Rogers, Bagby, and Dickens, 2009, updating the original 1992 SIRS) is a 172-item structured interview that takes approximately 30-45 minutes to administer. The SIRS-2 is specifically designed to detect over-reported or fabricated psychiatric symptoms through eight primary scales, each targeting a different pattern of symptom misrepresentation.
The eight SIRS-2 primary scales. Each scale was derived from empirical research on how genuine patients and known feigners differ in their symptom reporting patterns. The Rare Symptoms scale (RS) contains items describing symptoms rarely endorsed by genuine patients; elevation suggests fabrication of implausible symptoms. The Symptom Combinations scale (SC) pairs symptoms that rarely co-occur in genuine disorders; endorsement of multiple implausible symptom combinations suggests fabrication of syndrome profiles the person does not genuinely have. The Improbable and Absurd Symptoms scale (IA) contains descriptions of bizarre, physically impossible, or clinically inconceivable symptoms that no genuine patient would endorse; any endorsement is highly suspicious. The Blatant Symptoms scale (BL) contains overtly pathological symptoms that a coached feigner might exaggerate, useful as a contrast to more subtle scales. The Subtle Symptoms scale (SU), Selectivity of Symptoms scale (SEL), Severity of Symptoms scale (SEV), and Reported vs Observed Symptoms scale (RO) each target different over-reporting mechanisms.
SIRS-2 decision rules. The SIRS-2 uses a conservative decision framework with three classifications: genuine, indeterminate, and feigning. The feigning classification requires meeting specific scale-score thresholds that have been validated in known-groups studies. The indeterminate category is intentional: it acknowledges that many individuals fall in a zone where the data are insufficient to make a confident classification either way. Rogers has been explicit that the SIRS-2 should not be used to rule out feigning with confidence; it is more powerful for detecting feigning when it is present than for confirming genuine responding.
SIRS-2 modifications for non-English populations. The SIRS-2 was initially validated in English. Validated Spanish and French translations have been published, and cross-cultural validation studies in European contexts have generally replicated the original scale structure and classification accuracy. For Indian forensic populations, the SIRS-2 has been used in NIMHANS research studies with translated items, but a full normative and validation study in Indian populations has not been published. The cross-cultural challenge for the SIRS-2 is particularly acute for scales like IA (Improbable and Absurd Symptoms), where what counts as culturally plausible versus implausible may vary substantially, and for RS (Rare Symptoms), where the rarity data are derived from North American clinical populations.
SIRS-2 and the credibility of psychopathological claims in criminal proceedings. In the US, SIRS-2 evidence has been admitted under Daubert in numerous federal district courts. The Eighth Circuit in United States v. Johnson (2008) upheld admission of SIRS-based feigning testimony. In Canada, the Supreme Court's approach to expert evidence under R v. Mohan (1994) requires that expert evidence assist the trier of fact and be from a properly qualified expert. SIRS-2 testimony has been admitted in Ontario and British Columbia provincial courts in criminal mental-state proceedings. In England and Wales, SIRS-2 based expert testimony falls under CPR Part 19 and must meet the reliability standard of R v. Atkins and Atkins (EWCA 2009). In India, SIRS-2 based expert opinion would be tendered under BSA 2023 § 39 with full disclosure of the normative limitations.
*The SSI criteria were the first published framework that treated malingering assessment as a multi-criterion classification problem rather than a single-cutoff determination.*
Slick, Sherman, and Iverson (1999) published a landmark paper in The Clinical Neuropsychologist proposing specific criteria for the classification of Malingered Neurocognitive Dysfunction (MND). The criteria emerged from recognition that the field lacked a standardised framework for making malingering determinations in neuropsychological practice, leading to inconsistent and sometimes indefensible expert testimony. The SSI criteria remain the most widely cited framework in neuropsychological malingering assessment, though subsequent research has proposed modifications.
The SSI four-level classification. The SSI framework classifies MND into four levels: Definite MND, Probable MND, Possible MND, and Not MND. Definite MND requires unambiguous evidence of the intentional production of false or exaggerated symptoms (such as a direct admission) combined with the absence of genuine neurological disease that could account for the performance pattern. Probable MND requires meeting at least two of the criteria groups (performance below chance on forced-choice measures, or discrepancies between test performance and documented neurological findings, combined with self-reported history inconsistencies or collateral information inconsistencies). Possible MND requires meeting only one criterion group, in the absence of psychopathological factors that might account for performance invalidity.
SSI Criteria Group A: below-chance performance. This is the strongest single criterion: performance below chance (below 50% correct on a two-alternative forced-choice task) is statistically impossible through genuine impairment and indicates active avoidance of correct responses, which requires knowing the correct answers. A TOMM Trial 2 score below 25 meets Criteria Group A. The Rey 15-Item Memory Test, the Victoria Symptom Validity Test, and the Dot Counting Test all have analogous below-chance thresholds that, when met, satisfy Criteria Group A.
SSI Criteria Group B: neuropsychological evidence. This group includes performance on established neuropsychological tests substantially below what is expected for any genuine neurological disorder (at or below the first percentile across multiple measures), significant discrepancy between self-reported history and documented medical records, and significant discrepancy between current test performance and documented prior performance or functional abilities. Group B criteria are harder to meet than Group A but provide converging evidence when multiple items are satisfied.
Criticisms and modifications. Criticisms of the SSI criteria include the difficulty of operationalising the individual criteria, the circularity risk in the "known or expected" neurological impairment threshold, and the absence of specific cut-off scores for each item. Bianchini, Greve, and Glynn (2005) proposed the Slick-Sherman-Iverson revised model with more precise operationalisations. Larrabee (2008, 2012) has published a series of papers refining the Criteria Group A threshold calculations. The broader critique, articulated most forcefully by Rogers and colleagues, is that the SSI framework is specific to neurocognitive feigning and does not directly apply to feigned psychiatric symptoms, for which the SIRS-2 classification framework is more appropriate.
Cross-jurisdictional use of SSI criteria. The SSI criteria have been explicitly applied in US federal Daubert hearings involving TBI personal-injury claims and in SSDI disability determination proceedings. In Canada, forensic neuropsychological reports using SSI classification have been admitted in both civil and criminal proceedings. UK forensic neuropsychological practice has adopted SSI criteria alongside NAN (2005, 2010) guidance. Indian forensic practice has applied SSI criteria informally in NIMHANS expert reports, with the acknowledgment that the below-chance performance thresholds are valid regardless of normative population because they are based on probability logic, not normative comparison.
*The M-FAST was designed for a specific forensic gap: quick, reliable mental-health-symptom screening in jail and court settings where a full SIRS-2 is impractical.*
The Miller Forensic Assessment of Symptoms Test (M-FAST; Miller, 2001) is a 25-item structured interview that takes 5-10 minutes to administer, designed as a brief screen for feigned mental illness in forensic settings where time and resources make a full SIRS-2 impractical. The M-FAST is not a replacement for the SIRS-2; it is a screen that identifies individuals who require more comprehensive evaluation.
M-FAST structure and validity. The M-FAST produces a total score and seven subscale scores based on different strategies for detecting feigned psychopathology: Reported vs Observed Symptom (RO), Extreme Symptomatology (ES), Rare Combinations (RC), Unusual Hallucinations (UH), Unusual Symptom Course (USC), Negative Image (NI), and Suggestibility (S). The total score threshold for further evaluation is typically set at 6 or above, at which point more comprehensive malingering assessment is indicated. Miller's original validation study demonstrated good sensitivity and specificity relative to SIRS classification in a criminal forensic population.
M-FAST in jail and court settings. The M-FAST was specifically validated in pretrial detainee populations in the US, making it particularly relevant for jail-based forensic screening when a defendant's mental state is queried by the court before formal evaluation has been completed. Nitch et al. (2003) replicated the M-FAST's classification accuracy in a larger forensic inpatient sample. The brief administration time makes the M-FAST practical in court-holding cell settings where the SIRS-2's 30-45 minute administration requirement would be logistically difficult.
Limitations of the M-FAST. The M-FAST's brevity means it captures fewer of the over-reporting strategies than the SIRS-2, and its sensitivity is correspondingly lower in samples where feigning is subtle or partial. The M-FAST also has a higher false-positive rate in genuine severely ill patients who endorse unusual symptom content for clinical rather than strategic reasons, making it less appropriate as a sole basis for a malingering determination. The standard guidance is that M-FAST elevation should trigger full SIRS-2 evaluation, and that the M-FAST total score alone should not be used to make final malingering determinations.
*Rogers's framework is notable for treating response-style assessment as a hypothesis-testing exercise rather than a threshold-crossing exercise.*
Richard Rogers (1984, 2008, 2018) developed the Detection of Deception (DoD) model as a systematic approach to response-style assessment in forensic contexts that goes beyond single-instrument threshold interpretation. The model emphasises the integration of multiple sources of evidence, the explicit consideration of alternative hypotheses, and the probabilistic framing of malingering conclusions.
The DoD decision sequence. The DoD model begins with an assessment of genuine psychopathological status using validated clinical instruments (including the MMPI-2-RF or PAI with validity scales, and structured diagnostic interviews). Only after establishing what the genuine psychopathological picture is, relative to collateral history and clinical presentation, does the model move to response-style assessment. The sequence matters: evaluating response style before evaluating genuine psychopathology is, in Rogers's model, methodologically backward because it risks misclassifying amplification of genuine symptoms as fabrication.
Integration of multiple data sources. The DoD model requires that the response-style conclusion be supported by convergent evidence from at least two of the following: validity-scale elevations on personality instruments (MMPI-2-RF, PAI), structured interview data (SIRS-2, M-FAST), performance validity test data (TOMM, WMT), collateral history inconsistencies, and observed-vs-reported symptom discrepancies. A malingering determination based on a single data source (one elevated validity scale, or one PVT failure) does not meet the DoD standard for a confident classification.
Alternative hypotheses in response-style assessment. The DoD model explicitly requires the clinician to consider and document alternative explanations for response-style indicators before concluding that intentional feigning is the best explanation. The most important alternative hypotheses are: (1) genuine severe psychopathology producing valid-appearing over-reporting indicators (because genuinely severely ill people endorse many unusual symptoms); (2) genuine cognitive impairment producing PVT failure (because cognitively impaired individuals may not understand forced-choice instructions); (3) cultural factors producing systematically unusual item-endorsement patterns that mimic over-reporting; and (4) compliance-motivated distortion (the individual endorses symptoms they have heard described, or endorses whatever the examiner seems to be looking for, without intentional fabrication).
Application in Indian and other non-Western contexts. The DoD model's emphasis on alternative hypotheses is particularly important in cross-cultural forensic assessments. Several Indian researchers, including those at NIMHANS and the Central Institute of Psychiatry in Ranchi, have noted that culturally normative patterns of distress expression in South Asian populations (such as the cultural emphasis on somatic presentation of psychological distress, or norms around expressing suffering in legal-adversarial contexts) can produce score profiles that Western-trained instruments classify as suspicious. The DoD model's hypothesis-testing framework explicitly accommodates this concern by requiring the clinician to rule out cultural explanations before settling on intentional deception as the interpretation.
*A malingering finding that ends one legal proceeding can open another. The forensic psychologist needs to understand what they are setting in motion.*
The legal consequences of a malingering finding differ substantially by jurisdiction and proceeding type. Understanding these consequences is part of the forensic psychologist's responsibility, because an incomplete or poorly communicated malingering finding can have unforeseen legal effects.
Criminal proceedings: mental-state defences. In the US, a malingering finding by the prosecution's expert directly attacks the viability of an insanity or diminished-capacity defence. Under the Federal Insanity Defense Reform Act 1984 (following the Hinckley verdict), the burden is on the defence to establish insanity by clear and convincing evidence. A credible malingering assessment can shift the jury's evaluation of the defence evidence. In England and Wales, if a defendant is assessed as malingering an insanity defence, this may lead the court to treat the expert as not credible on the substantive mental-state question. In India, under BNS 2023 § 22, the burden on establishing the unsound mind defence is on the accused, and a prosecution-commissioned malingering assessment effectively contests that burden. The criminal consequences of the underlying offence mean that a false-positive malingering determination in a criminal case potentially carries the most severe individual consequences of any forensic assessment context.
Civil proceedings: compensation and disability claims. In civil personal-injury litigation in the US, UK, and India, a malingering finding can result in dismissal of the claim, reduction of damages, or adverse costs orders. In the US, Social Security Disability Insurance (SSDI) and Veterans Affairs (VA) proceedings use neuropsychological malingering assessments routinely, with PVT failure affecting benefit determinations. In the UK, Personal Independence Payment (PIP) assessments have incorporated neuropsychological validity testing in complex TBI cases, with PVT findings influencing award decisions. In India, Motor Accident Claims Tribunal (MACT) proceedings increasingly accept neuropsychological expert evidence, and NIMHANS forensic reports addressing response validity have been admitted as expert evidence under BSA 2023 § 39.
Risk assessment and parole. In the US and Canada, malingering of psychiatric symptoms in prison settings can affect both mental-health treatment access and risk assessment outcomes. A defendant who malingers severe psychiatric symptoms to avoid general-population housing may find that the malingering determination, once made, appears in their risk-assessment report and affects parole consideration. In England and Wales, HM Prison and Probation Service guidance includes performance and symptom validity assessment as a component of forensic mental-health services within prison.
| Instrument | Target domain | Format | Key cut-off / decision rule | Primary limitation |
|---|---|---|---|---|
| TOMM | Memory performance validity | Structured forced-choice task, ~15-20 min | Trial 2 score below 45/50 = performance invalidity | Memory-domain only; does not detect feigned psychiatric symptoms |
| SIRS-2 | Feigned psychiatric symptoms | Structured interview, 172 items, ~30-45 min | Scale elevations meeting threshold = feigning; indeterminate zone explicit | Requires trained interviewer; limited cross-cultural validation outside North America and Western Europe |
| M-FAST | Feigned psychiatric symptoms (screen) | Structured interview, 25 items, ~5-10 min | Total score 6 or above = further evaluation indicated | Brief screen only; higher false-positive rate in genuinely severely ill patients |
| SSI criteria | Malingered neurocognitive dysfunction (framework) | Classification framework integrating multiple data sources | Definite MND requires below-chance performance plus intentionality evidence | Operationalisation of individual criteria can be inconsistent across evaluators |
| MMPI-2-RF Fp-r | Feigned psychiatric symptoms (embedded) | Self-report validity scale within MMPI-2-RF | T-score above 80 raises concern; above 90 strongly indicates over-reporting | Requires full MMPI-2-RF administration; contested interpretation at middle elevations |
| PAI NIM + MAL + RDF | Feigned psychopathology (embedded and index) | Validity scales within PAI; MAL and RDF as supplementary indices | NIM T above 73; MAL elevation; RDF composite threshold | NIM may be elevated in genuinely severely disturbed patients |
A defendant in a competency evaluation scores 23 out of 50 on TOMM Trial 2. Which of the following is the most accurate characterisation of this finding under the Slick-Sherman-Iverson criteria?
Test yourself on Forensic Psychology with free, timed mocks.
Practice Forensic Psychology questions