Malingering and Response-Style Detection

The instruments and decision frameworks for detecting feigned mental disorder and cognitive impairment: the TOMM (Test of Memory Malingering) two-trial forced-choice protocol; SIRS-2 (Structured Interview of Reported Symptoms) eight scales for over-reported psychopathology; M-FAST (Miller Forensic Assessment of Symptoms Test) brief screen; the Slick-Sherman-Iverson 1999 criteria for malingered neurocognitive dysfunction (4-level classification); the difference between malingering, factitious disorder, and the Rogers Detection of Deception model.

Last updated: 17 Jun 2026

Malingering is the intentional production of false or grossly exaggerated symptoms for external gain, distinguished from factitious disorder (internal psychological motivation) and from symptom amplification (exaggeration of genuine symptoms). Detection relies on convergent evidence across performance validity tests, structured symptom interviews, and personality-instrument validity scales: no single instrument can support a standalone malingering determination. The principal instruments are the TOMM (memory performance validity), SIRS-2 (feigned psychiatric symptoms), and M-FAST (brief jail-setting screen), interpreted within the Slick-Sherman-Iverson classification framework and Rogers's Detection of Deception model, both of which require at least two independent data sources before a malingering classification is made.

Malingering carries a moral charge that the clinical term "feigning" does not: to malinger is to produce false symptoms deliberately for gain, and a malingering determination in a legal proceeding is an accusation of deliberate deception with potentially severe consequences. A false-positive determination can destroy a genuine claimant's compensation case, leave a genuinely ill defendant without adequate treatment, or undermine a legitimate mental-state defence. The instruments discussed in this topic were built to reduce those errors, not to eliminate clinical judgment. The personality-battery validity scales that complement these instruments are covered in MMPI-2-RF, PAI, and MCMI-IV, and the cognitive-performance validity tests are introduced in intelligence and cognitive assessment.

Key takeaways

The TOMM (Test of Memory Malingering) uses a two-trial forced-choice recognition format; a Trial 2 score below 45/50 indicates performance invalidity, and any score below the 25/50 chance level meets SSI Criteria Group A for Malingered Neurocognitive Dysfunction.
The SIRS-2 (Structured Interview of Reported Symptoms, Second Edition) has eight primary scales targeting different over-reporting strategies; its three-way output (genuine, indeterminate, feigning) is intentional, because the instrument is more powerful for detecting feigning than for confirming genuine responding.
The M-FAST (25-item brief screen, 5-10 minutes) is designed for jail and pretrial settings; a score of 6 or above flags the need for full SIRS-2 evaluation but cannot support a standalone malingering determination.
The Slick-Sherman-Iverson (1999) criteria classify Malingered Neurocognitive Dysfunction into four levels (Definite, Probable, Possible, Not MND) based on convergent evidence; below-chance performance on any forced-choice task automatically meets Criteria Group A regardless of normative population.
Rogers's Detection of Deception model requires establishing the genuine psychopathological picture before assessing response style, and demands convergent evidence from at least two independent data sources before a malingering classification can be made.

The conceptual landscape is more complex than a simple true/false division between genuinely ill and feigning. Richard Rogers's taxonomy, developed in the 1980s and refined through successive editions of his text Clinical Assessment of Malingering and Deception, identifies five possible response styles: honest responding (the baseline we hope for), symptom amplification (exaggerating real symptoms), malingering proper (fabricating or dramatically amplifying symptoms for external gain), defensiveness (denying genuine symptoms), and irrelevant or random responding. Each response style leaves a different footprint on personality instruments, cognitive tests, and structured symptom interviews, and each calls for a different clinical and legal response.

The legal consequences of a malingering determination are jurisdiction-specific and sometimes asymmetric. In the United States, a malingering diagnosis can lead to dismissal of a personal-injury claim, affect parole decisions, and critically undermine a mental-state defence. In England and Wales, Crown Prosecution Service guidance recognises that apparent symptom exaggeration in a defendant with a genuine mental disorder may still not amount to malingering in the clinical sense, because genuine disorders amplify actual symptoms. In India, NIMHANS clinical guidance acknowledges that malingering assessment instruments standardised on Western populations may have altered sensitivity in Indian forensic populations, where cultural norms about expressing distress differ from those that shaped the instrument development.

By the end of this topic you will be able to:

Distinguish malingering, factitious disorder, and symptom amplification using DSM-5 definitions and Rogers's five-category response-style taxonomy.
Apply TOMM cut-off scores to determine performance invalidity and identify when a Trial 2 score meets SSI Criteria Group A for below-chance performance.
Interpret SIRS-2 three-way output (genuine, indeterminate, feigning) in light of the instrument's eight primary scales and its known limitations in non-Western forensic populations.
Apply the Slick-Sherman-Iverson four-level classification (Definite, Probable, Possible, Not MND) using the correct criteria groups and convergent-evidence requirements.
Implement the Rogers Detection of Deception decision sequence, including ruling out alternative hypotheses (genuine severe psychopathology, cognitive impairment, cultural factors) before reaching a malingering conclusion.

Conceptual Framework: Malingering, Factitious Disorder and the Response-Style Spectrum

DSM-5 describes malingering as the intentional production of false or grossly exaggerated physical or psychological symptoms motivated by external incentives such as avoiding military duty, avoiding work, obtaining financial compensation, evading criminal prosecution, or obtaining drugs. Two features distinguish malingering from other forms of symptom misrepresentation. First, the motivation is external: there is a concrete, identifiable gain the person is seeking. Second, the deception is intentional: the person knows they are misrepresenting their condition. DSM-5 lists malingering as a condition that may be a focus of clinical attention (a V-code) rather than a mental disorder, because the behaviour itself is not a symptom of psychopathology.

Factitious disorder (formerly Munchausen syndrome in the ego-syntonic form) differs from malingering in motivation: the person with factitious disorder produces symptoms to assume the sick role, with no obvious external incentive. The motivation is internal psychological need rather than external gain. From a legal standpoint, factitious disorder does not support a criminal defence of insanity, but it may be relevant to mitigation, because the person who produces false symptoms compulsively for psychological reasons is in a different moral and clinical position from the person who calculates that feigning will benefit them financially.

The Rogers response-style model. Rogers (1984, revised 2008) articulated the response-style taxonomy that has organised the field. Genuine responding produces a profile consistent with the person's actual psychological status. Symptom amplification involves exaggerating real symptoms; this is extremely common and does not necessarily meet the threshold for malingering. Fabrication involves creating entirely false symptoms without any genuine psychopathological substrate. In clinical practice, pure fabrication is uncommon; most individuals who malinger in forensic assessments have some genuine psychopathological features and exaggerate them substantially. This mixed picture is the most common finding in forensic assessments and the hardest to classify cleanly.

Partial malingering and the classification challenge. The Slick-Sherman-Iverson (1999) criteria for malingered neurocognitive dysfunction (discussed in Section 4) introduced a four-level classification that explicitly acknowledges partial malingering. The Rogers taxonomy similarly distinguishes between the amplifier (who exaggerates real symptoms) and the malingerer proper (who fabricates symptoms without genuine substrate). For legal purposes, courts and clinicians have sometimes treated any upward response-style distortion as malingering, but this conflation is clinically inaccurate. An expert who uses "malingering" where "symptom amplification" is the more accurate description risks a well-founded challenge from a cross-examiner familiar with the literature.

Rogers five response styles: each style has a distinct motivation, instrument footprint, and required clinical action; amplification of genuine symptoms is not malingering and demands a separate response.

TOMM: Test of Memory Malingering

The Test of Memory Malingering (Tombaugh, 1996) is the most widely validated performance validity test for memory in forensic neuropsychological practice. It takes approximately 15-20 minutes to administer and is designed to detect insufficient effort or deliberate underperformance on a visual recognition memory task.

TOMM structure. The TOMM consists of three trials. In Trial 1, the subject views 50 black-and-white drawings of common objects for 3 seconds each. A recognition trial immediately follows, presenting each studied object paired with a new object in a forced-choice format. Trial 2 repeats the same recognition procedure with the same 50 objects after a second study phase. The optional Retention Trial replicates Trial 2 format 15 minutes later. Because the task involves simple, distinctive visual stimuli, even patients with severe genuine amnesia (from Alzheimer's disease, post-anoxic amnesia, or severe TBI) typically score at or above the cut-off on Trial 2.

TOMM cut-off scores. A score of 45 or below on Trial 2 (out of 50) is the conservative cut-off typically cited in the validation literature for identifying performance invalidity. Tombaugh's original validation study demonstrated that 95% of genuine patients, including those with severe memory impairment, scored 45 or above on Trial 2. Scores below 45, and particularly scores approaching chance (25), indicate that the person is performing systematically worse than someone guessing randomly, which is impossible if the person is attending to the task normally. Several US federal courts in personal-injury and criminal-forensic contexts have admitted TOMM evidence as meeting Daubert reliability standards.

Cross-jurisdictional TOMM validation. The TOMM has been validated in German (Merten et al., 2007), UK (Yanez et al.), Spanish (Alarcon et al.), and several other European and Latin American samples. The Australian ANZAPPL forensic psychology guidelines include the TOMM as a recommended performance validity measure. In India, a validation study using NIMHANS patient populations is not published as of 2025; however, NIMHANS forensic psychology practice has adopted the TOMM as a clinical tool with the acknowledgment that the cut-off scores derived from North American and European samples are applied provisionally, pending Indian normative data.

TOMM limitations. The TOMM assesses performance validity specifically for visual object recognition memory, not for overall cognitive performance validity or symptom validity in the personality domain. A defendant who performs validly on the TOMM may still be exaggerating psychiatric symptoms on a self-report instrument. Conversely, a low TOMM score does not establish malingering of psychiatric symptoms. The TOMM is one element in a comprehensive PVT battery and should be used alongside instruments from different validity domains.

TOMM three-trial structure: Trial 1 and Trial 2 provide the primary validity determinations; the 45-of-50 cut-off on Trial 2 is the validated threshold for performance invalidity.

SIRS-2: Structured Interview of Reported Symptoms

The Structured Interview of Reported Symptoms, Second Edition (Rogers, Bagby, and Dickens, 2009, updating the original 1992 SIRS) is a 172-item structured interview that takes approximately 30-45 minutes to administer. The SIRS-2 is specifically designed to detect over-reported or fabricated psychiatric symptoms through eight primary scales, each targeting a different pattern of symptom misrepresentation.

The eight SIRS-2 primary scales. Each scale was derived from empirical research on how genuine patients and known feigners differ in their symptom reporting patterns. The Rare Symptoms scale (RS) contains items describing symptoms rarely endorsed by genuine patients; elevation suggests fabrication of implausible symptoms. The Symptom Combinations scale (SC) pairs symptoms that rarely co-occur in genuine disorders; endorsement of multiple implausible symptom combinations suggests fabrication of syndrome profiles the person does not genuinely have. The Improbable and Absurd Symptoms scale (IA) contains descriptions of bizarre, physically impossible, or clinically inconceivable symptoms that no genuine patient would endorse; any endorsement is highly suspicious. The Blatant Symptoms scale (BL) contains overtly pathological symptoms that a coached feigner might exaggerate, useful as a contrast to more subtle scales. The Subtle Symptoms scale (SU), Selectivity of Symptoms scale (SEL), Severity of Symptoms scale (SEV), and Reported vs Observed Symptoms scale (RO) each target different over-reporting mechanisms.

SIRS-2 decision rules. The SIRS-2 uses a conservative decision framework with three classifications: genuine, indeterminate, and feigning. The feigning classification requires meeting specific scale-score thresholds that have been validated in known-groups studies. The indeterminate category is intentional: it acknowledges that many individuals fall in a zone where the data are insufficient to make a confident classification either way. Rogers has been explicit that the SIRS-2 should not be used to rule out feigning with confidence; it is more powerful for detecting feigning when it is present than for confirming genuine responding.

SIRS-2 modifications for non-English populations. The SIRS-2 was initially validated in English. Validated Spanish and French translations have been published, and cross-cultural validation studies in European contexts have generally replicated the original scale structure and classification accuracy. For Indian forensic populations, the SIRS-2 has been used in NIMHANS research studies with translated items, but a full normative and validation study in Indian populations has not been published. The cross-cultural challenge for the SIRS-2 is particularly acute for scales like IA (Improbable and Absurd Symptoms), where what counts as culturally plausible versus implausible may vary substantially, and for RS (Rare Symptoms), where the rarity data are derived from North American clinical populations.

SIRS-2 and the credibility of psychopathological claims in criminal proceedings. In the US, SIRS-2 evidence has been admitted under Daubert in numerous federal district courts. The Eighth Circuit in United States v. Johnson (2008) upheld admission of SIRS-based feigning testimony. In Canada, the Supreme Court's approach to expert evidence under R v. Mohan (1994) requires that expert evidence assist the trier of fact and be from a properly qualified expert. SIRS-2 testimony has been admitted in Ontario and British Columbia provincial courts in criminal mental-state proceedings. In England and Wales, SIRS-2 based expert testimony falls under CPR Part 19 and must meet the reliability standard of R v. Atkins and Atkins (EWCA 2009). In India, SIRS-2 based expert opinion would be tendered under BSA 2023 § 39 with full disclosure of the normative limitations.

The Slick-Sherman-Iverson Criteria for Malingered Neurocognitive Dysfunction

Slick, Sherman, and Iverson (1999) published a landmark paper in The Clinical Neuropsychologist proposing specific criteria for the classification of Malingered Neurocognitive Dysfunction (MND). The criteria emerged from recognition that the field lacked a standardised framework for making malingering determinations in neuropsychological practice, leading to inconsistent and sometimes indefensible expert testimony. The SSI criteria remain the most widely cited framework in neuropsychological malingering assessment, though subsequent research has proposed modifications.

The SSI four-level classification. The SSI framework classifies MND into four levels: Definite MND, Probable MND, Possible MND, and Not MND. Definite MND requires unambiguous evidence of the intentional production of false or exaggerated symptoms (such as a direct admission) combined with the absence of genuine neurological disease that could account for the performance pattern. Probable MND requires meeting at least two of the criteria groups (performance below chance on forced-choice measures, or discrepancies between test performance and documented neurological findings, combined with self-reported history inconsistencies or collateral information inconsistencies). Possible MND requires meeting only one criterion group, in the absence of psychopathological factors that might account for performance invalidity.

SSI Criteria Group A: below-chance performance. This is the strongest single criterion: performance below chance (below 50% correct on a two-alternative forced-choice task) is statistically impossible through genuine impairment and indicates active avoidance of correct responses, which requires knowing the correct answers. A TOMM Trial 2 score below 25 meets Criteria Group A. The Rey 15-Item Memory Test, the Victoria Symptom Validity Test, and the Dot Counting Test all have analogous below-chance thresholds that, when met, satisfy Criteria Group A.

SSI Criteria Group B: neuropsychological evidence. This group includes performance on established neuropsychological tests substantially below what is expected for any genuine neurological disorder (at or below the first percentile across multiple measures), significant discrepancy between self-reported history and documented medical records, and significant discrepancy between current test performance and documented prior performance or functional abilities. Group B criteria are harder to meet than Group A but provide converging evidence when multiple items are satisfied.

Criticisms and modifications. Criticisms of the SSI criteria include the difficulty of operationalising the individual criteria, the circularity risk in the "known or expected" neurological impairment threshold, and the absence of specific cut-off scores for each item. Bianchini, Greve, and Glynn (2005) proposed the Slick-Sherman-Iverson revised model with more precise operationalisations. Larrabee (2008, 2012) has published a series of papers refining the Criteria Group A threshold calculations. The broader critique, articulated most forcefully by Rogers and colleagues, is that the SSI framework is specific to neurocognitive feigning and does not directly apply to feigned psychiatric symptoms, for which the SIRS-2 classification framework is more appropriate.

Cross-jurisdictional use of SSI criteria. The SSI criteria have been explicitly applied in US federal Daubert hearings involving TBI personal-injury claims and in SSDI disability determination proceedings. In Canada, forensic neuropsychological reports using SSI classification have been admitted in both civil and criminal proceedings. UK forensic neuropsychological practice has adopted SSI criteria alongside NAN (2005, 2010) guidance. Indian forensic practice has applied SSI criteria informally in NIMHANS expert reports, with the acknowledgment that the below-chance performance thresholds are valid regardless of normative population because they are based on probability logic, not normative comparison.

M-FAST: A Brief Screening Tool for Feigned Psychopathology

The Miller Forensic Assessment of Symptoms Test (M-FAST; Miller, 2001) is a 25-item structured interview that takes 5-10 minutes to administer, designed as a brief screen for feigned mental illness in forensic settings where time and resources make a full SIRS-2 impractical. The M-FAST is not a replacement for the SIRS-2; it is a screen that identifies individuals who require more comprehensive evaluation.

M-FAST structure and validity. The M-FAST produces a total score and seven subscale scores based on different strategies for detecting feigned psychopathology: Reported vs Observed Symptom (RO), Extreme Symptomatology (ES), Rare Combinations (RC), Unusual Hallucinations (UH), Unusual Symptom Course (USC), Negative Image (NI), and Suggestibility (S). The total score threshold for further evaluation is typically set at 6 or above, at which point more comprehensive malingering assessment is indicated. Miller's original validation study demonstrated good sensitivity and specificity relative to SIRS classification in a criminal forensic population.

M-FAST in jail and court settings. The M-FAST was specifically validated in pretrial detainee populations in the US, making it particularly relevant for jail-based forensic screening when a defendant's mental state is queried by the court before formal evaluation has been completed. Nitch et al. (2003) replicated the M-FAST's classification accuracy in a larger forensic inpatient sample. The brief administration time makes the M-FAST practical in court-holding cell settings where the SIRS-2's 30-45 minute administration requirement would be logistically difficult.

Limitations of the M-FAST. The M-FAST's brevity means it captures fewer of the over-reporting strategies than the SIRS-2, and its sensitivity is correspondingly lower in samples where feigning is subtle or partial. The M-FAST also has a higher false-positive rate in genuine severely ill patients who endorse unusual symptom content for clinical rather than strategic reasons, making it less appropriate as a sole basis for a malingering determination. The standard guidance is that M-FAST elevation should trigger full SIRS-2 evaluation, and that the M-FAST total score alone should not be used to make final malingering determinations.

The Rogers Detection of Deception Model

Richard Rogers (1984, 2008, 2018) developed the Detection of Deception (DoD) model as a systematic approach to response-style assessment in forensic contexts that goes beyond single-instrument threshold interpretation. The model emphasises the integration of multiple sources of evidence, the explicit consideration of alternative hypotheses, and the probabilistic framing of malingering conclusions.

The DoD decision sequence. The DoD model begins with an assessment of genuine psychopathological status using validated clinical instruments (including the MMPI-2-RF or PAI with validity scales, and structured diagnostic interviews). Only after establishing what the genuine psychopathological picture is, relative to collateral history and clinical presentation, does the model move to response-style assessment. The sequence matters: evaluating response style before evaluating genuine psychopathology is, in Rogers's model, methodologically backward because it risks misclassifying amplification of genuine symptoms as fabrication.

Integration of multiple data sources. The DoD model requires that the response-style conclusion be supported by convergent evidence from at least two of the following: validity-scale elevations on personality instruments (MMPI-2-RF, PAI), structured interview data (SIRS-2, M-FAST), performance validity test data (TOMM, WMT), collateral history inconsistencies, and observed-vs-reported symptom discrepancies. A malingering determination based on a single data source (one elevated validity scale, or one PVT failure) does not meet the DoD standard for a confident classification.

Alternative hypotheses in response-style assessment. The DoD model explicitly requires the clinician to consider and document alternative explanations for response-style indicators before concluding that intentional feigning is the best explanation. The most important alternative hypotheses are: (1) genuine severe psychopathology producing valid-appearing over-reporting indicators (because genuinely severely ill people endorse many unusual symptoms); (2) genuine cognitive impairment producing PVT failure (because cognitively impaired individuals may not understand forced-choice instructions); (3) cultural factors producing systematically unusual item-endorsement patterns that mimic over-reporting; and (4) compliance-motivated distortion (the individual endorses symptoms they have heard described, or endorses whatever the examiner seems to be looking for, without intentional fabrication).

Application in Indian and other non-Western contexts. The DoD model's emphasis on alternative hypotheses is particularly important in cross-cultural forensic assessments. Several Indian researchers, including those at NIMHANS and the Central Institute of Psychiatry in Ranchi, have noted that culturally normative patterns of distress expression in South Asian populations (such as the cultural emphasis on somatic presentation of psychological distress, or norms around expressing suffering in legal-adversarial contexts) can produce score profiles that Western-trained instruments classify as suspicious. The DoD model's hypothesis-testing framework explicitly accommodates this concern by requiring the clinician to rule out cultural explanations before settling on intentional deception as the interpretation.

Legal Consequences of Malingering Findings Across Jurisdictions

The legal consequences of a malingering finding differ substantially by jurisdiction and proceeding type. Understanding these consequences is part of the forensic psychologist's responsibility, since an incomplete or poorly communicated malingering finding can have unforeseen legal effects.

Criminal proceedings: mental-state defences. In the US, a malingering finding by the prosecution's expert directly attacks the viability of an insanity or diminished-capacity defence. Under the Insanity Defense Reform Act of 1984 (following the Hinckley verdict), the burden is on the defence to establish insanity by clear and convincing evidence. A credible malingering assessment can shift the jury's evaluation of the defence evidence. In England and Wales, if a defendant is assessed as malingering an insanity defence, this may lead the court to treat the expert as not credible on the substantive mental-state question. In India, under BNS 2023 § 22, the burden on establishing the unsound mind defence is on the accused, and a prosecution-commissioned malingering assessment effectively contests that burden. The criminal consequences of the underlying offence mean that a false-positive malingering determination in a criminal case potentially carries the most severe individual consequences of any forensic assessment context.

Civil proceedings: compensation and disability claims. In civil personal-injury litigation in the US, UK, and India, a malingering finding can result in dismissal of the claim, reduction of damages, or adverse costs orders. In the US, Social Security Disability Insurance (SSDI) and Veterans Affairs (VA) proceedings use neuropsychological malingering assessments routinely, with PVT failure affecting benefit determinations. In the UK, Personal Independence Payment (PIP) assessments have incorporated neuropsychological validity testing in complex TBI cases, with PVT findings influencing award decisions. In India, Motor Accident Claims Tribunal (MACT) proceedings increasingly accept neuropsychological expert evidence, and NIMHANS forensic reports addressing response validity have been admitted as expert evidence under BSA 2023 § 39.

Risk assessment and parole. In the US and Canada, malingering of psychiatric symptoms in prison settings can affect both mental-health treatment access and risk assessment outcomes. A defendant who malingers severe psychiatric symptoms to avoid general-population housing may find that the malingering determination, once made, appears in their risk-assessment report and affects parole consideration. In England and Wales, HM Prison and Probation Service guidance includes performance and symptom validity assessment as a component of forensic mental-health services within prison.

Worked example

SIRS-2 malingering detection in a disability compensation evaluation

A claimant seeking PTSD-based disability compensation scores extremely high on symptom severity. The evaluator suspects feigning. How is this investigated?

Scene: Michael Boateng, 42, a former paramedic, claims total disability on the basis of severe PTSD following a mass-casualty incident. He reports experiencing daily flashbacks, complete inability to work, significant memory impairment, and hearing voices. His self-report on the PTSD Checklist scores in the extreme range. The disability insurer commissions an independent forensic evaluation.

Step 1: The evaluator notes three clinical features inconsistent with the claimed severity: Michael arrives at the assessment centre independently, navigates a complex multi-story car park without difficulty (inconsistent with the reported memory impairment), and engages in animated conversation about his football team before the formal assessment begins. These inconsistencies raise the possibility of symptom amplification. The evaluator commences a systematic response-style assessment.

Step 2: The SIRS-2 (Structured Interview of Reported Symptoms-2, Rogers et al. 2011) is administered. Michael scores in the Feigning range on three of the eight primary scales: Rare Symptoms (RS: endorsing highly improbable symptoms), Symptom Combinations (SC: pairing symptoms that rarely co-occur in genuine disorders), and Selectivity of Symptoms (SEL: reporting implausible selectivity in which symptoms he experiences). The MMPI-2-RF is also administered: scales F, Fp, and the RBS (Response Bias Scale) are all elevated at or above T-90, consistent with over-reporting of psychopathology.

Step 3: The evaluator applies the Rogers (2008) detection strategy framework. Rare symptoms: Michael endorses seven symptoms marked in the SIRS-2 as occurring in fewer than 5 percent of genuine PTSD cases. Symptom combinations: he reports simultaneous total amnesia for the traumatic event and vivid daily flashbacks, which are mechanistically inconsistent (amnesia and intrusive recall involve different memory encoding processes). Floor-level performance: on the TOMM, Michael scores 28/50 on trial 2, below the chance level of 25, consistent with a response strategy aimed at appearing impaired.

Conclusion: The evaluator concludes that Michael's presentation is characterised by probable feigning of symptoms across multiple independent detection strategies. The disability claim is denied. The report notes that this conclusion does not establish that Michael experienced no traumatic event or no genuine distress; it establishes only that the severity he reported is inconsistent with the patterns expected in genuine PTSD. This distinction, required by both the APA Specialty Guidelines and BPS practice guidance, is essential: a finding of malingering is a response-style conclusion, not a diagnostic contradiction of the claimed trauma.

Key terms

Malingering: Intentional production of false or grossly exaggerated symptoms motivated by external incentives; a DSM-5 V-code (not a mental disorder) applicable when incentive and intentionality are both present.
Response style: The systematic approach an individual uses in reporting symptoms and behaviour during assessment; includes genuine, amplification, fabrication, defensiveness, and random responding.
TOMM: Test of Memory Malingering; 50-item forced-choice visual recognition task; Trial 2 score below 45/50 indicates performance invalidity; most widely validated memory PVT.
SIRS-2: Structured Interview of Reported Symptoms, Second Edition; 172-item structured interview with eight primary scales for detecting feigned psychiatric symptoms; yields genuine, indeterminate, or feigning classification.
M-FAST: Miller Forensic Assessment of Symptoms Test; 25-item brief screen for feigned psychopathology; designed for jail and pretrial settings where full SIRS-2 is impractical.
Slick-Sherman-Iverson criteria: 1999 classification framework for Malingered Neurocognitive Dysfunction; four levels (Definite, Probable, Possible, Not MND) based on convergent evidence criteria.
Below-chance performance: Performance on a two-alternative forced-choice task below the 50% chance level; statistically impossible through genuine impairment; meets SSI Criteria Group A for definite MND.
Rogers Detection of Deception model: Multi-source, hypothesis-testing framework for response-style assessment; requires convergent evidence from at least two data sources and explicit consideration of alternative hypotheses before malingering classification.
Performance validity test (PVT): Any test designed to detect inadequate effort or deliberate underperformance in cognitive assessment; TOMM is the leading example for memory; embedded PVTs derive validity indices from standard neuropsychological measures.
Factitious disorder: Intentional production of symptoms motivated by psychological need to assume the sick role, without external incentive; distinct from malingering in motivation; not a basis for insanity defence.
Symptom amplification: Exaggeration of genuine symptoms beyond their actual severity; common and does not automatically meet the malingering threshold; requires integration with all assessment data before classification.
MAL (Malingering Index): PAI configuration-based supplementary index developed for forensic populations to detect feigned psychopathology; used alongside NIM and clinical scales.

Instrument	Target domain	Format	Key cut-off / decision rule	Primary limitation
TOMM	Memory performance validity	Structured forced-choice task, ~15-20 min	Trial 2 score below 45/50 = performance invalidity	Memory-domain only; does not detect feigned psychiatric symptoms
SIRS-2	Feigned psychiatric symptoms	Structured interview, 172 items, ~30-45 min	Scale elevations meeting threshold = feigning; indeterminate zone explicit	Requires trained interviewer; limited cross-cultural validation outside North America and Western Europe
M-FAST	Feigned psychiatric symptoms (screen)	Structured interview, 25 items, ~5-10 min	Total score 6 or above = further evaluation indicated	Brief screen only; higher false-positive rate in genuinely severely ill patients
SSI criteria	Malingered neurocognitive dysfunction (framework)	Classification framework integrating multiple data sources	Definite MND requires below-chance performance plus intentionality evidence	Operationalisation of individual criteria can be inconsistent across evaluators
MMPI-2-RF Fp-r	Feigned psychiatric symptoms (embedded)	Self-report validity scale within MMPI-2-RF	T-score above 80 raises concern; above 90 strongly indicates over-reporting	Requires full MMPI-2-RF administration; contested interpretation at middle elevations
PAI NIM + MAL + RDF	Feigned psychopathology (embedded and index)	Validity scales within PAI; MAL and RDF as supplementary indices	NIM T above 73; MAL elevation; RDF composite threshold	NIM may be elevated in genuinely severely disturbed patients

What is the difference between a performance validity test and a symptom validity test?

Performance validity tests (PVTs) measure whether the individual was putting in adequate effort during cognitive testing. They detect underperformance on tasks of ability. Symptom validity tests (SVTs) measure whether the individual's symptom report on self-report or interview measures is credible. The TOMM is a PVT. The SIRS-2 is an SVT. A person can pass PVTs while exaggerating psychiatric symptoms, or fail PVTs while reporting psychiatric symptoms accurately. Both domains need to be assessed in a comprehensive forensic evaluation, and findings in one domain do not automatically generalise to the other. The cognitive PVT context is covered in [intelligence and cognitive assessment](/topics/forensic-psychology/intelligence-and-cognitive-assessment-in-forensic-context); personality validity scales in [MMPI-2-RF, PAI, and MCMI-IV](/topics/forensic-psychology/mmpi-2-rf-pai-and-mcmi-personality-batteries).

Can the M-FAST alone support a malingering determination in a criminal proceeding?

No. The M-FAST is a brief screen designed to identify individuals who warrant more comprehensive evaluation. It has known false-positive rates in genuinely severely ill populations, and its 25-item structure captures only a fraction of the over-reporting strategies assessed by the full SIRS-2. A malingering determination in any legal proceeding requires convergent evidence from multiple sources, consistent with the Rogers Detection of Deception model. M-FAST elevation should trigger SIRS-2 administration and integration with personality test validity scale data before any classification is made.

What alternative explanations must be ruled out before a below-chance TOMM score can be attributed to deliberate malingering?

Below-chance performance on a two-alternative forced-choice task like the TOMM (scores below 25 on a 50-item trial) is statistically impossible through genuine impairment alone, because it requires knowing the correct answers in order to select the wrong ones. However, the following must be ruled out: failure to understand the forced-choice instructions (possible in severe intellectual disability or acute psychosis); task completion in a way that systematically reverses choices due to confusion about the task structure; genuine severe dementia-related disruption of the forced-choice strategy. These alternatives are typically ruled out through clinical observation during testing, through cognitive status assessment, and through interview data. In practice, below-chance TOMM performance is the single strongest evidence of performance invalidity in the SSI framework precisely because its alternative explanations are so limited.

In an Indian forensic assessment, how should the psychologist address cultural norms about expressing distress that may inflate over-reporting indicators?

This is a genuine concern documented in the cross-cultural psychopathology literature. The appropriate response is to: (1) acknowledge the cultural context explicitly in the report; (2) use the Rogers Detection of Deception hypothesis-testing framework and list cultural expression as an explicit alternative hypothesis; (3) integrate information from collateral sources who know the individual outside the assessment setting, where cultural expression patterns are observed rather than measured; (4) use multiple instruments from different domains rather than relying on a single indicator; and (5) note any published cross-cultural validity data (or the absence thereof) for each instrument used. This approach maintains the integrity of the malingering assessment while explicitly addressing the cultural validity threat. The broader framework for evaluating cross-cultural instrument use is set out in [forensic assessment and test validity](/topics/forensic-psychology/foundations-of-forensic-assessment-and-test-validity).

How should a forensic report communicate a malingering finding without overstepping clinical expertise boundaries?

The report should state the specific classification (for example, Probable Malingered Neurocognitive Dysfunction under SSI criteria) and the evidence supporting it (the specific PVT failures, the SIRS-2 or MMPI-2-RF findings, the collateral inconsistencies). It should not characterise the individual as dishonest, fraudulent, or criminal. It should not predict what the legal consequences of the classification should be. It should explicitly state the alternative hypotheses that were considered and ruled out, and it should state the limitations of the classification including the base rate in the relevant population and the instrument-specific false-positive rates. This approach provides the court with the clinical finding and its basis, leaving the legal inference to the trier of fact.

Practice

Question 1 of 5· 0 answered

A defendant in a competency evaluation scores 23 out of 50 on TOMM Trial 2. Which of the following is the most accurate characterisation of this finding under the Slick-Sherman-Iverson criteria?

Test yourself on Forensic Psychology with free, timed mocks.

Practice Forensic Psychology questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.