Sampling Techniques in Fraud Audits

Sampling allows auditors to draw reliable conclusions about large transaction populations without examining every record, but fraud audits require a different selection logic than routine compliance audits. This topic covers attribute sampling, monetary unit sampling, stratified sampling, and the targeted, risk-directed approaches used when fraud signals have already been identified.

Last updated: 24 Jun 2026

Sampling techniques allow auditors to examine a portion of a transaction population and draw conclusions about the whole, but fraud audits impose a different logic than routine statutory audits. In a statutory audit, representative random sampling is appropriate because the auditor aims to form an opinion on the financial statements as a whole. In a fraud audit, fraud is rarely distributed evenly across a population: it clusters around specific people, processes, time windows, or control weaknesses. Statistical sampling methods, including attribute sampling, monetary unit sampling (MUS), and stratified sampling, provide the mathematical framework for projecting sample results to the population and quantifying confidence. Targeted, risk-directed selection methods layer on top of that framework to concentrate audit effort where fraud signals have already appeared.

The Association of Certified Fraud Examiners (ACFE) and audit standard-setters including the Public Company Accounting Oversight Board (PCAOB) in the United States, the Financial Reporting Council (FRC) in the United Kingdom, and the Institute of Chartered Accountants of India (ICAI) all acknowledge that the auditor's objective in a fraud examination differs from the objective in a routine audit. The sampling plan must reflect that difference. A plan designed to detect material misstatement at a 5% risk threshold will not necessarily surface a fraud that affects 0.3% of transactions by count but 18% of transactions by value.

Modern fraud audits combine classical sampling theory with data analytics. The analytics layer processes the entire population to surface anomalies and risk indicators; the sampling layer then selects transactions for detailed examination. Understanding how to design, execute, and document a sampling plan is a core competency for fraud examiners, forensic accountants, and internal auditors working in any jurisdiction.

By the end of this topic you will be able to:

Distinguish attribute sampling, monetary unit sampling, and stratified sampling, and select the appropriate method for a given fraud audit scenario.
Calculate a sample size for attribute sampling given a confidence level, expected deviation rate, and tolerable deviation rate.
Explain why monetary unit sampling overweights large transactions and why that property is useful in fraud examinations.
Design a stratified sampling plan that concentrates effort on high-risk transaction strata without inflating total sample size.
Describe how risk-directed judgmental selection supplements statistical sampling when fraud has been indicated in a specific transaction stream.

Key terms

Attribute sampling: A statistical sampling method that tests for the presence or absence of a specific characteristic in a population item, such as an authorised signature or a matched purchase order. Results are expressed as an estimated deviation rate with a stated confidence level.
Monetary unit sampling (MUS): A probability-proportional-to-size method that treats each currency unit in the population as a sampling unit. Larger transactions have a higher probability of selection. Results are expressed as an upper error limit in currency, making them easier to evaluate against a materiality threshold.
Stratified sampling: A sampling design that divides the population into homogeneous subgroups (strata) and samples each stratum separately. Allows the auditor to apply higher sampling intensity to high-risk strata and lower intensity to low-risk strata, improving efficiency without sacrificing coverage.
Tolerable deviation rate (TDR): The maximum rate of control deviations the auditor is willing to accept before concluding that a control cannot be relied upon. Setting TDR too high understates risk; setting it too low inflates sample size unnecessarily.
Risk-directed selection: A judgmental sampling approach in which items are chosen because they exhibit specific risk indicators: unusual amounts, unusual payees, bypass of normal approval workflows, or patterns identified by data analytics. Not statistically projectable but essential when fraud is suspected in a defined transaction stream.
Confidence level: The probability that the sample result falls within the auditor's acceptable error margin. A 95% confidence level means the auditor accepts a 5% risk of concluding that a control is effective when it is not (alpha risk, or risk of incorrect acceptance).

Why sampling logic differs in fraud audits

A routine financial statement audit uses sampling to form a reasonable opinion on the accuracy of reported figures. Audit standards, such as ISA 530 (International Standard on Auditing on Audit Sampling) adopted across more than 120 countries, permit the auditor to accept some level of undetected misstatement as long as it is below the materiality threshold and the sampling approach is statistically sound. The implicit assumption is that errors are distributed throughout the population in a broadly representative way.

Fraud breaks that assumption. A payroll fraud affecting a single ghost employee may represent less than 1% of total transactions but nearly 100% of a specific cost centre. A procurement fraud built on a single shell vendor may affect a handful of invoices that are individually below the approval threshold that would trigger management review. In both cases, a representative random sample from the full population is likely to miss the fraud entirely, not because the sample is too small, but because the fraud is concentrated in a subpopulation that the random draw is unlikely to hit.

Fraud auditing standards and practitioner guidance, including the ACFE Fraud Examiners Manual and the UK Fraud Advisory Panel guidance, consistently hold that the sampling plan must incorporate the auditor's assessment of where fraud is likely to be concealed. This does not mean abandoning statistical rigour. It means treating statistical sampling and risk-directed selection as complementary tools: statistical sampling provides a defensible basis for population-level conclusions, while risk-directed selection ensures that the highest-risk items receive scrutiny regardless of whether chance places them in the random draw.

Attribute sampling: testing controls and deviations

Attribute sampling is the standard method for testing whether internal controls are operating as intended. The auditor defines one attribute (for example, every payment over a specified threshold must bear two authorised signatures), selects a random sample from the population of transactions to which the control applies, and counts the number of exceptions. The result is projected to the population as an estimated deviation rate with a stated confidence interval.

Three parameters drive the sample size calculation. The confidence level (typically 90% or 95% in fraud contexts, versus 80% or 85% in routine audits) reflects the auditor's risk tolerance. The expected deviation rate is the auditor's prior estimate of how often the control fails; a higher expected rate requires a larger sample. The tolerable deviation rate (TDR) is the maximum failure rate the auditor will accept before treating the control as unreliable. The sample size formula, or the equivalent lookup in standard sampling tables, produces the minimum number of items that, if no more than TDR exceptions are found, allows the auditor to conclude at the stated confidence level that the true population deviation rate is at or below TDR.

Parameter	Routine audit setting	Fraud audit setting
Confidence level	80% to 90%	90% to 99%
Expected deviation rate	1% to 3%	0% to 5% depending on prior signals
Tolerable deviation rate	5% to 10%	2% to 5% (lower threshold for fraud risk)
Resulting sample size	Smaller	Larger (due to higher confidence and lower TDR)
Selection method	Random	Random plus targeted risk-directed additions

In fraud audits, attribute sampling is most useful for procurement controls (dual-approval, vendor matching), payroll controls (new-hire authorisation, bank account change verification), and expense reimbursement controls (receipt requirements, manager approval). When the attribute test reveals a deviation rate above TDR, the auditor cannot rely on the control and must expand examination of the transactions that fall outside the control, using risk-directed selection to investigate the exceptions in detail.

Monetary unit sampling: probability proportional to size

Monetary unit sampling (MUS) is a probability-proportional-to-size method: each currency unit in the population is a potential sampling unit, and a transaction is selected if any of its currency units falls in the sample. A transaction worth 50,000 dollars is 50,000 times more likely to be selected than a transaction worth one dollar. This built-in overweighting of large transactions is not a bias; it is a design feature that aligns the sample with the auditor's concern about financial impact.

The MUS sample size is determined by dividing the population value by a sampling interval. The interval is calculated from the tolerable misstatement (the maximum error in currency the auditor can accept) and the confidence factor (a value derived from the Poisson distribution for the stated confidence level). For example, at 95% confidence with zero expected errors, the confidence factor is approximately 3. If the tolerable misstatement is 500,000 dollars, the sampling interval is 500,000 divided by 3, approximately 167,000 dollars. The auditor then selects every transaction whose cumulative monetary value crosses a multiple of the interval.

In fraud audits, MUS is particularly suited to accounts payable and revenue testing, where individual transaction values vary widely. A procurement fraud built on inflated invoices will produce transactions with values systematically higher than legitimate transactions, increasing their probability of selection in a MUS draw. A fraud built on many small transactions below an approval threshold, however, will be systematically underrepresented in a MUS sample, which is precisely why MUS must be supplemented by a separate targeted sample of small-value transactions when that fraud pattern is indicated.

Stratified sampling: concentrating effort by risk

Stratified sampling divides the population into subgroups before sampling. Each stratum is sampled independently, and the results can be combined into a population-level estimate or kept separate if the strata are to be assessed individually. The primary advantage is flexibility: the auditor can apply a high sampling intensity (small interval, large sample fraction) to the stratum carrying the most fraud risk and a lower intensity to lower-risk strata, keeping the total sample size manageable.

Common stratification bases in fraud audits include transaction size (top 5% by value, middle 45%, bottom 50%), vendor or payee category (related-party vendors, sole-source vendors, new vendors registered in the last 12 months), geographic location (regions with known control weaknesses), and approval pathway (items approved by a specific individual, items that bypassed standard workflows). The stratification variables should be chosen based on the fraud risk assessment rather than convenience.

In practice, many fraud auditors apply 100% examination to the highest-risk stratum when it is small enough to be practical. The 2020 ACFE Report to the Nations found that asset misappropriation schemes are the most frequent fraud type globally, and within that category, billing schemes and expense reimbursement schemes account for a large share. Both typically involve identifiable characteristics (unusual payee names, round-number amounts, missing supporting documents) that make stratification on those characteristics a direct detection strategy rather than a statistical exercise.

Risk-directed and judgmental selection

Judgmental sampling is not the same as arbitrary selection. When fraud auditors use judgmental selection, they are applying structured professional judgment based on explicit risk criteria. Data analytics tools, including Benford's Law analysis, duplicate payment detection, gap analysis in sequence-numbered documents, and velocity analysis of payee relationships, generate specific transaction flags. Each flagged item is a candidate for judgmental inclusion in the sample regardless of whether random selection would have reached it.

Judgmental selection must be documented. The auditor records the risk criteria applied, the number of items flagged, the number examined, and the results. Without documentation, a selection that looks targeted and purposeful in the working papers can appear arbitrary to a court, a regulator, or a peer reviewer. In jurisdictions where the fraud examiner's report may be used in evidence, including under the Bharatiya Sakshya Adhiniyam 2023 in India, the US Federal Rules of Evidence, and the UK Civil Procedure Rules, the methodology of selection is subject to scrutiny and must be defensible.

A combined sampling plan, used in major fraud investigations in both the public and private sectors, typically has three layers. The first layer is a statistical random sample of the general population, producing a population-level projection. The second layer is a stratified oversample of high-risk strata, producing stratum-level assessments. The third layer is a targeted judgmental sample of items flagged by analytics or informant information, producing case-specific evidence. The three layers address different evidentiary needs and should not be conflated in the working papers.

Sample size, documentation, and court-ready evidence

Sample size decisions in fraud audits must be documented with the same rigour as the examination itself. The working paper for each sampling plan should state the population definition and size, the sampling unit, the method of selection, the parameters used (confidence level, TDR or tolerable misstatement, expected rate or error), the resulting sample size, the actual items selected (with enough detail to reconstruct the selection), the examination results, and the conclusion or projection.

Statistical sampling projections require care in communication. An attribute sample result stating that the estimated deviation rate is 4.2% with a 95% confidence interval of 2.8% to 6.1% means different things to a forensic accountant, a corporate audit committee, a regulatory investigator, and a judge. The fraud examiner's report must translate the statistical conclusion into plain language that explains the practical significance without overstating the certainty. Courts in the United States, United Kingdom, and increasingly in India have accepted statistically derived sampling evidence, but expert witnesses have been challenged on the basis of sample size adequacy, selection bias, and projection methodology.

Professional standards across jurisdictions set minimum documentation requirements. In the US, PCAOB AS 2315 (Audit Sampling) and the AICPA's AU-C Section 530 require that the auditor document the basis for the sampling plan and the results. In the EU, the equivalent is ISA 530 as adopted by member-state audit regulators. In India, the ICAI's Standard on Auditing 530 mirrors ISA 530. Forensic engagements that may result in litigation or regulatory proceedings should apply the most stringent of the applicable standards.

Worked example

Designing a sampling plan for a procurement fraud investigation

A regional public works department is suspected of awarding contracts to shell vendors connected to a procurement officer. The population is 8,400 vendor invoices totalling 42 million dollars over three years. This example walks through a three-layer sampling plan.

The fraud risk assessment identified three indicators: a cluster of new vendors registered in the six months before a key contract cycle, several invoices with round-number amounts just below the manager-approval threshold, and one vendor whose bank account changed twice in 18 months. The sampling plan was designed to address each indicator while also covering the general population.

Layer 1: Attribute sample on the dual-approval control. The population is all 8,400 invoices. At 95% confidence, expected deviation rate 2%, tolerable deviation rate 5%, the sample size table yields 181 items. Items are selected by systematic random sampling (random start, fixed interval). This tests whether the dual-approval control was consistently applied across the full population.
Layer 2: Stratified oversample of new vendors. Data analytics identified 63 invoices from vendors registered within six months of the contract cycle, totalling 1.1 million dollars. All 63 are examined (100% of the stratum). This stratum is too small and too high-risk for sampling; full examination is more efficient and more defensible.
Layer 3: Targeted sample of sub-threshold round-number invoices. Analytics flagged 214 invoices with amounts between 9,500 and 9,999 dollars (the approval threshold was 10,000 dollars) with round-hundred amounts. Sixty items are selected judgmentally, prioritising vendors with more than two such invoices in the period and invoices lacking a verifiable purchase order number.
Execution: each sample item is examined for the presence of a legitimate purchase order, a delivery confirmation, a signed invoice, and a valid vendor registration. Items from the new-vendor stratum are cross-referenced against the company registration database and the procurement officer's personal association disclosures.
Results and reporting: the attribute sample found 9 deviations in 181 items (4.9%), just below TDR but above the expected rate, indicating the dual-approval control is marginally unreliable. The new-vendor stratum examination found 11 invoices with no valid purchase order and 3 whose bank accounts matched the procurement officer's personal details. The sub-threshold judgmental sample found 22 items with no delivery confirmation. The three layers are reported separately: the statistical layer supports a population projection; the stratum and judgmental layers support specific case findings.

Check your understanding

Question 1 of 4· 0 answered

An auditor sets a 95% confidence level, a 2% expected deviation rate, and a 5% tolerable deviation rate for an attribute sample. Compared to the same plan at 90% confidence, what effect does raising the confidence level to 95% have on sample size?

Key Takeaways

Fraud concentrates in specific transaction streams, so a sampling plan designed for representative coverage of the full population will often miss it. Fraud audit sampling combines statistical methods with risk-directed selection to address both population-level assessment and targeted investigation.
Attribute sampling tests whether a control characteristic is present or absent and projects a deviation rate to the population with a stated confidence level. It is the standard method for assessing whether internal controls such as dual-approval or vendor matching are operating effectively.
Monetary unit sampling overweights large transactions by using the currency unit as the sampling unit. It is well-suited to procurement and accounts payable testing where inflated invoice fraud is suspected, but it structurally undersamples small transactions and must be supplemented when structuring or sub-threshold fraud is indicated.
Stratified sampling allows the auditor to apply 100% or near-100% examination to high-risk strata (new vendors, related-party transactions, sub-threshold amounts) while using a lower sampling fraction for lower-risk strata, keeping total sample size manageable and audit effort focused.
Statistical and judgmental samples must be reported separately. Projecting a combined statistical-plus-judgmental sample to the population is methodologically invalid and legally vulnerable. Judgmental items are documented and reported as individual findings, not projected.

What is the difference between statistical and judgmental sampling in fraud audits?

Statistical sampling uses probability-based selection so that every item in the population has a known chance of inclusion, allowing auditors to project results to the whole population with a quantified confidence level. Judgmental sampling uses the auditor's professional judgment to select items believed most likely to contain errors or fraud. In fraud audits, both methods are used: statistical sampling to assess the overall population and judgmental sampling to investigate specific high-risk items flagged by data analytics or informant tips.

What is monetary unit sampling and why is it used in fraud audits?

Monetary unit sampling (MUS) treats every dollar (or rupee, or pound) in the population as a sampling unit rather than every transaction. This means larger transactions have a proportionally higher probability of selection, which aligns well with fraud audits because large transactions often carry larger risk and larger potential loss. MUS produces results expressed as an upper error limit in currency rather than a percentage rate, making it easier to communicate financial materiality to non-technical stakeholders.

What is attribute sampling and how does it apply to fraud detection?

Attribute sampling tests for the presence or absence of a specific characteristic in a population, such as whether a purchase order has an authorised signature or whether a payee appears on the approved vendor list. The auditor selects a sample, counts the number of exceptions, and projects a rate of deviation across the whole population with a stated confidence level. In fraud audits, attribute sampling is useful for testing whether internal control procedures are being bypassed systematically, which is a common pattern in procurement and payroll fraud.

Why does fraud auditing require targeted rather than purely random sampling?

Random sampling is designed to be representative, but fraud is rarely evenly distributed. Fraudsters exploit specific weaknesses: a particular approval threshold, a single vendor relationship, a specific time window when a supervisor is absent. Purely random sampling may miss concentrated fraud by diluting it across a large population. Fraud auditors use risk indicators, data analytics results, and informant information to direct sampling toward the transaction streams where fraud is most likely, supplementing random sampling with targeted selection of high-risk items.

What is stratified sampling and when is it preferred in fraud audits?

Stratified sampling divides the population into subgroups (strata) with shared characteristics, such as transaction size bands or vendor categories, and then samples each stratum separately. This allows the auditor to apply higher sampling intensity to the strata with the greatest fraud risk without inflating the overall sample size unnecessarily. It is preferred when the population is heterogeneous and risk is concentrated in identifiable subgroups, which is the common situation in procurement fraud, expense reimbursement fraud, and payment diversion schemes.

Test yourself on Forensic Auditing and Fraud Examination with free, timed mocks.

Practice Forensic Auditing and Fraud Examination questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.