PRNU Extraction, Matching, and the PCE Statistic

PRNU camera attribution requires a denoising step to isolate the sensor residual, averaging many residuals to build a reference pattern, and then computing the Peak-to-Correlation Energy statistic to decide whether a query image came from a specific camera.

Last updated: 19 Jun 2026

PRNU extraction works by applying a denoising filter to each image and subtracting the result from the original pixel data, leaving a noise residual dominated by the sensor's fixed spatial pattern. Building a reliable camera reference requires averaging residuals from many images so that random shot noise cancels while the PRNU accumulates. Attribution decisions are made by computing the Peak-to-Correlation Energy (PCE) statistic between a query residual and a camera reference, then comparing that value against a threshold calibrated for a defined false-positive rate. The reliability of the result depends on image resolution, JPEG quality, and the number of reference images available.

Every digital photograph carries the camera sensor's PRNU fingerprint mixed into its noise floor, but you cannot see it. The scene content, quantisation from JPEG compression, and random shot noise all drown it out. Extracting PRNU from an image is an exercise in separation: use a denoising filter to strip away the part that looks like an image, and what remains is dominated by the fixed spatial pattern you care about.

The practical problem has two parts. First, estimating a reference PRNU pattern for a camera you want to fingerprint: take a set of images from that camera, denoise each one, and average the residuals so that random noise cancels and the stable PRNU accumulates. Second, testing a query image of unknown origin: denoise it, compare its residual to the reference using a correlation statistic, and decide whether the result is high enough to attribute the image to that camera.

This topic covers both parts in detail: the denoising options, the normalised cross-correlation and its PCE extension, the statistical hypothesis-testing framework, and the practical variables (image count, JPEG quality, resolution) that govern how reliable the result can be.

By the end of this topic you will be able to:

Describe the denoising-and-subtraction procedure used to isolate a PRNU noise residual from a single image.
Explain how averaging multiple residuals improves reference pattern quality and why signal-to-noise ratio grows as the square root of image count.
Calculate or interpret a PCE value, and explain why PCE is preferred over raw normalised cross-correlation for attribution decisions.
Apply the hypothesis-testing framework to a camera attribution scenario, including threshold calibration and interpretation of false-positive and false-negative risks.
Identify how JPEG compression quality and video re-encoding degrade PRNU signal and adjust attribution confidence accordingly.

Key terms

Noise residual (W_hat): The per-image estimate of PRNU, computed by subtracting a denoised version of the image from the original. It contains PRNU plus random noise. Averaging many residuals produces a cleaner reference pattern.
Wavelet denoiser: A denoising filter that works in the wavelet transform domain, thresholding coefficients at each scale and direction to remove random noise while preserving edges. Used widely for PRNU extraction before BM3D became standard.
BM3D (Block-Matching 3D): A state-of-the-art denoising algorithm that groups similar image patches across the image, transforms them jointly, thresholds in the transform domain, and inverts. Its stronger scene-content removal yields cleaner PRNU residuals than earlier wavelet denoisers.
NCC (Normalised Cross-Correlation): The standard measure of similarity between a query noise residual and a reference PRNU pattern. It produces a value between -1 and +1, where 1 indicates perfect agreement. In camera attribution a threshold is set above which attribution is accepted.
PCE (Peak-to-Correlation Energy): A refined statistic that divides the peak of the cross-correlation surface by the energy of the non-peak region. PCE is more sensitive than raw NCC for detecting weak PRNU signals, and it is the preferred decision statistic in most contemporary PRNU systems.
Hypothesis testing in camera attribution: The framework in which H0 is 'the query image was not taken by this camera' and H1 is 'it was'. The PCE or NCC value is compared to a threshold calibrated from empirical distributions of true and false matches, and a decision is made with stated false-positive and false-negative rates.

Denoising to isolate the PRNU residual

The PRNU residual from a single image is computed as:

W_hat = I - F(I)

where I is the original pixel array and F(I) is the denoised version produced by a filter F. The residual contains the PRNU component (which is spatially correlated and fixed across images) plus random shot noise (which is independent across images and will average away). The quality of the residual depends entirely on how well F removes scene content without also removing the PRNU.

Two denoising approaches dominate the literature. Wavelet-based denoisers, commonly the Daubechies 8-tap filter used in the original Lukáš et al. implementation, operate by thresholding wavelet coefficients at each scale. They are computationally fast and work well for natural images. BM3D, introduced by Dabov et al. in 2007, achieves markedly better scene-content removal by exploiting self-similarity across the image through block matching. BM3D denoising residuals have a higher signal-to-noise ratio for PRNU than wavelet residuals, particularly for images with strong textural content, and it has become the preferred denoiser in high-accuracy PRNU systems.

Estimating the reference pattern from multiple images

The reference PRNU pattern for a camera is estimated by averaging the residuals from N images taken by that camera:

K_hat = (1/N) · sum(W_hat_i)

Because the random noise in each residual is independent, it averages toward zero with a rate proportional to 1/sqrt(N). The PRNU component, being constant across images, accumulates linearly. The signal-to-noise ratio of the reference therefore grows as sqrt(N), so doubling the number of images improves it by about 41 percent.

Reference pattern estimation: averaging N residuals cancels random noise and accumulates PRNU.

The images used for reference estimation should be varied in scene content: outdoor, indoor, bright, dark, textured, smooth. Using images that are too similar to each other (for example, a sequence of near-identical shots) means the scene-content residuals may partially correlate across images rather than cancelling, biasing the reference. In practice, casework analysts often work with whatever natural images are stored on a seized device, which tends to provide adequate variety.

Flat-field images (uniformly lit surface): maximise PRNU signal per image (bright uniform field keeps the multiplicative term high); 10-20 flat-field images can match 100+ natural images in reference quality.
Natural images (typical photographs): available on any seized device; 50-200 images needed for reliable attribution at resolutions of 8-24 megapixels.
Video frames: individual frames can be extracted and used as natural images; consecutive frames are highly correlated so spacing (every 10th frame or more) is needed to avoid bias.

Normalised cross-correlation and the PCE statistic

Given a query noise residual W_hat_q and a reference pattern K_hat_c for candidate camera c, the normalised cross-correlation is:

NCC = corr(W_hat_q, K_hat_c) / (||W_hat_q|| · ||K_hat_c||)

This ranges from -1 to +1. For a true source-camera match on an uncompressed image, NCC values above about 0.01 are considered significant (the absolute scale is low because the signal rides on top of much larger random noise). For a non-source camera the value hovers near zero. The NCC threshold for a positive attribution decision depends on the resolution and the desired false-positive rate.

PCE improves on raw NCC by asking not just 'is the correlation high?' but 'how prominent is the correlation peak relative to the surrounding surface?' The cross-correlation is computed in the frequency domain (via FFT), yielding a two-dimensional correlation surface. PCE is the peak value of that surface divided by the energy of the rest of it:

PCE = peak^2 / ((1/(MN-1)) · sum_rest(values^2))

A sharp, isolated peak against a flat background gives a high PCE. A broad, noisy correlation surface with no clear peak gives a low PCE. For true camera matches, PCE values in the hundreds or thousands are common. For non-source cameras, PCE typically stays below 60. These distributions overlap only at the low end of the true-match distribution, which is precisely the regime that requires careful interpretation.

PCE: a high isolated peak on the correlation surface indicates a true camera match.

Statistical hypothesis testing for camera attribution

Camera attribution is framed as a binary hypothesis test. The null hypothesis H0 is that the query image was not taken by the candidate camera. The alternative H1 is that it was. The analyst sets a threshold PCE_threshold, and the decision rule is: if PCE >= PCE_threshold, attribute to this camera; otherwise do not.

The threshold is calibrated empirically by testing a validation set of true matches (images of known source) and non-matches (images from different cameras). The distribution of PCE values under H0 is estimated from non-source pairs, and PCE_threshold is set so that the false-positive rate (attributing to the wrong camera) is acceptably low. In forensic practice, false-positive rates of 1 in 10,000 or lower are targeted, with the exact rate depending on the size of the candidate pool.

Decision outcome	What it means	Driven by
True positive (H1 accepted, correct)	Image correctly attributed to the source camera	High PCE from genuine PRNU match
False positive (H1 accepted, wrong)	Image wrongly attributed to a non-source camera	Chance high PCE; increases with pool size
True negative (H0 accepted, correct)	Image correctly excluded from non-source camera	Low PCE from unrelated noise patterns
False negative (H0 accepted, wrong)	Source image missed; not attributed correctly	Low PCE from heavy compression or processing

Required image count and the effect of JPEG quality

The number of reference images required for reliable attribution is not fixed. It depends on image resolution, JPEG quality, and the false-positive rate targeted. Several empirical studies have converged on practical guidance.

For a 12-megapixel sensor at JPEG quality 90+: 50 natural images are typically sufficient for a PCE-based attribution decision at a false-positive rate of 1 in 10,000. Reference quality saturates around 200-300 images for most sensors.
For a 3-megapixel sensor: more images are needed to compensate for the smaller pixel count. Studies suggest 100-200 images for comparable reliability.
For JPEG quality around 80: each image carries roughly 50-60% of the PRNU signal it would carry uncompressed. More reference images partially compensate, but the query residual is also weaker, so the PCE for a true match is lower and the distribution overlaps more with non-matches.
For JPEG quality below 70: PRNU signal is severely degraded in query images. Attribution becomes unreliable unless the candidate pool is small and additional corroborating evidence exists.

Resolution matters because more pixels means more PRNU samples per image, and more independent samples improve the correlation statistic. This is why smartphones with small sensors and then-lower resolutions in the early 2010s presented more challenging attribution cases than contemporary 20-megapixel cameras, even though their PRNU physics is identical.

PRNU matching for video frames

Video files present two complications for PRNU analysis: inter-frame compression and temporal correlation. Modern video codecs (H.264, H.265) compress most frames as motion-compensated differences from a reference frame rather than as independent images. Only I-frames (key frames) are encoded like independent JPEG images. P-frames and B-frames have different noise statistics and lower PRNU content.

The practical approach is to extract I-frames from the video and treat them as natural images for both reference estimation and query testing. Consecutive I-frames at typical video bit rates still contain enough PRNU signal for attribution, but the effective JPEG quality equivalent is often in the 70-85 range, meaning more frames are needed than photographs at the same nominal resolution. Research including work by Taspinar et al. has confirmed that PRNU attribution from video I-frames is feasible with 200-500 frames and that the PCE distributions for true and false matches retain the same structure as still-image attribution.

Worked example

Attribution of an anonymous tip photograph to a specific device

From a single query image and a pool of five candidate phones to a statistical attribution.

An investigator receives an anonymous photograph of a document that appears to have been taken inside a government building. Five employees had phones that might have been inside the relevant area. The photograph was shared via a messaging app and is a re-encoded JPEG estimated at quality factor 82. The task is to determine whether the image can be attributed to any of the five candidate phones.

Reference estimation. From each of the five phones, the analyst extracts all available natural images from the camera roll (ranging from 80 to 340 images per device). Each image is denoised with BM3D, and the residuals are averaged to produce five reference patterns K_hat_1 through K_hat_5.
Query residual extraction. The anonymous tip image is denoised with the same BM3D denoiser, producing W_hat_q.
PCE computation. The cross-correlation surface between W_hat_q and each of the five reference patterns is computed in the frequency domain. PCE is computed for each candidate. Results: Camera 1: PCE = 8.3. Camera 2: PCE = 11.1. Camera 3: PCE = 847.4. Camera 4: PCE = 6.9. Camera 5: PCE = 14.2.
Decision. The analyst's laboratory threshold for this image size and quality level (validated at a false-positive rate of 1 in 5,000) is PCE = 60. Camera 3 exceeds the threshold by a factor of 14. All other candidates are well below it.
Reporting. The analyst reports that the tip image is consistent with originating from Camera 3 (belonging to employee 3), with a PCE value of 847.4 against a validated false-positive threshold of 60, and notes that the JPEG re-encoding at estimated quality 82 reduces but does not eliminate the signal. A second opinion from an independent laboratory is recommended before evidentiary use.

This example illustrates both the power and the discipline of PCE-based attribution. A very high PCE against one candidate with near-baseline values for all others is the pattern that builds genuine confidence. A moderately elevated PCE across several candidates, or a single moderate PCE, should prompt caution and further analysis.

Check your understanding

Question 1 of 4· 0 answered

Why are multiple images from a camera averaged during reference pattern estimation rather than using a single image?

Key Takeaways

The PRNU residual is isolated by denoising each image and subtracting the result from the original; BM3D achieves better scene-content removal than earlier wavelet denoisers, yielding cleaner residuals.
A reference pattern for a camera is built by averaging residuals from many images of that camera; random noise cancels and PRNU accumulates, with quality improving as sqrt(N) of images used.
PCE (Peak-to-Correlation Energy) is the preferred attribution statistic; it measures how prominent the cross-correlation peak is relative to background structure and outperforms raw NCC for weak or compressed PRNU signals.
Attribution is a hypothesis test with a threshold calibrated to a target false-positive rate; the threshold is not universal and must be validated for the specific image resolution and quality in each case.
JPEG compression and video re-encoding degrade PRNU signal; quality factors below about 70-80 require larger image sets, lower-confidence claims, and recalibrated thresholds.

How is the PRNU reference pattern extracted from a set of images?

Each image is passed through a denoising filter (commonly a wavelet denoiser or BM3D) that removes the scene content and leaves a noise residual. The residuals from many images of the same camera are averaged. The scene-dependent random noise cancels across images, and the stable, spatially correlated PRNU pattern accumulates. The result is the estimated reference fingerprint for that camera.

What is the PCE statistic and why is it preferred over raw NCC?

The Peak-to-Correlation Energy ratio divides the peak value of the cross-correlation surface between a query residual and a reference pattern by the energy of the rest of that surface. Raw normalised cross-correlation gives one number between -1 and +1 but does not account for how prominent that peak is relative to background structure. PCE is sharper and more sensitive to weak PRNU signals, and empirical studies have shown that it better separates true attributions from chance correlations.

How many reference images are needed for reliable camera attribution?

The minimum depends on image resolution and content variability. For a 12-megapixel camera, studies have shown that 50 natural images typically yield a reference pattern strong enough for reliable attribution. Fewer images leave more random noise in the reference, reducing PCE for true matches. More images improve the reference up to a saturation point around 200-300 for most sensors.

How does JPEG quality affect PRNU extraction?

JPEG compression quantises the high-frequency DCT coefficients that carry much of the PRNU signal. At quality factors below about 80 the PRNU signal in each image is noticeably degraded, and the PCE for a true match drops. At quality factors above 90 the impact is small. Matching a low-quality query to a reference built from high-quality images is still possible but requires careful threshold calibration.

Can flat-field images be used instead of natural images for reference estimation?

Yes. Images of a uniformly illuminated surface produce higher per-image PRNU signal because the bright uniform field maximises the multiplicative term. Fewer flat-field images are needed to achieve the same reference quality as many natural images. The tradeoff is that flat-field images require controlled conditions; in casework the analyst usually works with whatever natural images are available on the seized device.

Test yourself on Forensic Audio, Video and Image Analysis with free, timed mocks.

Practice Forensic Audio, Video and Image Analysis questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.