Noise and CFA Inconsistency Detection

Camera sensors imprint a characteristic noise signature on every image they produce. Analysing noise level, noise model fit, and CFA interpolation residuals across image regions can localise forgeries and distinguish steganographic modifications from post-capture compositing.

Last updated: 19 Jun 2026

Noise and CFA inconsistency detection identifies image regions that do not share the noise characteristics of the host camera sensor. Every sensor imprints a characteristic noise signature on its images, combining signal-dependent shot noise and a fixed readout noise floor. The Bayer colour filter array adds a second layer of structure: demosaicing creates a periodic inter-pixel correlation that is consistent throughout an authentic image and disrupted wherever content from a different source has been composited in. Comparing local noise statistics and CFA correlation strength against a fitted camera noise model can localise forged regions and, when combined with PRNU and JPEG analysis, support a conclusion about image authenticity.

Every image sensor imprints a characteristic noise signature on its output. Photon counting is a statistical process, and the electrons generated by each photosite vary randomly around the true signal; readout circuits add a fixed-pattern noise floor on top of this. The combined signature is sensor-specific and is present in every pixel of every image that sensor produces. When part of an image is replaced or inserted from a different source, the noise in that region will not match the surrounding sensor fingerprint, and that mismatch is what noise-based forensics exploits.

The colour filter array adds a second layer of structure. Most sensors use a Bayer mosaic, where each photosite captures only red, green, or blue. Demosaicing software reconstructs the full-colour image by interpolating the missing channels, and this interpolation creates a predictable periodic correlation between neighbouring pixels. This correlation is a spatial fingerprint: it is consistent throughout a genuine image and is disrupted wherever content has been spliced in from a source that used a different interpolation algorithm, or from a rendered image that bypassed the CFA process entirely.

The sections below cover noise-based forgery analysis in sequence: Gaussian noise estimation from homogeneous patches, camera noise model fitting and local variance maps, CFA interpolation artefacts and how splicing breaks the pattern, Bayesian forgery localisation using noise residuals, and the distinction between steganographic modifications and compositing forgery.

By the end of this topic you will be able to:

Estimate camera noise level from homogeneous image patches using block-based and wavelet-based methods and interpret a local noise-level map.
Fit a camera noise model (variance = aI + b) to an image and identify regions whose local noise variance deviates significantly from the model prediction.
Explain how Bayer CFA demosaicing creates a periodic inter-pixel correlation, and describe how splicing breaks that pattern through phase shifts or algorithm mismatches.
Apply Bayesian hypothesis testing to noise residuals to produce a forgery probability map rather than a fixed-threshold anomaly flag.
Distinguish steganographic noise signatures (global, uniform modification) from compositing forgery signatures (localised anomaly), and identify the conditions under which that distinction becomes ambiguous.

Key terms

Camera noise model: A mathematical description of a sensor's noise as a function of signal level, combining signal-dependent shot noise and signal-independent readout noise. The model is specific to the sensor, ISO, and exposure conditions.
CFA (Colour Filter Array): The mosaic of colour filters placed over a sensor so that each photosite captures one colour channel. The Bayer pattern (RGGB arrangement) is the most common.
Demosaicing: The interpolation process that reconstructs missing colour channels at each pixel from adjacent CFA samples. Different algorithms (bilinear, AHD, VNG, deep-learning) produce different interpolation fingerprints.
CFA phase: The spatial offset of the CFA mosaic pattern in an image. Pasting a region from a different image or after a non-CFA-aligned crop can introduce a phase shift that breaks the expected periodic pattern.
Noise residual: The high-frequency component of an image obtained by subtracting a denoised version from the original. The residual retains camera-model fingerprint noise and manipulation artefacts.
Steganography: The practice of hiding data within a digital image by making imperceptible modifications to pixel values. In the forensics context, steganographic modifications produce a noise signature that must be distinguished from forgery signatures.

Estimating noise from homogeneous image patches

To isolate noise, analysts identify regions where the signal is approximately constant, so that pixel-to-pixel variation is dominated by noise rather than image content. These are called homogeneous patches.

A block-based homogeneity test divides the image into small non-overlapping blocks (typically 8x8 or 16x16 pixels) and computes a gradient or local variance measure for each block. Blocks with low gradient and low local variance are classified as homogeneous candidates. Within each candidate block, the pixel variance is a direct estimate of the noise variance. Aggregating across many homogeneous blocks gives a robust estimate of the noise level for that image.

The estimated noise level is a global statistic for the image. For forgery detection, the key step is to compute local noise estimates across the image, producing a noise-level map. Regions that deviate significantly from the global noise level are anomalous and warrant further investigation. A pasted region from a lower-noise source (for example, from a higher-ISO image or a different camera) will appear as a low-noise patch in an otherwise uniformly noisy field, or as a high-noise patch in a low-noise field.

Camera noise model fitting and local variance maps

Real camera noise is not purely Gaussian. Shot noise (Poisson-distributed, arising from the discrete nature of photons) has variance proportional to the signal intensity. Readout noise from the analogue-to-digital converter is signal-independent. The combined noise variance as a function of pixel intensity follows a roughly linear relationship: variance(I) = a * I + b, where I is pixel intensity, a is the shot-noise coefficient, and b is the readout noise floor. This is the camera noise model.

Fitting this model to an image uses homogeneous patches at different brightness levels to sample the variance-versus-intensity curve and estimate a and b. The model is characteristic of the specific sensor and ISO. A forensic analyst with the model for a claimed source device can then test whether every region of the image is consistent with that model.

Camera noise model: variance vs. intensity, showing shot and readout noise components.

Local variance maps extend this to forgery localisation. The image is divided into small windows, the local noise variance is estimated in each window, and the result is normalised against the expected variance from the fitted model at the local mean brightness. Windows that show significantly higher or lower normalised variance than predicted are flagged. A clustered region of deviant windows is consistent with compositing.

CFA interpolation artefacts and how splicing breaks the pattern

The Bayer CFA assigns a red, green, or blue filter to each photosite in a repeating 2x2 mosaic. Because a pixel in the demosaiced image has its missing two channels interpolated from neighbours, adjacent pixels are not independent. Specifically, the green channel samples every second pixel in alternating rows and columns, and the interpolated value at a given pixel depends on its neighbours. This produces a periodic spatial correlation in the demosaiced image that peaks at frequencies corresponding to the 2-pixel CFA period.

Popescu and Farid (2005) demonstrated that this periodic correlation can be detected via the autocorrelation of the image or via its 2D Fourier transform, and that it is a reliable signature of CFA-demosaiced images. The specific form of the correlation also encodes the interpolation algorithm used, providing a second level of fingerprinting beyond simply 'was this demosaiced?'

Splicing breaks the CFA pattern in two ways. First, content pasted from an image that used a different interpolation algorithm will have a different correlation structure, detectable by fitting the expected autocorrelation model and measuring the residual. Second, a paste that shifts the CFA phase, because the source image was cropped at a non-even offset, introduces a phase discontinuity at the splice boundary. The phase is detectable locally, and a phase step at a sharp boundary is consistent with splicing.

Signal	Authentic image	Copy-move from same image	Splice from different image
CFA correlation present?	Yes, uniform	Yes, may have phase shift if offset	No, or different algorithm signature
Noise level consistent?	Yes	Yes (same sensor)	No (different sensor or ISO)
Noise model fit?	Yes	Yes	No (different a and b coefficients)
Illumination consistent?	Yes	Yes	Possibly no

Bayer CFA phase discontinuity at a splice boundary: left half shows the authentic RGGB mosaic in correct phase; right half shows pasted content with a 1-pixel horizontal phase shift, exposing a misaligned column at the boundary where G and R swap positions.

Bayesian forgery localisation using noise residuals

Bayesian methods frame forgery localisation as a hypothesis test at each pixel or block, replacing ad-hoc threshold-setting on a noise-level map with a principled statistical decision. The null hypothesis is that the local noise is consistent with the fitted camera noise model. The alternative is that the noise deviates in a way consistent with content from a different source. The likelihood ratio of these two hypotheses, computed from the local noise residual, produces a forgery probability map.

The noise residual at a pixel is the difference between the observed pixel value and a denoised estimate, obtained by applying a denoising filter to the image. If the denoising filter is well-matched to the camera noise model, the residual in authentic regions should look like zero-mean Gaussian noise with variance equal to the model prediction. Regions where the residual has higher or lower variance, or where it has a structured (non-random) pattern, are anomalous.

Steganographic versus forgery noise signatures

Noise analysis is used in both steganalysis and forgery detection. The two tasks are distinct and produce different noise signatures.

Classic spatial-domain steganography (LSB replacement, LSB matching) embeds data by modifying the least-significant bit of pixel values throughout the entire image. The modification is visually imperceptible but disrupts the natural noise distribution. Specifically, LSB replacement causes the histogram of pixel values to become statistically regular in a detectable way (the WS steganalysis method and its successors exploit this). The modification is global, affecting the whole image's noise statistics uniformly, though adaptively-chosen embedding sites in smooth regions may concentrate the modifications.

Compositing forgery produces localised noise inconsistencies concentrated in the inserted region and at its boundary. The rest of the image retains the original camera noise model. This spatial locality is the key discriminator. A noise anomaly that is uniform across the entire image, with no localised region of inconsistency, is more consistent with steganography or global processing than with regional compositing.

Global noise floor elevation + no local anomaly: consider steganography or global filter application.
Local noise anomaly at spatially coherent region: consistent with compositing; check boundaries for CFA phase shift and JPEG ghost.
Local noise anomaly at edges only: consider JPEG blocking artefacts, sharpening, or border effects before concluding compositing.

Limitations and corroboration requirements

Noise-based methods have specific failure modes. In-camera noise reduction applied by modern smartphones can render all regions equally smooth, masking both the genuine camera noise fingerprint and the inconsistency that would indicate a splice. High ISO images have higher shot noise that can swamp the noise model deviation produced by a carefully-matched pasted region.

Camera models that use computational photography (multi-frame averaging, night mode, HDR fusion) process images from multiple exposures and combine them, producing noise statistics that differ significantly from a single-exposure model. Applying a single-exposure noise model to a computational photograph will produce apparent noise anomalies across the entire image that have nothing to do with manipulation.

The required corroboration is the same across all passive methods: a noise anomaly that appears independently in JPEG compression analysis, CFA correlation analysis, and geometric consistency checks is substantially more reliable than a noise anomaly found by one method alone. A report that presents noise-level inconsistency as a standalone forgery indicator is vulnerable to challenge on all of the above grounds.

Worked example

Noise and CFA analysis on a social-media photograph in an insurance claim

A smartphone image, a suspect patch, and three noise-based signals that converge.

An insurer receives a smartphone photograph purportedly showing hail damage to a vehicle. The image has been shared via a messaging application that re-compresses images. The claimed device is a smartphone known to use a Bayer CFA with bilinear demosaicing at the compression quality settings involved. The suspect region is a circular area on the car's bonnet showing more severe denting than the surrounding panels.

Camera noise model fit. The analyst estimates noise from homogeneous patches in the sky and the smooth car paintwork. The fitted noise model (a = 0.012, b = 3.4 at this ISO) is consistent with the claimed smartphone sensor specification. Local noise variance in the suspect region is 0.9 sigma below the model prediction for its local mean brightness, indicating the region is quieter than expected. This is consistent with the patch having been sourced from an image processed with stronger noise reduction.
CFA autocorrelation map. The analyst computes the local CFA correlation strength across the image. Outside the suspect region, correlation strength is consistent with bilinear demosaicing. Within the suspect region, the correlation is 40% weaker and the phase is shifted by 1 pixel horizontally, consistent with content pasted from an image cropped at a non-even offset.
Noise residual spatial pattern. The noise residual within the suspect region lacks the fine-grained texture expected from a camera sensor and shows a smoother, quasi-periodic pattern more consistent with a rendered or retouched image than with a photographed metallic surface.
Finding. Three noise-based indicators converge: the region is quieter than the camera model predicts, the CFA correlation is weaker and phase-shifted, and the noise residual texture is inconsistent with the surrounding photographic content. These findings are consistent with the suspect region having been composited from a different source image. The finding is qualified: the analysis cannot determine the nature or origin of the source content, only that the suspect region's noise characteristics are inconsistent with the host image.

Check your understanding

Question 1 of 4· 0 answered

Why do forensic noise estimates use homogeneous image patches rather than the whole image?

Key Takeaways

Camera noise combines signal-dependent shot noise and constant readout noise; fitting this model to an image establishes a sensor fingerprint that forged regions deviate from.
CFA demosaicing creates a periodic inter-pixel correlation that is characteristic of the interpolation algorithm; spliced regions break this pattern through phase shifts or algorithm mismatches.
Bayesian forgery localisation tests each image block against the fitted noise model and produces a probability map, turning noise analysis into a principled hypothesis test rather than a threshold heuristic.
PRNU (Photo Response Non-Uniformity) is the most powerful camera fingerprint, linking images to specific sensors; inconsistency in the PRNU map within an image localises content from a different camera.
Steganography raises the global noise floor; compositing creates localised noise anomalies. Spatial localisation distinguishes the two, though adaptive steganography and computational photography complicate this distinction.

How is Gaussian noise estimated from an image patch?

Candidate homogeneous patches are identified using a block-based homogeneity test. Within each candidate block, the pixel variance estimates the noise variance. Aggregating across many homogeneous blocks gives a robust estimate. Wavelet-based methods estimate noise from the high-frequency subband and are more robust on complex textures.

What is a camera noise model and what does it tell a forensic analyst?

The camera noise model describes how noise variance depends on pixel intensity: variance = a * I + b, where a is the shot-noise coefficient and b is the readout noise floor. The model is sensor- and ISO-specific. Regions of an image that deviate from the fitted model may have originated from a different sensor.

What is a CFA and how do CFA interpolation artefacts arise?

A colour filter array places coloured filters over sensor photosites so each captures one channel. Demosaicing interpolates the missing channels from neighbours, creating periodic inter-pixel correlation. This correlation is a spatial fingerprint of the interpolation algorithm and is present in all CFA-demosaiced images.

How do forged regions break the CFA pattern?

Pasted content may have a different CFA phase (from a non-aligned crop), a different interpolation algorithm's fingerprint, or no CFA correlation at all (if from a rendered image). These inconsistencies are detectable via the autocorrelation or Fourier-domain analysis of the demosaiced image.

How do steganographic noise signatures differ from forgery noise signatures?

Classic steganography spreads modifications throughout the image, raising the global noise floor uniformly. Compositing forgery creates localised noise inconsistencies bounded to the inserted region. Spatial localisation of the noise anomaly is the key discriminator, though adaptive steganographic methods and computational photography can complicate this.

Test yourself on Forensic Audio, Video and Image Analysis with free, timed mocks.

Practice Forensic Audio, Video and Image Analysis questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.