Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
How forensic scientists and provenance systems detect whether an image was captured by a camera or generated by an AI, covering GAN fingerprints, diffusion model spectral signatures, C2PA cryptographic manifests, and the limitations of binary classifiers on unseen generators.
Last updated:
Every photograph ever taken carries invisible marks of how it was made: the noise pattern of the sensor, the optical imperfections of the lens, the compression fingerprint of the in-camera processor. Every image generated by an AI carries a different set of marks: the periodic patterns of transposed-convolution upsampling, the spectral distribution of a diffusion model's denoising residual, the absence of any camera-native statistics at all. The forensic task is to read those marks and reach a reliable conclusion about the image's origin.
This matters at two levels. At the individual-image level, a court or investigative body may need to know whether a specific image showing an alleged event was taken by a camera or generated by a model. At the systemic level, the ability to make any photographic truth claim is eroding as synthetic images become indistinguishable to human viewers. The provenance infrastructure response to that erosion, led by standards like C2PA, attempts to solve the problem not by detecting synthesis artefacts but by establishing a verifiable chain of custody for the image from capture to publication.
This topic covers the forensic detection methods for AI-generated images, from the GAN fingerprint as a device-fingerprint analogue to the spectral signatures specific to diffusion models, and then the provenance infrastructure layer, C2PA and invisible watermarking proposals. It is honest about the limitations: binary classifiers break on out-of-distribution generators, provenance chains only work if cooperating parties adopt them, and watermarks can be removed. The conclusion is not despair but a realistic assessment of what the current toolkit can and cannot support in court or investigative settings.
Each trained GAN leaves a detectable mark on every image it generates.
In traditional digital camera forensics, PRNU is a reliable way to link a photograph to a specific device. The per-pixel sensitivity variation of a camera sensor is stable across photographs and can be extracted by averaging many images and subtracting the low-frequency content. Marra et al. (2019) demonstrated that GAN generators have an analogue: each trained model's weights imprint a periodic fingerprint on every image it generates. The fingerprint is model-specific, not architecture-specific. Two ProGAN models trained with different random seeds produce detectable, distinguishable fingerprints.
The practical forensic workflow mirrors PRNU attribution. A reference fingerprint is extracted from a set of known images from the model under investigation, typically by averaging the high-pass residuals of hundreds of generated samples. The fingerprint is then correlated with the residual of the query image. High correlation indicates the query was generated by that model; low correlation rules it out. This approach can attribute a specific generated image to a specific model instance, which is potentially useful for source tracing in disinformation investigations.
Diffusion images lack camera-native noise and carry U-Net upsampling periodicity instead.
Diffusion models generate images through a denoising process that operates in latent space and decodes to pixel space via a VAE (variational autoencoder) decoder. The decoder's upsampling layers introduce spectral periodicity, similar to but distinct from the GAN checkerboard. More forensically significant is the absence of properties that all genuine photographs carry: camera sensor noise (photon shot noise, dark current), optical vignetting, chromatic aberration, and the spatial correlation structure of natural image statistics in the very high-frequency band.
| Property | Genuine photograph | Diffusion model output |
|---|---|---|
| Sensor noise | Present: photon shot noise + dark current per pixel | Absent: denoising residual instead |
| High-frequency spectral slope | Follows natural scene statistics (1/f) | Deviates; flatter or steeper depending on model |
| Optical artefacts | Vignetting, chromatic aberration, barrel distortion | Absent unless post-processed |
| EXIF metadata | Authentic device + GPS + settings chain | None, or fabricated |
| Compression history | Single in-camera JPEG or raw decode | Often re-encoded; no original raw |
| Spectral periodicity | None from optics (slight from demosaicing) | U-Net upsampling periodicity in decoder |
Corvi et al. (2023) published a systematic study of diffusion model artefacts across Stable Diffusion, DALL-E 2, and several other systems, finding that detectors trained on GAN output performed significantly worse on diffusion output. They proposed features specific to the diffusion denoising residual that improved cross-family generalisation, but noted that performance on novel, unseen diffusion model checkpoints was still substantially lower than on held-out test sets from known models.
Attaching a verifiable creation history to every image, signed at every step.
The Coalition for Content Provenance and Authenticity (C2PA) was formed in 2021 by Adobe, Microsoft, BBC, Intel, and Truepic, with the stated aim of building an open technical standard for media provenance. The core concept is a manifest: a structured record attached to a media file that describes its creation history, with each step signed by the tool or actor responsible for it. A photograph taken with a C2PA-enabled camera gets a manifest signed with the camera's hardware key. Each edit in C2PA-aware software adds a signed update. If the image was generated by an AI tool, that tool records its contribution.
C2PA does not make forgery impossible. An attacker who controls the signing keys, or who captures a screen-grab of an authentic image to strip the original manifest, circumvents the chain entirely. What C2PA provides is a positive claim: when the chain is intact, the creation history it describes can be verified. When the chain is absent, the image makes no authenticated provenance claim, which is itself information. As of 2024, Leica, Sony, and Nikon have released or announced C2PA-enabled camera firmware, and Adobe Photoshop generates C2PA manifests for exported files.
Marking AI output at generation time, invisibly and persistently.
Invisible watermarking proposes embedding a signal in generated images during model inference, before any post-processing, so that all output from a cooperating generator is marked as synthetic. Several technically distinct approaches exist. Spectral watermarking places a low-amplitude pattern at specific spatial frequencies chosen to survive moderate JPEG compression. Latent-space watermarking applies a fixed perturbation to the latent representation before decoding, so the mark appears in every generated image from that model. Semantic watermarking trains the model to include a specific detectable pattern in generated textures without visible distortion.
A classifier trained on last year's generators may fail on this year's.
The core limitation of binary AI-vs-real classifiers is the same as for deepfake video detectors: they learn the artefact signature of the generators in their training set. When a novel generator with a different architecture is released, the classifier's features may not activate on the new output, causing accuracy to collapse. Wang et al. (2020) demonstrated this with CNNDetect, showing that a ResNet-50 classifier trained on ProGAN output achieved near-perfect accuracy on ProGAN test images but dropped to chance-level accuracy on several other GAN architectures when trained only on ProGAN.
Proposed mitigations include training on augmented data that mimics a wide range of post-processing operations, so the classifier learns to detect features that survive processing rather than artefacts specific to one generator; using frequency-domain features that are more architecture-agnostic; and ensemble approaches that combine signals from multiple detection methods. None fully solves the OOD problem, particularly against diffusion models, which have different spectral properties from the GANs used in most published training sets as of 2022-2023.
| Approach | In-distribution accuracy | OOD accuracy | Notes |
|---|---|---|---|
| Pixel-domain CNN (e.g., XceptionNet) | Very high | Low | Learns generator-specific textures |
| Frequency-domain classifier | High | Moderate | More architecture-agnostic spectral features |
| GAN fingerprint correlation | High for attribution | Not applicable for unseen models | Requires reference outputs from target model |
| Noiseprint / residual CNN | High | Moderate | Transfers across some generator families |
| C2PA manifest check | N/A (provenance, not detection) | Applies to all cooperating generators | Fails if chain absent or circumvented |
What does it mean to assert that a photograph shows something that happened?
The photographic truth claim, the implicit assertion that a photograph depicts a real event, has been legally and socially significant since the 19th century. Courts in common-law and civil-law jurisdictions have developed evidentiary standards for photographic evidence that presume an authentic capture chain unless challenged. Digital image forensics inherited and extended this framework, developing tools to detect manipulation of genuine photographs.
AI-generated images introduce a qualitatively different challenge. The question is no longer whether a genuine photograph was altered but whether any camera-capture event occurred at all. A fully synthetic image has no original: it was never a photograph. The forensic and evidentiary consequence is that authentication cannot proceed by establishing chain-of-custody back to a capture device, because no capture device exists. Instead, the burden falls on detection of synthesis artefacts, provenance manifest verification, or contextual and OSINT corroboration of the depicted events.
What is the forensic analogy between PRNU and the GAN fingerprint identified by Marra et al. (2019)?
Test yourself on Forensic Audio, Video and Image Analysis with free, timed mocks.
Practice Forensic Audio, Video and Image Analysis questionsSpotted an error in this page? Report a correction or read our editorial standards.