Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.
Every digital image is a grid of numbers shaped by the sensor that captured it, the demosaicing algorithm that filled in the gaps, and the format that compressed and stored the result. Those choices leave forensically exploitable signatures in every file.
Last updated:
A photograph looks like a window onto reality. Under the hood it is a grid of integers produced by a semiconductor sensor, shaped by a colour filter mosaic, processed by a demosaicing algorithm, and then compressed by a codec that throws away information it judges invisible to the human eye. Each of those steps leaves a trace in the file, and those traces are the raw material of digital image forensics.
For an analyst, a manipulated image is not just one that looks suspicious. It is one whose internal statistics break the pattern that honest capture would have produced. A patch pasted from another photograph disrupts the local noise texture. Re-encoding at a different quality setting adds a second layer of JPEG artefacts on top of the first. Geometric resampling changes the correlation structure between neighbouring pixels. Every operation that touched the file left a mark, and understanding the normal marks of the capture pipeline is what makes the abnormal ones visible.
This topic builds the foundation for all of that. It covers how sensor physics and the Bayer mosaic create the raw data, how demosaicing reconstructs a full-colour image from it, how RAW, JPEG, PNG, and TIFF differ in what they preserve and what they discard, and how bit depth, colour spaces, and the sensor noise model feed into forensic methods. Get the physics right and the analytical techniques downstream make sense.
Every forensic image analysis starts at the moment a photon hits silicon.
A digital camera sensor is a rectangular grid of photosites, each a small photodiode that accumulates charge in proportion to the light it receives during an exposure. When the shutter closes, the camera reads out those charge values and converts them to digital integers. The resulting grid, one integer per photosite, is the rawest form of the image data.
Sensors are inherently monochrome. To record colour, most consumer and professional cameras place a Bayer Colour Filter Array (CFA) over the sensor. The most common pattern is a 2×2 tile with one red, one blue, and two green filters. Green is doubled because the human visual system is most sensitive to green wavelengths, and the extra green sample reduces noise in the luminance channel.
The Bayer CFA means each photosite records only one of the three colour channels. Demosaicing is the step that estimates the missing two channels at every site using neighbouring values. The quality and specific algorithm of demosaicing varies between camera manufacturers. Canon, Nikon, and Sony each ship different interpolation kernels, and those kernels leave characteristic spatial correlations in the pixel array that have been used since the mid-2000s as camera-identification signals.
The format determines how much of the original signal survives for analysis.
Not every image file carries the same forensic richness. The format a camera writes, or that a user later converts to, determines how much of the original sensor signal is available to an examiner. Each format makes a different set of trade-offs between file size, image quality, and information preservation.
| Format | Compression | Bit depth (typical) | Forensic note |
|---|---|---|---|
| RAW (DNG, CR2, NEF) | Lossless or none | 12–16 bits per channel | Maximum noise, demosaic, and metadata information retained |
| TIFF | Lossless (LZW/ZIP) or none | 8–16 bits per channel | No lossy degradation; large files; metadata often present |
| PNG | Lossless (DEFLATE) | 8 or 16 bits per channel | No compression artefacts; common in screenshots and edited images |
| JPEG | Lossy (DCT) | 8 bits per channel | Block artefacts, chroma subsampling; multiple saves compound the loss |
JPEG is the dominant format in investigations because it is the default for most phone cameras and social media platforms. Each time a JPEG is opened and saved at a given quality level, the DCT quantisation step reintroduces quantisation error on top of whatever was already there. This double-compression signature is one of the oldest and most reliable signals in JPEG forensics.
PNG is the standard for screenshots, graphics, and images that have been edited and explicitly saved without lossy compression. Because PNG is lossless, the pixel values are exact. This means noise analysis and clone-detection methods retain their full power. The absence of JPEG block artefacts in a PNG claimed to be a camera-original should itself prompt scrutiny: most cameras do not output PNG natively.
The numbers in an image file are not raw light measurements. They are carefully encoded values, and that encoding matters.
Bit depth sets how many discrete tonal steps each colour channel can represent. An 8-bit channel has 256 steps; a 12-bit RAW channel has 4,096; a 16-bit channel has 65,536. This matters forensically because editing operations that spread or compress the tonal range leave characteristic gaps or spikes in the histogram.
Colour spaces define how numerical RGB triples map to actual colours. sRGB, the default for the web and most consumer cameras, covers roughly 35% of all visible colours. Adobe RGB covers a wider gamut and is common in professional photography. ProPhoto RGB covers nearly all of the Pointer gamut. Converting between these spaces is not lossless: out-of-gamut values are clipped, and in-gamut values are numerically remapped, altering the statistical fingerprints that camera-identification methods rely on.
Gamma encoding is a non-linear mapping applied before storage. Human vision is more sensitive to differences in dark tones than in bright ones, so images are stored with a curve (gamma ≈ 2.2 for sRGB) that allocates more code values to shadows and fewer to highlights. An analyst working with raw sensor output must account for this encoding or noise estimates will be skewed toward bright pixels.
Imperfection in the sensor is the source of its most powerful forensic signature.
A sensor that perfectly measured every photon it received would produce identical output from identical scenes. Real sensors do not. Several independent noise processes add uncertainty to each photosite reading, and understanding them is the basis for both noise-consistency analysis and camera fingerprinting.
Camera identification via PRNU, developed by Jan Lukáš, Jessica Fridrich, and Miroslav Goljan at Binghamton University in 2006, works by extracting the high-frequency noise residual from an image after suppressing the scene content with a denoising filter. Averaging this residual across many images amplifies the stable PRNU component and cancels random noise. The resulting fingerprint can be correlated against the residual from a questioned image. The method has been validated in court proceedings in multiple jurisdictions and is now routinely applied to images from phone cameras recovered in criminal investigations.
Resolution is a claim about output size, not about image quality. That distinction catches forgers.
Resolution has two distinct meanings that are often conflated. Pixel dimensions (e.g., 4000×3000 pixels) describe the actual data in the file. PPI (pixels per inch) is a metadata tag that tells printing software at what density to render those pixels on paper. PPI has no effect on the pixel data: the same 4000×3000 file looks identical on screen whether its embedded PPI is 72 or 300. Only the printed output size changes.
This matters for document fraud. A forger who scales a small image up to produce a larger one introduces resampling artefacts: the interpolation algorithm creates smooth gradients where the original had sharp transitions, and periodic patterns appear in the Fourier spectrum of the image. A document examiner comparing pixel-level sharpness across different regions of a scanned form can use these artefacts to detect that one region was captured at a different resolution and then resampled to match.
Each stage of the imaging pipeline leaves a distinct mark. Here is what analysts look for.
The imaging pipeline is a chain of decisions: sensor design, Bayer pattern, demosaicing algorithm, in-camera processing, colour-space assignment, format, and compression. Each link in that chain writes something into the file. Forgeries that alter the pixel content break one or more of those consistent writes. The analyst's job is to know what each link normally writes and to recognise the breakage.
Why does the Bayer CFA use two green photosites in each 2×2 tile instead of one?
Test yourself on Forensic Audio, Video and Image Analysis with free, timed mocks.
Practice Forensic Audio, Video and Image Analysis questionsSpotted an error in this page? Report a correction or read our editorial standards.