DCT Block Boundaries and Quantisation Grid Analysis

JPEG and video codecs divide images into 8x8 pixel blocks before applying the discrete cosine transform, leaving a grid of quantisation artifacts that encode the compression history of the image. Forensic analysts exploit inconsistencies in blocking artifact phase and strength to detect spliced regions and double-compression at misaligned grids.

Last updated: 24 Jun 2026

DCT block boundary and quantisation grid analysis is a technique for detecting image tampering by measuring the spatial pattern of compression artifacts left by JPEG encoding. JPEG divides an image into a fixed 8x8 pixel grid, applies the discrete cosine transform (DCT) to each block independently, and then quantises the resulting coefficients using a quantisation table chosen by the encoder. This process introduces discontinuities at block boundaries, called blocking artifacts, whose strength and phase are determined by the quantisation table and by the position of the grid origin. When a region is cut from a second image and composited into a JPEG image, or when a JPEG is re-compressed after editing, the blocking artifacts in the manipulated area show a different phase, different strength, or different characteristic frequency pattern from the surrounding original content. Systematic measurement of these differences forms the basis of a forensic detection method that is independent of image content.

The practical power of this approach lies in its passive, non-destructive nature. An analyst works on the JPEG file as received, without altering it, and derives evidence purely from the mathematical structure of the compression. The method does not require knowledge of the original, unmodified image. It applies to photographs from any source, to video stills, to screenshots, and to synthetic images that have been JPEG-compressed at any point in their creation chain. Limitations do exist: very high quality settings, strong post-processing, and some AI-generated images reduce artifact strength to the point where detection becomes unreliable.

The forensic literature on this topic began to consolidate in the early 2000s with Farid and Lyu's work on statistical image forensics, and the JPEG ghost technique introduced by Farid in 2009 became a widely taught practical tool. Subsequent work formalised the block grid phase estimator and the double-quantisation histogram detector. These methods now appear in commercial forensic platforms and in peer-reviewed testimony in courts across multiple jurisdictions, including the United States, the United Kingdom, and EU member states.

By the end of this topic you will be able to:

Explain how JPEG quantisation creates blocking artifacts and why their phase and strength are diagnostically significant.
Describe how misaligned double compression differs from aligned double compression and identify the signature of each in DCT coefficient histograms.
Explain the JPEG ghost technique and apply the concept to determine whether a suspicious region was saved at a different quality factor from the surrounding image.
Identify how block-grid phase inconsistencies reveal spliced regions, and describe the phase estimation procedure used to localise the boundary.
Describe how DCT-based analysis extends to video intra-frames and articulate what evidence it contributes to a court report alongside other authentication findings.

Key terms

Discrete Cosine Transform (DCT): A mathematical transform that expresses a block of pixel values as a sum of cosine functions at different frequencies. JPEG applies the DCT to 8x8 pixel blocks. The resulting 64 DCT coefficients represent the spatial frequency content of the block, with low-frequency terms in the top-left of the coefficient matrix and high-frequency terms in the bottom-right.
Quantisation table: A matrix of 64 integer divisors, one per DCT coefficient position, that controls how much each frequency component is rounded during JPEG compression. Dividing by a larger number loses more information but achieves higher compression. The quantisation table is embedded in the JPEG file header and is directly readable, giving analysts access to the compression parameters used.
Blocking artifact: A visual discontinuity at the boundary between two adjacent 8x8 blocks in a JPEG image, caused by independent quantisation of each block. Blocking artifacts manifest as abrupt changes in tone or colour at 8-pixel intervals. Their strength increases with higher compression (lower quality factor) and their spatial phase is fixed by the grid origin used during encoding.
Block grid phase: The horizontal and vertical offset of the 8x8 encoding grid relative to the image coordinate origin. A phase of (0,0) means the grid starts at the top-left pixel. A phase of (3,5) means the grid is shifted 3 pixels right and 5 pixels down. When a pasted region comes from a differently-aligned source, its phase differs from the host image, and that difference is detectable.
Double JPEG compression: The situation where an image has been JPEG-encoded, decoded to a bitmap, and then JPEG-encoded again. The second quantisation step leaves a characteristic pattern in DCT coefficient histograms, specifically zeros at multiples of the first quantisation step that the second pass cannot fill. Detecting this pattern indicates the image has been through at least two JPEG saves, which is consistent with editing.
JPEG ghost: A visualisation technique introduced by Farid (2009) that reveals regions compressed at a quality factor different from the rest of the image. The image is repeatedly re-compressed at different quality factors and the pixel-wise difference from the original is computed. A pasted region that was originally saved at quality Q will show minimum difference when recompressed at Q, creating a visible ghost of its extent against the surrounding area.

How JPEG compression creates a forensic signature

JPEG compression begins by dividing the image into a tiling of 8x8 pixel blocks. Each block is transformed to the frequency domain using the DCT, producing 64 coefficients. These coefficients are then divided by the corresponding entries in the quantisation table and rounded to the nearest integer. The rounding step is where information is discarded, and it is also where the forensic signature is created.

When the image is decoded, the quantised coefficients are multiplied back by the quantisation table values and inverse-transformed. The reconstructed pixel values differ from the originals by a rounding error bounded by half the quantisation step. At block boundaries, adjacent blocks were quantised independently, so the boundary between two reconstructed blocks is typically not smooth: there is a small but measurable discontinuity. This discontinuity, the blocking artifact, has a fixed spatial period of 8 pixels and a phase determined by where the grid origin was placed during encoding.

The quantisation table is stored in the JPEG file header in the DHT and DQT marker segments. A forensic analyst can read it directly using any JPEG parser. Matching the quantisation table to known profiles for specific cameras, software applications, or social media platforms is a fast first step that narrows the provenance question before any pixel-level analysis begins.

Block grid phase estimation and splicing detection

When a manipulator cuts a region from one JPEG image and pastes it into another, the pasted region brings its own block grid phase. Unless the manipulator takes deliberate steps to realign the grid (which requires working in a lossless format and re-saving with an identical origin), the pasted region and the host image will have blocking artifacts at different horizontal and vertical offsets. The phase difference is one of the most reliable indicators of a composite image.

The detection procedure works by computing a local measure of blocking artifact strength across the image at each of the 64 possible phase offsets (8 horizontal times 8 vertical). For each small analysis window, the dominant phase is estimated by comparing the variance at block boundaries versus mid-block positions for each candidate phase. A consistent dominant phase across a region means that region was encoded with that phase. A sudden change in the dominant phase map indicates a boundary between regions from different sources.

Scenario	Phase map appearance	Interpretation
Unmodified JPEG from camera	Uniform dominant phase across entire image	Single-source, single-compression history
Spliced region from different source	Distinct phase in pasted region, different from surrounding area	Region from a second source image with a different grid origin
Aligned double compression (same grid)	Uniform phase, but artifact strength pattern differs from single-compress	Re-saved without editing; grid was not shifted
Misaligned double compression (grid shifted)	Two coexisting phases superimposed in the re-compressed area	Image decoded, cropped or shifted, then re-saved

The localisation resolution of the phase estimator depends on the size of the analysis window. A small window gives fine spatial resolution but noisy phase estimates; a large window gives stable estimates but blurs the boundary location. Practical implementations use overlapping windows of 64x64 pixels or larger, then apply spatial filtering to the phase map. The result is a map that identifies suspicious regions but typically cannot precisely determine the boundary at single-pixel resolution.

Double JPEG compression: aligned and misaligned cases

Double JPEG compression is a necessary consequence of any editing workflow that loads and re-saves a JPEG. It is not by itself proof of tampering: legitimate workflows such as adjusting metadata, rotating an image, or passing a file through a social media platform all introduce a second compression pass. What matters forensically is whether the double compression is consistent with the claimed provenance of the file and, crucially, whether it is uniform across the image or present only in some regions.

When the first and second compression passes use the same 8x8 grid alignment (aligned double compression), the DCT coefficients after the second pass show a characteristic histogram pattern. DCT coefficients that were already quantised to multiples of the first quantisation step Q1 are now quantised again at step Q2. If Q2 divides Q1, the histogram is largely unchanged. If Q2 does not divide Q1, the histogram shows periodically missing values: integers that would require fine DCT precision to produce cannot survive the first quantisation. Detectors based on this effect are sometimes called DQ (double quantisation) detectors.

Misaligned double compression occurs when the image is shifted or cropped between the two compression passes, so the second-pass grid does not coincide with the first-pass grid. This scenario produces stronger forensic evidence. The first-pass blocking artifacts are at phase P1; the second pass introduces new artifacts at phase P2. The decoded image contains blocking patterns at both phases simultaneously, which is physically impossible in a single-compression image. Measuring the power spectral density of the luminance channel at 8-pixel periodicity reveals two distinct phase peaks.

The JPEG ghost technique

The JPEG ghost technique, described by Hany Farid in 2009, provides a different angle on the same problem. Instead of analysing the blocking artifact pattern directly, it exploits the fact that re-compressing an image at its original quality factor reduces the blocking artifacts to near zero, while re-compressing at a different quality factor introduces new artifacts. An image that contains a spliced region saved at quality Q will show minimum distortion in that region when re-compressed at Q, but higher distortion in the surrounding area (which was saved at a different quality). The spatial map of this difference, computed across a sweep of quality factors, reveals the extent of the pasted region as a ghost.

The procedure is as follows: for each candidate quality factor q from 50 to 99, decode the suspicious JPEG to a bitmap, re-compress it as JPEG at quality q, decode again, and compute the mean squared pixel difference from the original bitmap. A region whose original quality was q will show a local minimum in this difference at q, while surrounding regions with a different original quality will show a local minimum at a different q. A spatial map of which quality factor minimises local difference reveals quality-inconsistent regions.

Limitations of the JPEG ghost include: the method is computationally expensive (requiring one full re-compression per quality factor tested); it is sensitive to any geometric transformation between the source and the pasted region; and it is less reliable when the quality difference between source and host is small (fewer than 5-10 quality points). When the quality difference is large, the ghost is visually clear and easy to interpret. When it is small, quantitative threshold analysis is required.

DCT artifact analysis in video forensics

Block-based DCT compression is not exclusive to still JPEG images. Video codecs including MPEG-2, H.264/AVC, and H.265/HEVC encode intra-coded frames (I-frames) using the same DCT-on-blocks paradigm. For inter-coded frames (P-frames and B-frames), motion compensation is applied before the DCT step, but residual error blocks are still DCT-coded. This means the same blocking artifact analysis methods apply to video, with some important adaptations.

In video tampering, the most common manipulation types are frame deletion, frame insertion, and region replacement within a frame. Frame deletion removes frames from a sequence and causes a discontinuity in the temporal artifact pattern: the blocking artifacts at the cut point show a sudden change in statistical properties. Region replacement within a frame introduces a spatial phase inconsistency within an individual frame, detectable by the same phase estimator used for still images. Frame insertion introduces frames with a different quantisation history into a sequence, visible as a sudden change in the DCT coefficient histogram pattern.

The analysis is complicated by the fact that video codecs apply adaptive quantisation: different regions of the same frame may be quantised at different strengths depending on their motion and complexity. A forensic examiner must use tools that account for the codec's adaptive quantisation map rather than assuming uniform quantisation across the frame. This is covered in more detail in the topic on video double-compression analysis.

Presenting DCT evidence in legal proceedings

DCT block analysis evidence has been presented in criminal and civil proceedings in multiple jurisdictions. In the United States, the admissibility framework under Daubert v. Merrell Dow Pharmaceuticals Inc. (1993) requires that the technique be testable, subject to peer review, have a known error rate, and be generally accepted in the relevant scientific community. DCT-based methods satisfy these criteria: the algorithms are published in peer-reviewed literature, multiple independent implementations exist, and the underlying mathematics of JPEG compression is well established.

In the United Kingdom, forensic image authentication is governed by the Forensic Science Regulator's codes of practice and the guidance of the Chartered Society of Forensic Sciences. Expert witnesses must be able to explain the method to a non-technical jury, quantify their uncertainty, and acknowledge the limitations specific to the image under examination. In EU member states, the standard varies by country but the general requirement for scientific validity and expert qualification is consistent.

Under India's Bharatiya Sakshya Adhiniyam 2023 (which replaced the Indian Evidence Act 1872), electronic records admitted as evidence must satisfy the conditions in Section 63, which requires a certificate from a responsible person and compliance with prescribed procedures. An expert who presents DCT analysis under this framework must document the examination procedure, the tools used, the unaltered state of the file examined (supported by hash values), and the chain of custody, which is addressed more fully in the topic on chain of custody for digital media.

A court-ready DCT report should include: the hash value of the original file, the specific algorithm and tool version used, the parameters applied (window size, quality factor range for JPEG ghost), the output images or maps with clear legends, the finding expressed as an observation rather than a conclusion about guilt (for example, 'the blocking artifact phase in region A is inconsistent with the surrounding image, indicating that region A was encoded in a different compression pass'), and a plain-language summary of what the finding means and what it cannot determine.

Worked example

Analysing a suspected composite photograph using DCT grid phase

A photograph submitted as evidence in a financial fraud investigation appears to show a person at a specific location. The defence questions its authenticity. This walkthrough traces the DCT analysis from file intake to report.

The submitted JPEG (3024x4032 pixels, approximately 4.2 MB) purports to come from a smartphone. Visual inspection shows a person in the foreground against an outdoor background. The lighting on the person appears inconsistent with the background shadows. DCT analysis is requested to assess whether the image contains regions with different compression histories.

File intake and hash. The file is received on write-blocked media, hashed (SHA-256), and copied to the working directory. The hash is recorded in the case file. All subsequent analysis is performed on the copy, and the hash of the copy is verified to match.
JPEG structure inspection. JPEGsnoop is used to read the file header. The DQT marker shows a single luminance quantisation table with a maximum value of 8, consistent with a quality factor of approximately 92. The EXIF data claims the image was taken with a specific smartphone model. The quantisation table is compared against the database of known device profiles: it matches the claimed camera model's default table.
Block grid phase map. Amped Authenticate's blocking artifact analysis module is applied to the luminance channel. The phase estimator uses 64x64 pixel windows at 8-pixel steps across the image. The resulting phase map shows a dominant phase of (0,0) across most of the image, consistent with a standard grid origin. However, a region covering roughly the foreground figure shows a dominant phase of (4,2): a 4-pixel horizontal and 2-pixel vertical shift from the surrounding content.
JPEG ghost analysis. The JPEG ghost sweep is run from quality 50 to 99. For the background region, the difference minimum occurs at quality 92, consistent with the file header's quantisation table. For the foreground figure, the difference minimum occurs at quality 78, indicating the figure was originally saved at a lower quality setting before being pasted into the host image.
Boundary localisation. The phase discontinuity and the JPEG ghost boundary are overlaid on the image. Both methods independently identify the same spatial boundary, enclosing the foreground figure. The two independent detectors producing coincident boundaries increases confidence in the finding. The boundary follows the outline of the person, not any rectangular region, which indicates the compositor used a feathered selection or masking tool.
Report preparation. The report states: (1) the background region shows phase (0,0) and quality factor 92, consistent with single JPEG compression from the claimed device; (2) the foreground region shows phase (4,2) and quality factor 78, inconsistent with the surrounding content; (3) the boundary between the two regions follows the contour of the person; (4) these findings are inconsistent with an unmodified single-source photograph and indicate the foreground element was sourced from a separately compressed image. The report does not assert who made the composite or why.

Check your understanding

Question 1 of 4· 0 answered

A JPEG image is split into 8x8 pixel blocks before DCT encoding. Why does this create a forensically useful artifact?

Key Takeaways

JPEG encoding divides images into an 8x8 pixel grid and quantises each block independently, leaving blocking artifacts whose spatial phase is set by the grid origin and whose strength depends on the quantisation table. Both quantities are readable from the file and measurable in the pixel data.
A pasted region from a different source image brings its own block grid phase. Mapping the dominant phase across the image reveals spatial boundaries between regions with different compression histories, independent of image content.
Double JPEG compression leaves a DCT coefficient histogram signature when the first and second quantisation steps are misaligned in scale (periodic missing values). Misaligned grid double compression adds a second, stronger signature: two coexisting blocking phases within the same image.
The JPEG ghost technique sweeps candidate quality factors and identifies regions whose re-compression error is minimised at a quality factor different from the rest of the image, localising quality-inconsistent pasted regions.
DCT analysis extends to video I-frames using the same methods as still images. Expert reports in any jurisdiction (US Daubert, UK FSR codes, India's Bharatiya Sakshya Adhiniyam 2023) must document the algorithm, tool version, file hash, and the chain of custody, and should express findings as observations about the compression history rather than conclusions about intent.

What is a DCT blocking artifact and why does it matter forensically?

A blocking artifact is a visual discontinuity at the boundary between adjacent 8x8 pixel blocks in a JPEG image. Each block is transformed and quantised independently, so slight differences in reconstruction accumulate at borders. Forensically, these artifacts encode the quantisation table and grid alignment used during each compression pass. Inconsistencies between blocks indicate that part of the image was compressed separately, which is a strong signal of splicing or region replacement.

What is double JPEG compression and how is it detected?

Double JPEG compression occurs when a JPEG image is decoded and then re-saved as JPEG, subjecting it to a second round of quantisation. If the two compression passes used the same 8x8 block grid, the DCT coefficient histograms show characteristic double-quantisation peaks. If the grids are misaligned, the second pass creates new blocking artifacts at an offset phase from the first, and the affected region shows artifacts at both phases simultaneously. Detectors measure the periodicity and phase of blocking strength across the image to distinguish aligned from misaligned double compression.

What is block grid phase and why does a spliced region have a different phase?

Block grid phase refers to the horizontal and vertical offset of the 8x8 compression grid relative to the image origin. A camera or software encodes an image with a fixed phase, typically starting from the top-left corner. When a region is cut from a second image with a different grid origin and pasted into the first, the pasted region carries its own phase. The host image and the pasted region therefore show blocking artifacts at offset positions. Measuring the dominant phase in each region of the image and mapping discontinuities reveals where boundaries between different source images fall.

Can DCT analysis detect tampering in video as well as still images?

Yes. Video codecs such as H.264 and HEVC use block-based DCT transforms for intra-coded frames, producing the same type of quantisation grid structure as still JPEG images. Frame deletion, insertion, or region replacement in video leaves inconsistencies in the blocking artifact pattern across frames, particularly in intra-frames. Analysts examine the per-frame blocking strength map and inter-frame consistency to detect anomalies. The technique is closely related to video double-compression analysis.

What tools are used for DCT grid analysis in casework?

Forensic suites such as Amped Authenticate include dedicated blocking artifact analysis modules that compute the block-grid phase map, double-quantisation histograms, and differential blocking energy maps. Hex editors and JPEG structure parsers (ExifTool, JPEGsnoop) reveal the quantisation table directly from the file header. Academic toolkits including the MATLAB Image Forensic Toolbox implement published algorithms for JPEG ghost and double-quantisation detection. In court-ready work, examiners document the specific algorithm used, its parameters, and the chain of custody of the unmodified file.

Test yourself on Multimedia Authentication and Deepfake Forensics with free, timed mocks.

Practice Multimedia Authentication and Deepfake Forensics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.