Writing a Media Authenticity Examination Report

A media authenticity report must document the exhibit received, the methods applied, the findings at each analytical step, and the conclusion drawn, without overstating what probabilistic results can prove. This topic covers report structure, hedging language for statistical outputs, and how to communicate the limits of deepfake detectors and error level analysis to a non-specialist reader.

Last updated: 24 Jun 2026

A media authenticity examination report is a formal document in which a forensic examiner states what digital exhibit was received, what analytical methods were applied, what each method found, and what conclusion follows from the aggregate findings. The report is the examinable record of the examination: it must be transparent enough that a second examiner could read it, understand every step taken, and independently assess whether the conclusion is supported. This requirement governs every structural choice, from how to describe the exhibit at intake to how to qualify a deepfake detector's probability score before presenting it as a finding.

Media authenticity examinations combine several technical methods: error level analysis, metadata and provenance inspection, noise consistency checks, compression history analysis, and AI-based deepfake detectors. Each method has a defined scope and documented failure modes. The report must reflect both what each method found and what it cannot determine. A deepfake detector returning a 94% synthetic probability is not the same as a confirmed finding of manipulation; a report that presents it as such misstates the science and creates a point of attack in cross-examination or appellate review.

Courts in different jurisdictions impose different admissibility standards, but all of them share one underlying expectation: the examiner can explain the method, identify its error rate, and state the basis for the conclusion. In the United States this is formalised under Federal Rule of Evidence 702 and the Daubert framework. In England and Wales the Law Commission's 2023 Expert Evidence consultation addresses digital forensic standards directly. In India, the Bharatiya Sakshya Adhiniyam 2023 governs expert opinion evidence. A report built around transparent methodology and honest limitation disclosure will satisfy the foundational requirements of all these frameworks, even when the underlying detection technology differs.

By the end of this topic you will be able to:

Describe the mandatory sections of a media authenticity report and explain the function of each.
Apply appropriate hedging language to qualify probability scores from deepfake detectors and ELA outputs without understating or overstating their evidential weight.
Explain the key limitations of error level analysis and AI-based deepfake detectors to a non-specialist reader.
Draft a findings paragraph that separates observation, inference, and conclusion across three distinct analytical steps.
Identify the admissibility requirements that a media authenticity report must address in US, UK, Indian, and EU legal contexts.

Key terms

Error level analysis (ELA): A technique that re-compresses a JPEG at a controlled quality setting and maps the pixel-level difference between the re-compressed and original images. Regions altered after the last full save cycle show different error levels than surrounding unaltered areas. ELA does not prove manipulation; it identifies regions warranting further examination.
Deepfake detector: A machine-learning classifier trained to distinguish authentic recordings from AI-synthesised or face-swapped media. Outputs a probability score rather than a binary verdict. Published false-positive and false-negative rates are central to interpreting the score in a report.
Hedging language: Qualified phrasing that accurately conveys the degree of certainty a method supports. Examples: 'consistent with', 'the findings are indicative of', 'cannot be excluded', and 'the balance of indicators suggests'. Hedging is not evasion; it is precision about what the method can and cannot establish.
C2PA (Coalition for Content Provenance and Authenticity): An open technical standard that embeds cryptographically signed provenance assertions into media files at the point of capture or editing. A C2PA manifest, when intact and verified, provides chain-of-custody evidence about a file's creation history. A missing or broken manifest is consistent with tampering but is not proof of it.
Hash verification: The process of computing a cryptographic hash (SHA-256 or equivalent) of the exhibit file and comparing it against a previously recorded value to confirm the file has not been altered since acquisition. Hash values must be logged at intake and again at the time of examination. Any mismatch is a chain-of-custody failure that must be disclosed.
False positive rate: The proportion of authentic media files that a detection tool incorrectly classifies as manipulated or synthetic. Deepfake detectors and ELA tools both have published or empirically derived false-positive rates that must be cited in the report when their outputs are used as evidence.

Report structure: mandatory sections

A media authenticity report has a defined structure because courts and reviewing experts need to locate specific information quickly and assess whether the examination was conducted properly. The sections below are mandatory in the sense that omitting any of them leaves a gap that will emerge under cross-examination or peer review.

Section	Content	Why it matters
Exhibit description	File name, format, size, SHA-256 hash, acquisition date, acquisition method, and who submitted the exhibit	Establishes what was actually examined and enables verification by a second examiner
Scope and instructions	The specific question the examiner was asked to answer and any restrictions on the examination	Prevents scope creep findings and anchors the conclusion to the instruction
Qualifications	Examiner's training, relevant experience, and any accreditation held	Required for admissibility under Daubert (US), Criminal Practice Directions (UK), and Bharatiya Sakshya Adhiniyam 2023 (India)
Methods	Each tool and technique applied, version numbers, settings used, and published error rates where available	Allows the work to be replicated and the methodology to be challenged
Findings	Observations from each analytical step, separated from inferences	The factual record the conclusion must be traceable to
Conclusion	A direct answer to the examination question, qualified to the degree of certainty the evidence supports	The output the court or client needs; must not exceed what the findings justify
Appendices	Raw tool outputs, hash logs, annotated screenshots, and any reference material used	The audit trail; required to allow independent review

The findings and conclusion sections must remain clearly separated. A finding is an observation: 'ELA analysis identified elevated error levels in the region covering the subject's face.' A conclusion is an inference drawn from one or more findings: 'The combination of elevated ELA values in the facial region, inconsistent EXIF metadata, and a high synthetic probability score from the deepfake detector is consistent with AI-assisted facial replacement.' Merging observations with inferences makes the report harder to challenge and harder to defend.

Describing the exhibit and chain of custody

The exhibit description section establishes the integrity of the examination before any analytical finding is mentioned. If the file examined cannot be shown to be the same file that was submitted, every subsequent finding is contestable. The minimum information to record is the original file name, the file format and codec, the file size in bytes, the SHA-256 hash computed at acquisition, the date and time of acquisition, the name and role of the person who submitted the exhibit, and the storage medium from which it was imaged.

The examiner must also record the hash of the working copy used during examination and confirm it matches the acquisition hash. If examination requires format conversion (for example, re-muxing a video to allow frame-level analysis), the report must document the conversion step, the tool used, and the hash of both the original and converted file. Undisclosed format conversion is a chain-of-custody failure that defence counsel will identify.

Describing ELA findings with appropriate limits

Error level analysis is one of the most commonly applied and most frequently misrepresented techniques in image authentication. The output is a visualisation, not a score: regions with elevated residual error after controlled re-compression appear brighter. The challenge in report writing is that ELA outputs look compelling but rest on several assumptions that may not hold for a given file.

The limitations that the report must disclose: First, JPEG re-compression history matters. An image that has been saved multiple times before reaching the examiner will have a flattened error level baseline across all regions, making elevated regions harder to identify. Second, different camera manufacturers and image editing applications apply different compression algorithms, producing different baseline error patterns for unaltered regions. Third, ELA cannot distinguish between alteration for deceptive purposes and innocent re-encoding, such as a thumbnail generation or an automated social-media compression pass. Fourth, the technique has no published, standardised false-positive rate across image types because results depend so heavily on the individual file's compression history.

A correctly written ELA finding looks like this: 'ELA analysis at a re-compression quality of 70 was applied to the exhibit. The facial region of the subject (approximately bounded by coordinates [x1,y1] to [x2,y2] in the 1920x1080 frame) showed residual error levels approximately 40% higher than the surrounding background regions. This result is consistent with that region having a different compression history than the surrounding image. ELA alone cannot establish whether this difference results from deliberate manipulation, innocent re-encoding, or a camera-specific compression artefact. The ELA result is treated as one indicator requiring corroboration from additional analysis.'

Reporting deepfake detector outputs

Deepfake detectors are machine-learning classifiers, not measuring instruments. They return a probability score representing the model's confidence that the input belongs to the synthetic class, given the training data and architecture that model was built on. That score is not the probability that the exhibit is a deepfake; it is the model's output conditional on its training distribution. A score of 94% synthetic from one tool may correspond to a score of 61% synthetic from a different tool examining the same file, because the two models were trained on different synthetic media datasets.

The report must state: the tool name and version, the score returned, the published or empirically established false-positive rate for that tool on a relevant test set, and a qualified conclusion. Where the tool was trained on a specific generation of deepfake technology (for example, FaceSwap or first-generation diffusion models) and the exhibit may involve a newer method, the report must note that the detector's published performance figures may not apply to the exhibit.

Where multiple detectors are applied, the report should present each result separately, note whether they are concordant, and explain any discordance before drawing an aggregate conclusion. Averaging scores from different models with different architectures is not a valid analytical step and should not appear in a report.

Communicating metadata and provenance findings

EXIF metadata, file system timestamps, and C2PA manifests each tell a different part of the provenance story, and each has its own failure modes. The report must describe what was found, what was absent, and what the absence means.

EXIF metadata can be stripped, altered, or injected using widely available tools. A missing EXIF block is consistent with both deliberate removal and routine social-media processing. An intact EXIF block with plausible values is consistent with authenticity but does not confirm it. The report should state whether the EXIF data is present, whether it is internally consistent (date, GPS data, camera model, and focal length should be plausible in combination), and whether the camera model cited in the EXIF data matches the noise characteristics found in PRNU analysis if that was conducted.

A C2PA manifest, when present and verified, provides a cryptographically signed record of the file's creation and editing history. The report should state whether a manifest was present, whether the cryptographic signature verified against the issuing certificate, and what the manifest asserts about the file's origin. A broken or absent manifest is not proof of tampering; it means the cryptographic provenance chain is unavailable. This distinction must be stated clearly. See image file format integrity checks for the full range of file-level integrity indicators.

Conclusions, admissibility, and jurisdiction

The conclusion section answers the examination question. It should be a single, direct statement followed by the qualified basis for it. The statement must not claim more certainty than the aggregate findings support, and it must not be so heavily hedged that it fails to answer the question. 'The examination is inconclusive' is a valid conclusion when the evidence genuinely does not resolve the question; it is not a safe default to avoid commitment.

Admissibility requirements vary. Under Federal Rule of Evidence 702 in the United States, the court acts as gatekeeper, assessing whether the method is based on sufficient facts, is the product of reliable principles and methods, and has been reliably applied. The Daubert framework explicitly requires consideration of error rates and peer review. In England and Wales, the Criminal Practice Directions require expert reports to state the range of opinion on the topic, any literature relied upon, and a summary of the conclusions. The Law Commission's 2023 report on expert evidence proposed a reliability test that mirrors Daubert more closely and is expected to influence future case law.

In India, the Bharatiya Sakshya Adhiniyam 2023 governs expert opinion evidence (replacing the Indian Evidence Act 1872). Section 39 of the Act addresses opinions of experts, and courts have increasingly scrutinised the methodology behind digital forensic conclusions. The Bharatiya Nagarik Suraksha Sanhita 2023 governs procedural requirements for evidence submission in criminal proceedings. In the EU, national procedural rules apply, but the European Digital Media Observatory has published guidance on forensic standards for digital media in legal proceedings. Across all these systems, a report that discloses its methods, error rates, and limitations is far better positioned for admissibility than one that presents tool outputs as self-evidently conclusive.

Worked example

Drafting a findings section for a disputed video clip

A 90-second video clip submitted in a civil defamation matter shows a named individual making a statement they deny making. The examiner is asked whether the video shows signs of manipulation. The following steps illustrate how to build and write the findings section.

The clip was submitted as an MP4 file. SHA-256 hash was logged at acquisition and confirmed unchanged at the start of examination. The file had no C2PA manifest. EXIF-equivalent container metadata named a smartphone model and a recording date three days before the alleged incident date. The following steps were taken in sequence.

Container and codec analysis. MediaInfo v24.01 was applied. The container metadata reported recording device model X and a creation timestamp of [Date A]. The video stream was H.264, the audio stream was AAC. No embedded C2PA manifest was found. Finding: metadata claims device model X and recording date [Date A]; the claimed date precedes the alleged incident by three days, which is an anomaly requiring explanation.
Double-compression analysis. Quantisation table analysis of the H.264 stream indicated two distinct encoding passes at different quality settings. The outer encoding was consistent with social-media re-compression. The inner encoding used settings inconsistent with the claimed device model's factory codec profile. Finding: the file shows evidence of re-encoding beyond the claimed original capture, but re-encoding alone does not establish deceptive manipulation.
Facial region analysis. Frame-level extraction was performed at 30fps. ELA was applied to 15 reference frames at a re-compression quality of 75. In 12 of 15 frames, the facial region showed residual error levels approximately 35% higher than the background. The elevation was spatially consistent with the face-bounding box across frames. Finding: the facial region shows a different compression history than the background in a majority of sampled frames; this result is consistent with that region having been processed differently but does not establish the cause.
Deepfake detector. FaceForensics++-trained classifier (version noted) was applied to the 90-second clip using a sliding-window frame sampling approach. The per-frame synthetic probability mean was 0.81 (range 0.67 to 0.94). The tool's published false-positive rate on its evaluation benchmark is approximately 9%. Finding: the detector result is consistent with AI-assisted face manipulation. Given the published false-positive rate, the result cannot be treated as a positive identification in isolation.
Aggregate conclusion. The aggregate findings (anomalous container metadata, double-compression signatures inconsistent with the claimed device, elevated facial-region ELA values across multiple frames, and a high synthetic probability score from a validated classifier) are collectively consistent with AI-assisted facial replacement in the subject video. No single finding is individually conclusive. The balance of indicators, considered together, supports the conclusion that the video has been manipulated. This conclusion is held at the level of reasonable forensic probability, not certainty.

Check your understanding

Question 1 of 4· 0 answered

A deepfake detector returns a synthetic probability of 89% for an exhibit. How should this result be presented in the report?

Key Takeaways

Every media authenticity report must contain: exhibit description with hash verification, examiner qualifications, scope, methods with tool versions and error rates, findings separated from inferences, a qualified conclusion, and appendices holding raw outputs.
Deepfake detector scores are model outputs, not measurements. The report must state the tool name and version, the score, the published false-positive rate, and use qualified language: 'consistent with AI-generated content' rather than 'confirmed deepfake'.
ELA identifies regions with a different compression history; it cannot distinguish deliberate manipulation from innocent re-encoding. Its failure modes, particularly sensitivity to prior re-saves and camera-specific compression profiles, must be disclosed in the report.
A verified C2PA manifest supports provenance continuity but does not conclusively prove authenticity. A missing or broken manifest is consistent with tampering but is not proof of it.
Admissibility requirements across US (Daubert/FRE 702), UK (Criminal Practice Directions), India (Bharatiya Sakshya Adhiniyam 2023), and EU frameworks all reward the same thing: transparent disclosure of methods, error rates, and limitations.

What sections must a media authenticity report always contain?

Every media authenticity report should include: an exhibit description (hash, file format, acquisition method), the scope and limitations of the examination, a methods section naming each tool and technique applied, a findings section organised by analytical step, a conclusion that links findings to the examination question without overstating certainty, and a declaration of the examiner's qualifications. Appendices hold raw tool outputs and hash verification logs.

How should a report describe a deepfake detector result?

A deepfake detector returns a probability score, not a binary verdict. The report should state the tool name and version, the score returned, the published false-positive rate for that tool, and a qualified conclusion such as: 'The detector assigned a 94% synthetic probability. Given the tool's published false-positive rate of approximately 8%, this result is consistent with AI-generated content but cannot be treated as definitive without corroborating analysis.' Never report the score alone as a conclusion.

What is error level analysis and what are its limitations?

Error level analysis (ELA) re-compresses a JPEG image at a known quality level and maps the difference between the re-compressed version and the original. Regions that have been altered since the last save cycle typically show different error levels than unaltered regions. Its main limitations are: results vary with the number of prior re-saves, different camera and editing software compression pipelines produce different baselines, and ELA cannot distinguish innocent re-encoding from deliberate manipulation. Reports must state these limitations explicitly.

How should an authenticity report handle a finding that is inconclusive?

Inconclusive findings must be reported as such. The report should state what was examined, what the result was, why it does not resolve the question (for example, insufficient original reference material, loss of metadata during social-media processing, or a method whose detection threshold does not cover the quality of the exhibit), and what additional examination or reference material would be needed to proceed. Omitting inconclusive results or forcing a qualified finding into a binary answer misrepresents the examination.

Do courts in different jurisdictions accept media authenticity reports differently?

Yes. In the United States, expert evidence is governed by Federal Rule of Evidence 702 and the Daubert standard, requiring the court to assess whether the method is scientifically valid and has known error rates. In England and Wales, the Criminal Practice Directions and the Law Commission's 2023 Expert Evidence report govern admissibility. In India, the Bharatiya Sakshya Adhiniyam 2023 (replacing the Indian Evidence Act 1872) addresses expert opinion evidence. EU member states apply national procedural rules but the European Digital Media Observatory has issued guidance on digital forensic standards for court. A well-structured authenticity report, transparently disclosing methods and limitations, is the strongest foundation for admissibility in all these systems.

Test yourself on Multimedia Authentication and Deepfake Forensics with free, timed mocks.

Practice Multimedia Authentication and Deepfake Forensics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.