Facial Image Comparison: Morphological and Holistic Methods

Facial image comparison is the forensic discipline of examining still or video images to determine whether two faces belong to the same person, using anatomical landmark analysis and holistic assessment under rigorous methodological standards.

Last updated: 19 Jun 2026

Forensic facial image comparison is the systematic examination of still or video images to assess whether two facial depictions originate from the same individual. Examiners combine morphological analysis, which catalogues anatomical structures region by region, with holistic assessment, which evaluates the face as an integrated whole. The Facial Identification Scientific Working Group (FISWG), established in 2008, provides the internationally adopted methodology and conclusion vocabulary, including a structured scale running from Exclusion through Inconclusive to Identification. Supplementary techniques such as ear comparison and photo-anthropometric overlay extend the analysis when direct facial comparison is constrained by image quality, pose, or occlusion.

A grainy CCTV still, a passport photograph, and a person sitting in the dock: the question is whether these faces belong to the same individual. No fingerprint is available. No DNA was left behind. The investigating officer has two images and a suspect, and the case may turn on what a facial image examiner can and cannot say about them. This is the territory of forensic facial image comparison: the systematic examination of images to assess whether two facial depictions share a common origin.

The discipline draws on two complementary approaches. Morphological analysis works feature by feature, cataloguing the shape of the orbital region, the nasal profile, the ear, the chin line, and dozens of other anatomical structures, then asking whether the pattern of similarities and differences between two images is consistent with one identity or two. Holistic assessment asks the examiner to take the face as a whole, drawing on the same perceptual processes humans use to recognise one another, but guided by training and anchored to a structured conclusion scale. Neither approach alone is sufficient; skilled examiners combine them.

The field has been shaped by two main institutional voices. In the United Kingdom the technique has been called Facial Mapping, and it reached the courts through cases in the 1990s and 2000s. Internationally, the Facial Identification Scientific Working Group (FISWG), established in the United States in 2008 with broader representation, developed the terminology and guidelines that are now the reference standard. This topic walks through how both morphological and holistic methods work, how the ear and photo-anthropometric overlay fit in, and what the FISWG conclusion scale actually asks an examiner to say.

By the end of this topic you will be able to:

Describe the five anatomical regions examined in morphological facial comparison and the rationale for treating each independently.
Explain why holistic assessment supplements rather than replaces feature-level analysis, and identify where examiner bias enters that process.
Interpret a conclusion on the FISWG scale and state what documentary requirements accompany each level.
Assess when ear comparison and photo-anthropometric overlay add evidential value and what conditions limit their reliability.
Distinguish Facial Mapping (UK/Australia) from FISWG Facial Image Comparison in terms of documentation requirements and conclusion vocabulary.

Key terms

Facial Mapping: The term used primarily in the UK and Australia for the forensic comparison of facial images. Now largely superseded in international literature by the FISWG term Facial Image Comparison, though the underlying methods are equivalent.
Morphological analysis: A feature-by-feature examination of anatomical structures, comparing the shapes of orbital, nasal, auricular, mandibular, and soft-tissue regions between a reference image and a questioned image.
Holistic assessment: Evaluation of the face as an integrated whole rather than as a sum of parts, drawing on the examiner's trained perceptual processing to form an overall impression that supplements feature-level analysis.
FISWG: The Facial Identification Scientific Working Group, a US-originated but internationally engaged body that produced guidelines for forensic facial comparison, including a standardised methodology and conclusion vocabulary.
Photo-anthropometric overlay: A technique that aligns and scales two facial images and computes proportional distances between anatomical landmarks, used when direct feature comparison is constrained by pose or resolution.
Conclusion scale: The structured verbal scale recommended by FISWG, ranging from Exclusion through Inconclusive to Identification, used to communicate the weight of facial comparison evidence to courts and investigators.

A brief history: from Facial Mapping to FISWG

Forensic facial comparison reached UK courts in the 1990s, largely through the work of practitioners who had backgrounds in medical illustration and physical anthropology. The term Facial Mapping entered legal vocabulary to describe the process, and early cases relied on a relatively informal methodology that varied between practitioners. The reliability of the field was contested, and courts struggled with how to weigh evidence that lacked the population-frequency databases underpinning fingerprint and DNA work.

In Australia, similar debates played out, with the Bluey Clark case in the 1990s among the early examples of facial image evidence being tested in court. In the United States, law-enforcement interest was driven partly by the expansion of photographic surveillance and partly by post-9/11 investment in biometric identification. The FISWG was established in 2008, bringing together facial comparison practitioners, cognitive psychologists, and computer scientists. Its guidelines, published in 2012 and updated since, established a methodology built around documentation, sequential unmasking, and a structured conclusion vocabulary.

Morphological analysis: working through the anatomy

Morphological facial comparison is systematic: the examiner records the characteristics of a defined set of anatomical regions and assesses whether the pattern of correspondences and differences between two images is consistent with a single identity or two distinct individuals. The regions examined follow anatomical logic.

Orbital region: eye shape, palpebral fissure inclination, medial and lateral canthal angles, epicanthal fold presence, lid crease morphology, and eyebrow form and position.
Nasal region: dorsal profile (straight, convex, concave), nasal tip shape, tip projection, nostril shape and orientation, columella visibility, and alar base width relative to facial width.
Auricular region: helix shape and attachment, antihelix bifurcation, lobule form (attached, free, intermediate), tragus size and shape, concha depth, and overall ear size and projection.
Mandibular and chin region: chin form (pointed, rounded, squared), chin projection, jaw angle definition, gonial angle, and the overall lower-face shape in frontal view.
Soft-tissue mouth region: lip vermilion border shape, philtrum column definition, philtrum width, commissure position, and the relative proportions of upper and lower lips.

The examiner records whether features show correspondence, difference, or are indeterminate (due to pose, resolution, lighting, or age difference between images). A difference in a single feature does not automatically mean exclusion, because the same person photographed years apart, at different angles, or under different lighting conditions will show apparent variation. The comparison matrix as a whole drives the conclusion, not any single outlier.
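One way to picture the comparison matrix is as a list of per-feature records that can be tallied by region. The sketch below is illustrative only: the names `Finding`, `Observation`, and `summarise` are hypothetical, the entries are invented for the example, and the tally deliberately stops short of producing a conclusion, since that step remains the examiner's judgement.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class Finding(Enum):
    CORRESPONDENCE = "correspondence"
    DIFFERENCE = "difference"
    INDETERMINATE = "indeterminate"


@dataclass(frozen=True)
class Observation:
    region: str       # e.g. "nasal"
    feature: str      # e.g. "dorsal profile"
    finding: Finding
    note: str = ""    # why a feature is indeterminate, or what was seen


def summarise(matrix):
    """Count findings per region so the pattern as a whole can be reviewed."""
    summary = {}
    for obs in matrix:
        summary.setdefault(obs.region, Counter())[obs.finding] += 1
    return summary


# Invented entries for illustration; not taken from any real case.
matrix = [
    Observation("nasal", "dorsal profile", Finding.CORRESPONDENCE, "slightly convex in both"),
    Observation("orbital", "lid crease morphology", Finding.INDETERMINATE, "resolution too low"),
    Observation("auricular", "lobule form", Finding.CORRESPONDENCE, "free in both"),
    Observation("mandibular", "chin form", Finding.DIFFERENCE, "may reflect pose"),
]

for region, counts in summarise(matrix).items():
    print(region, {finding.value: n for finding, n in counts.items()})
```

Keeping a note on every indeterminate or differing entry means a reviewer can see why a single apparent difference was not treated as grounds for exclusion.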

Facial landmark zones used in morphological comparison.

Holistic assessment and its cognitive basis

Face recognition is one of the most heavily studied areas of human perception. A substantial body of research from cognitive psychology shows that people, including trained examiners, process faces holistically: the brain encodes a face as a unified pattern, not as a list of features assembled at runtime. This is why inverting a face dramatically impairs recognition, and why a feature that looks right in isolation can look wrong once the rest of the face changes around it.

In forensic facial comparison, holistic assessment serves as a check on the morphological feature matrix. After working through the anatomy, an examiner also forms an overall impression: does the general facial gestalt fit with the two images being the same person? This step can catch things the feature-by-feature pass misses, particularly when subtle proportional relationships across the face carry identity signal that no single feature captures alone. But it is also where examiner subjectivity is highest, which is why FISWG guidelines require that the holistic assessment supplements rather than replaces the documented feature analysis.

Approach	Strength	Weakness
Morphological (feature-by-feature)	Explicit, documentable, reviewable step by step	May miss identity signal carried by proportional relationships across features
Holistic assessment	Captures gestalt identity signal, mirrors natural face processing	Susceptible to examiner bias; hard to audit or reproduce
Combined (FISWG best practice)	Systematic documentation plus perceptual synthesis	Requires more training and takes longer per case

Research by David White, Richard Kemp, and colleagues at UNSW and other institutions has repeatedly shown that even trained facial comparison examiners make errors when working with poor-quality images. Accuracy drops significantly when the two images to be compared differ in pose, lighting, or age. This research has been influential in pushing the field toward formalised methodology and quality controls, and in reinforcing that facial comparison evidence should be presented with explicit statements of its limitations.

Ear comparison as a supplementary method

The external ear, or pinna, has a complex three-dimensional structure made up of cartilaginous ridges and hollows that are largely set by genetics and remain relatively stable through adult life. The helix, antihelix, tragus, antitragus, concha, and lobule all show variation across individuals, and the overall shape and size of the ear can be assessed from a two-dimensional image if the angle of view is favourable.

Ear comparison entered forensic literature in the late 1990s, and Dutch police officer and forensic instructor Cor van der Lugt was among the early systematic researchers. In the UK, earprint evidence was central to the Dallagher case: Mark Dallagher was convicted of murder in 1998 partly on earprint evidence, but his conviction was quashed by the Court of Appeal in 2002 and he was acquitted at retrial in 2004 after DNA analysis failed to link the earprint to him. The evidentiary value of ear comparison on its own remains debated, and the field acknowledges that the population-frequency data for ear morphology are thin compared with fingerprint or DNA databases. In practice, ear comparison is most useful when the face is fully obscured or when the ear happens to be captured clearly on surveillance footage that shows the face at an unhelpful angle.

Anatomical regions of the external ear used in forensic comparison.

Photo-anthropometric overlay and superimposition

Photo-anthropometric overlay is a technique that places two images in alignment and measures the proportional relationships between anatomical landmarks: the intercanthal distance, the alar width, the mouth width, the distance between pupil centres, and several other fixed-point pairs. The examiner converts these measurements to ratios that are independent of absolute image scale, then compares the ratio profiles between the reference and questioned images.
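The conversion from pixel measurements to scale-independent ratios can be sketched in a few lines. This is a minimal illustration, assuming landmarks have already been placed as (x, y) pixel coordinates; the landmark names, the coordinates, and the choice of interpupillary distance as the reference length are assumptions made for the example, not a prescribed protocol.

```python
from math import dist, isclose


def ratio_profile(landmarks, pairs, reference_pair):
    """Express each landmark-pair distance as a fraction of a reference distance."""
    ref = dist(landmarks[reference_pair[0]], landmarks[reference_pair[1]])
    if ref == 0:
        raise ValueError("reference landmarks coincide")
    return {
        name: dist(landmarks[a], landmarks[b]) / ref
        for name, (a, b) in pairs.items()
    }


# Hypothetical pixel coordinates for one frontal image.
landmarks = {
    "left_pupil": (100, 100), "right_pupil": (164, 100),
    "left_inner_canthus": (116, 100), "right_inner_canthus": (148, 100),
    "left_alare": (114, 140), "right_alare": (150, 140),
    "left_mouth_corner": (106, 170), "right_mouth_corner": (158, 170),
}
pairs = {
    "intercanthal": ("left_inner_canthus", "right_inner_canthus"),
    "alar_width": ("left_alare", "right_alare"),
    "mouth_width": ("left_mouth_corner", "right_mouth_corner"),
}
reference = ("left_pupil", "right_pupil")

profile = ratio_profile(landmarks, pairs, reference)

# Enlarging the whole image leaves every ratio unchanged.
enlarged = {name: (x * 2.5, y * 2.5) for name, (x, y) in landmarks.items()}
enlarged_profile = ratio_profile(enlarged, pairs, reference)
print(all(isclose(profile[k], enlarged_profile[k]) for k in profile))
```

The ratios remove absolute image scale only. They do not remove the effects of pose or camera geometry.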

The approach derives from craniofacial anthropometry. In forensic contexts the numerical output carries apparent objectivity, but that appearance requires careful qualification. Accurate landmark localisation requires adequate image resolution, and even small errors in placing a landmark can shift ratio values substantially. More critically, all metric comparisons are sensitive to camera geometry: a face photographed with a telephoto lens at a distance will have systematically different apparent proportions from the same face photographed with a wide-angle lens at close range. Correcting for this requires knowing or estimating the camera and its focal length, which is rarely possible from crime-scene footage.
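The sensitivity to landmark placement can be made concrete with simple arithmetic. The sketch below assumes, purely for illustration, that each landmark may be misplaced by up to one pixel, so each measured distance may be off by up to two pixels. The function name `ratio_bounds` and the pixel values are hypothetical.

```python
def ratio_bounds(numerator_px, denominator_px, landmark_error_px=1.0):
    """Worst-case bounds on a ratio of two landmark distances.

    Each distance joins two landmarks, so it can be wrong by up to
    twice the per-landmark placement error.
    """
    slack = 2 * landmark_error_px
    if denominator_px - slack <= 0 or numerator_px - slack < 0:
        raise ValueError("distances are too short for this error bound")
    low = (numerator_px - slack) / (denominator_px + slack)
    high = (numerator_px + slack) / (denominator_px - slack)
    return low, high


# The same nominal ratio (12/20 = 0.60) at two image resolutions.
for label, scale in (("low resolution", 1), ("ten times the resolution", 10)):
    low, high = ratio_bounds(12 * scale, 20 * scale)
    print(f"{label}: nominal 0.60, bounds {low:.2f} to {high:.2f}")
```

Under these assumptions a nominal ratio of 0.60 could be anywhere from roughly 0.45 to 0.78 at the lower resolution, while the same placement error at ten times the resolution narrows the range to roughly 0.58 to 0.62.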

When the geometry can be controlled, for instance when measurements are taken from calibrated photography of a suspect alongside the crime-scene image, anthropometric overlay adds a useful quantitative dimension to the comparison. When the geometry is uncertain, the numerical output can create a false impression of precision, and examiners are expected to state clearly which conditions applied.
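The effect of camera geometry on apparent proportions can be illustrated with an idealised pinhole model. The sketch below is a simplification under stated assumptions: the face dimensions are round placeholder numbers rather than anthropometric data, the camera sits on the facial midline, and the function names are hypothetical. In this model a landmark pair lying further back on the head (the ears) shrinks faster than a pair near the front (the nostrils) as the camera moves closer.

```python
def projected_width(half_width_mm, depth_mm, camera_distance_mm, focal_length_mm=1.0):
    """Image-plane width of a symmetric landmark pair under a pinhole model.

    depth_mm is how far behind the nearest point of the face the pair lies.
    """
    return 2 * focal_length_mm * half_width_mm / (camera_distance_mm + depth_mm)


def ear_to_alar_ratio(camera_distance_mm, focal_length_mm=1.0):
    # Placeholder geometry: ears 75 mm either side, 100 mm back;
    # alar base 17 mm either side, 15 mm back.
    ear = projected_width(75.0, 100.0, camera_distance_mm, focal_length_mm)
    alar = projected_width(17.0, 15.0, camera_distance_mm, focal_length_mm)
    return ear / alar


for distance_mm in (300, 1000, 5000):
    print(f"camera at {distance_mm} mm: ear-to-alar ratio {ear_to_alar_ratio(distance_mm):.2f}")
```

In this simplified model the focal length cancels out of the ratio and the subject distance does the work; lens choice matters in practice because it dictates how far away the camera must be to frame the face. Either way, the same face yields different ratio profiles at different distances unless the geometry is known and corrected for.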

The FISWG conclusion scale

One of FISWG's most practically important contributions was a standardised conclusion vocabulary. Before this standardisation, different examiners used phrases such as consistent with, cannot be excluded, and highly probable in ways that courts interpreted inconsistently. FISWG proposed a scale anchored at both ends by strong conclusions and with graduated intermediate levels, structured to communicate the weight of the evidence in a way that does not overstate certainty.

Exclusion
The examiner concludes the two images depict different individuals. Reserved for cases where there are clear, unambiguous morphological differences that cannot be explained by image variation, pose, or age change.
Supports exclusion
The evidence is more consistent with the images depicting different people than the same person, but the examiner cannot exclude the possibility of one identity.
Inconclusive
The evidence is insufficient to support either conclusion. This can reflect poor image quality, unfavourable pose, extensive occlusion, or a genuine balance of corresponding and differing features.
Supports identification
The evidence is more consistent with the images depicting the same person than different people, but the examiner stops short of positive identification.
Identification
The examiner concludes the two images depict the same individual. This is the strongest positive conclusion and requires a high level of correspondence across multiple independent features with no unaccounted differences.

FISWG conclusion scale from Exclusion to Identification.

The scale is not a probability scale in the statistical sense: there are no likelihood ratios attached to each level, and the verbal labels are not mapped to fixed numerical ranges. What the scale does do is force an examiner to commit to a position rather than hiding behind deliberately vague language, and it gives courts a framework to understand the relative weight of positive and negative conclusions. FISWG guidelines also require that any conclusion be accompanied by an explanation of what features drove it and what limitations affected the analysis.
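For case-management or reporting software, the scale and its accompanying documentation requirement could be represented along the following lines. This is a sketch, not a FISWG artefact: the class names are hypothetical, the integer values encode ordering only and carry no probabilistic meaning, and the rule that a limitations statement is always required is an assumption made for the example.

```python
from dataclasses import dataclass
from enum import IntEnum


class Conclusion(IntEnum):
    # Integers give an ordering only; they are not probabilities or likelihood ratios.
    EXCLUSION = -2
    SUPPORTS_EXCLUSION = -1
    INCONCLUSIVE = 0
    SUPPORTS_IDENTIFICATION = 1
    IDENTIFICATION = 2


@dataclass(frozen=True)
class ReportedConclusion:
    level: Conclusion
    driving_features: tuple   # features that drove the conclusion
    limitations: tuple        # factors that limited the analysis

    def __post_init__(self):
        # Assumption for this sketch: every report states its limitations,
        # and every conclusion other than Inconclusive names its driving features.
        if not self.limitations:
            raise ValueError("a conclusion must state the limitations of the analysis")
        if self.level is not Conclusion.INCONCLUSIVE and not self.driving_features:
            raise ValueError("a conclusion must state which features drove it")


report = ReportedConclusion(
    level=Conclusion.SUPPORTS_IDENTIFICATION,
    driving_features=("nasal dorsal profile", "helix form", "lobule shape"),
    limitations=("low resolution in questioned images",),
)
print(report.level.name, report.level > Conclusion.INCONCLUSIVE)
```

Rejecting a report that lacks its explanation at construction time mirrors the requirement that a conclusion never travels without the reasoning behind it.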

Worked example

Bank robbery comparison: two low-resolution frames, one conclusion

Working through a morphological comparison when image quality is the limiting factor.

A bank in central London is robbed by a person wearing a cap and a loose jacket. The CCTV records two brief intervals of useful footage: a frontal frame as the person enters, in which the cap is pushed back enough to reveal the upper face, and a three-quarter view as the person leaves, where the cap is lower but the ear and jaw are visible. A suspect is arrested two days later and photographed. An examiner is asked to compare the CCTV frames with the arrest photograph.

Image assessment. The examiner first characterises each image independently: resolution (approximately 40 pixels across the face width in both CCTV frames), lighting (overhead fluorescent, moderate shadow under the cap brim), and pose (frontal, approximately 10-degree chin tilt in frame 1; 30-degree yaw in frame 2). The arrest photograph is high resolution, full frontal, well lit. The quality mismatch is documented before any comparison begins.
Morphological pass, frontal frame. The examiner can assess the nasal bridge and dorsal profile (slightly convex, consistent between CCTV and arrest photo), the intercanthal width relative to nasal base width (ratio consistent), and the upper orbital profile. The cap obscures the forehead. The mouth is partially obscured by a scarf in CCTV. Features assessed: nasal and orbital regions. Ear and jaw not visible in this frame.
Morphological pass, departure frame. The three-quarter view reveals the left ear clearly. The examiner records helix form (distinctive partial helix notch at the superior margin), lobule shape (free, moderately large), and antihelix bifurcation. The jaw angle is also visible: the gonial angle appears obtuse and the jaw line relatively smooth. These features correspond with the arrest photograph.
Holistic assessment. After the feature-level pass, the examiner considers the overall proportional impression across both frames together and compares it with the arrest photograph. The overall facial width-to-height ratio, the position of the ears relative to the eye line, and the mid-facial proportions are consistent. No single feature shows a clear discordance.
Conclusion. The examiner reports Supports identification: the combined evidence from nasal morphology, orbital proportions, and auricular morphology is consistent with the images depicting the same person, and no feature provides unambiguous grounds for exclusion. The report explicitly notes the resolution limitation and states that a stronger conclusion is not warranted by the available image quality.

The example demonstrates several principles in practice: the conclusion was not inflated beyond what image quality supported; ear morphology contributed a distinctive feature from a single partial view; and the FISWG scale allowed a proportionate statement the court could weigh alongside other evidence.

Key Takeaways

Morphological facial comparison works feature by feature across orbital, nasal, auricular, mandibular, and soft-tissue regions, building a correspondence matrix that the examiner synthesises into an overall conclusion.
Holistic assessment, grounded in how the brain naturally processes faces, supplements the feature-level analysis but cannot replace the explicit documentation that makes a report reviewable.
Facial Mapping (UK/Australia) and Facial Image Comparison (FISWG) describe the same discipline; FISWG guidelines impose stricter documentation requirements and provide the internationally adopted conclusion vocabulary.
Ear comparison is a valid supplementary method, particularly when the face is occluded, but the evidential weight from auricular morphology alone is limited by thin population-frequency data.
Photo-anthropometric overlay adds metric support when camera geometry is known and controlled; without geometric correction, proportional measurements between images of different origins can be systematically misleading.
The FISWG conclusion scale, from Exclusion to Identification with intermediate levels, requires examiners to commit to a proportionate statement and explain what drove it, preventing the deliberate vagueness that made earlier facial evidence hard for courts to weigh.

What is the difference between Facial Mapping and Facial Image Comparison?

Facial Mapping is a term historically used in the United Kingdom and Australia for the forensic comparison of facial images. Facial Image Comparison is the terminology standardised by FISWG and increasingly adopted internationally. Both refer to the same core discipline, but FISWG's framing emphasises a structured methodology and a defined conclusion scale.

What anatomical regions does a morphological facial comparison examine?

A morphological comparison typically covers orbital features (eye shape, lid creases, canthal angles), nasal features (dorsal profile, tip shape, nostril form), auricular features (ear helix, antihelix, lobule, tragus), mandibular features (chin form, jaw angle), and soft tissue features such as lip morphology and philtrum shape. The examiner works feature by feature, then synthesises the findings into an overall assessment.

What is the FISWG conclusion scale?

FISWG recommends a five-level verbal conclusion scale: Exclusion, Supports exclusion, Inconclusive, Supports identification, and Identification. It is not a probability scale in the statistical sense, and no numerical likelihood ratios are attached to its levels. Instead it gives courts a structured verbal statement of the weight of the evidence.

Can ear comparison alone identify a person?

Ear morphology is considered a useful supplementary method rather than a primary identification tool. The ear's cartilaginous structures remain relatively stable through adult life, but the evidential weight from ear comparison alone is limited by thin population-frequency data and is strongest when corroborated by facial features. It is most useful when the face is obscured or when CCTV captures the ear at a favourable angle.

What is photo-anthropometric overlay and when is it used?

Photo-anthropometric overlay places a reference image over a crime-scene image and measures the proportional relationships between anatomical landmarks. It is used when direct feature comparison is hindered by pose, resolution, or partial occlusion, and when metric data can be extracted reliably. The technique requires careful camera-geometry correction to avoid distortion errors.
