Skip to content

DNA Phenotyping and RNA Profiling in Forensic Science

Forensic DNA phenotyping (HIrisPlex-S, Parabon Snapshot), RNA body-fluid panels, biogeographic ancestry and ethics.

Last updated:

Share

Forensic DNA phenotyping (FDP) predicts externally visible characteristics, biogeographic ancestry, and approximate age of an unknown DNA donor from SNP genotypes, functioning as an investigative lead rather than an identification tool. RNA profiling uses tissue-specific mRNA and miRNA transcripts to determine which body fluid a stain originated from, a question classical serology cannot always resolve. Both techniques run on the same next-generation sequencing platforms now deployed in major forensic laboratories, including CFSL DNA divisions. Neither substitutes for STR-based identification; they operate upstream of it, narrowing the investigative field before a reference comparison is possible.

Forensic DNA phenotyping (FDP) and RNA profiling are the two newest tools the syllabus packs into. STR analysis tells youwhoseDNA is on the swab, but only if you already have a reference or a database hit. FDP and RNA profiling address the questions that precede that comparison: what the unknown contributor probably looks like, and what body fluid the stain originated from. Both run on the same NGS hardware CFSL DNA divisions are now installing, so examiners has folded them in as an emerging-techniques bullet.

By the end of this topic you will be able to:

  • Distinguish the investigative role of forensic DNA phenotyping from STR-based identification, including what each technique requires and what it returns.
  • Identify the key SNP panels used in FDP (IrisPlex, HIrisPlex, HIrisPlex-S, AIM panels) and state which traits and marker counts each addresses.
  • Describe the mRNA and miRNA markers used to identify specific body fluids and explain why miRNA is preferred for degraded samples.
  • Explain the legal status of FDP outputs under the Bharatiya Sakshya Adhiniyam 2023 and the DNA Technology Bill framework in India.
  • Recognise the principal ethical concerns with FDP, including probabilistic uncertainty, biogeographic ancestry sensitivity, trait creep, and population-database gaps for South Asian samples.
Key terms
Forensic DNA phenotyping (FDP)
Prediction of externally visible characteristics, ancestry and age of an unknown donor from SNP genotypes. Investigative lead, not identification.
Externally visible characteristics (EVCs)
Traits a witness could see: eye, hair, skin colour, freckling, hair shape, height. The EU-funded VISAGE consortium targets this set.
SNP
Single Nucleotide Polymorphism. A one-base variant at a defined position. FDP uses SNP panels, not STRs.
HIrisPlex-S
Reference SNP panel for eye, hair and skin colour. 41 SNPs: 6 IrisPlex (eye), 18 added for HIrisPlex (hair), 17 for the S extension (skin).
Biogeographic ancestry inference (BGA)
Statistical assignment of a donor to continental ancestry groups using Ancestry-Informative SNPs (AIMs). Distinct from racial categorisation.
Parabon Snapshot
Commercial FDP service (Parabon NanoLabs, USA) that returns a predicted composite, ancestry, eye, hair and skin colour from crime-scene DNA.
mRNA / miRNA
Messenger and micro RNA. Forensic RNA profiling uses tissue-specific transcripts of both classes to identify body fluids.
Body-fluid-specific transcripts
RNA markers expressed only in one fluid: HBB for blood, STATH for saliva, PRM1 for semen, MUC4 for vaginal fluid, MMP7 for menstrual blood.
Deconvolution
Statistical separation of a mixed profile (DNA or RNA) into individual contributor profiles.

Why both: STR profiling needs a reference; FDP and RNA profiling do not

Classical STR profiling, covered in the DNA structure, extraction and profiling techniquesbullet, is a comparison method. You run the crime-scene profile against a suspect or a database (CODIS, or the Indian National DNA Database under the DNA Technology Bill framework). No match, no answer.

Forensic DNA phenotyping fills the gap on the front end. From the same swab, FDP returns a probabilistic description of the unknown donor: most likely eye colour, hair colour, skin colour, biogeographic ancestry, and, with methylation assays, approximate age. None of this identifies the person; it narrows the suspect pool the way an eyewitness description would, but from the biology itself.

RNA profiling fills a parallel gap on the sample side. STR profiling tells you whose cells are on the swab; it does not tell you whether those cells came from blood, saliva, semen, vaginal epithelium or menstrual blood. That distinction often decides the charge, and the classical presumptive tests have known specificity problems. mRNA and miRNA panels read the transcript signature of the fluid directly.

The exam framing is straightforward: FDP and RNA profiling areinvestigative aidsnot standalone identification evidence.

AxisForensic DNA phenotypingRNA profiling
Target moleculeGenomic DNA (SNPs, CpG sites)mRNA and miRNA transcripts
What it predictsDonor EVCs, ancestry, ageSource body fluid
Sample requirementSub-nanogram DNA, NGS-gradeCo-extracted RNA, RNase-free handling
Indian admissibilityInvestigative lead only; not enumerated in DNA Technology BillNot yet routine accredited; closer to existing serology
Main ethics concernTrait creep, ancestry misuse, group-based suspicionPrivacy of fluid origin in sexual-offence cases

DNA phenotyping basics: SNP panels, ancestry and age

FDP rests on the observation that a small number of well-chosen SNPs explain most of the heritable variation in a few visible traits. The panels and their targets:

TraitReference panelMarkersPerformance
Eye colourIrisPlex (2011)6 SNPs~94% AUC blue/brown
Hair colourHIrisPlex (2012)24 SNPsRed predicted best
Skin colourHIrisPlex-S (2018)41 SNPs totalStrong at extremes
Biogeographic ancestryAIM panels (Kidd, EUROFORGEN)55-170 SNPsContinental robust
AgeMethylation (ELOVL2, FHL2)5-7 CpG sites±3-5 years (blood)

Points the MCQs lean on: HIrisPlex-S is the integrated panel because it combines eye, hairandskin colour in one assay (41 SNPs); the "S" is the skin extension. Biogeographic ancestry uses Ancestry-Informative Markers (AIMs), SNPs whose allele frequencies differ between continental populations. Age estimation does not use SNPs at all; it uses DNA methylation at CpG sites whose level changes predictably with age, the so-called epigenetic clock.

Indian-population angle: most published FDP panels were trained on European, East Asian and African reference samples. The Centre for DNA Fingerprinting and Diagnostics (CDFD, Hyderabad) has begun population-frequency work on Indian samples, but the validation gap is real and underlies most admissibility objections.

Tools and pipelines (kit to prediction)

The end-to-end FDP workflow moves from DNA extraction through targeted SNP library preparation, NGS sequencing, and prediction software. Representative tools at each stage:

StageRepresentative toolFunction
Library preparationForenSeq DNA Signature Prep; IDseek HID; Precision ID AncestryMultiplex amplification of SNPs, STRs, Y-markers.
SequencingMiSeq FGx (Verogen); Ion S5 (Thermo)Forensic NGS run, 150-250 bp reads.
Variant callingForenSeq Universal Analysis Software; ConvergeGenotype calls plus QC.
Phenotype predictionHIrisPlex-S web tool (Erasmus MC); SnipperEye/hair/skin colour and ancestry probabilities.
Commercial serviceParabon Snapshot (Parabon NanoLabs)End-to-end: sample in, composite plus EVCs plus kinship out.

Two facts to lock in:MiSeq FGx is the dedicated forensic NGS sequencer(Illumina MiSeq chemistry, validated for ForenSeq), and Parabon Snapshot is a commercial FDP service not an open-source pipeline. The Snapshot composite is a probabilistic prediction, not a photograph.

For NGS context, the MiSeq / Ion S5 instruments belong to the same high-throughput, dedicated-software toolkit you meet in the hyphenated instrumental techniquesbullet for chemical evidence.

RNA profiling for body-fluid identification

The classical body-fluids workflowrelies on colour-change presumptive tests (Kastle-Meyer, acid phosphatase, amylase) followed by confirmatory tests (Takayama, p30/PSA, microscopy). The colour tests are sensitive but not specific. RNA profiling answers the body-fluid question by reading the transcripts that are highly expressed only in that fluid.

mRNA markers(tissue-specific, co-extracted with DNA):

  • Blood: HBA, HBB, ALAS2, CD93.
  • Saliva: STATH (statherin), HTN3 (histatin), MUC7.
  • Semen: PRM1, PRM2 (protamines), TGM4, KLK3.
  • Vaginal secretion: MUC4, HBD1, LCN1.
  • Menstrual blood: MMP7, MMP10, MMP11 (matrix metalloproteinases).

miRNA markers are much shorter (~22 nt) and considerably more stable in degraded samples than mRNA. Standard markers include miR-451 / miR-144 for blood, miR-205 for saliva, miR-891a for semen and miR-412 for menstrual blood. miRNA is the class that survives best in aged or degraded stains.

Advantages over classical serology: higher specificity, the ability to identify menstrual blood specifically (classical tests cannot separate menstrual from peripheral blood), and parallel processing on the same extract. Limitations: RNA lability, RT chemistry cost, and the absence of routine accredited deployment in Indian labs so far.

Body-fluid RNA marker panel: mRNA transcripts (stable in fresh stains) and miRNA markers (preferred in degraded samples) that
Body-fluid RNA marker panel: mRNA transcripts (stable in fresh stains) and miRNA markers (preferred in degraded samples) that confirm each fluid type; miR-451 and miR-205 survive RNase degradation whe
mRNAmiRNABloodSalivaSemenVaginalMenstrual bloodHBB, HBA, ALAS2,CD93miR-451, miR-144STATH, HTN3,MUC7miR-205PRM1, PRM2,TGM4, KLK3miR-891aMUC4, HBD1, LCN1miR-124aMMP7, MMP10,MMP11miR-412mRNA (fresh stains)miRNA (degraded stains)
Body-fluid RNA marker panel: mRNA transcripts (stable in fresh stains) and miRNA markers (preferred in degraded samples) that confirm each fluid type; miR-451 and miR-205 survive RNase degradation where full-length mRNA does not

Ethics and the Indian context

FDP predicts traits the analyst cannot observe directly and generates group-level ancestry inferences that can map onto categories with a history of misuse. The principal concerns are:

  1. Probabilistic, not deterministic. FDP returns a probability ("brown eyes, 87%"). Presenting a single colour as fact overstates the science.
  2. Biogeographic ancestry sensitivity. Continental ancestry can be confused for racial or caste categorisation; group-based suspect-pool framing is a real risk in a multi-ethnic country.
  3. Trait creep. The same SNP architecture could extend to behavioural or medical predictions. German and Dutch FDP laws explicitly cap the permitted traits.
  4. Population-database adequacy. Most FDP algorithms are under-trained on South Asian samples, weakening accuracy for Indian donors.

For Indian admissibility, the operative statute is the Bharatiya Sakshya Adhiniyam 2023Section 39 of which governs expert opinion. The BSA does not specifically address FDP, and Indian courts have so far treated FDP outputs as investigative leads, not substantive identification evidence. The DNA Technology (Use and Application) Regulation Bill restricts permitted uses of DNA data to identification, missing persons, parentage and disaster-victim ID; phenotype prediction is not an enumerated purpose. CDFD and DFSS have flagged the need for an explicit regulatory frame before FDP enters routine casework.

What is the difference between forensic DNA phenotyping and STR profiling?
STR profiling compares a crime-scene profile to a reference (suspect or database) at 13 to 24 STR loci and produces an identification on match. FDP uses SNP panels (HIrisPlex-S, AIMs) to predict EVCs, ancestry and approximate age of an unknown donor with no reference required. FDP is an investigative lead; STR is identification evidence.
What does HIrisPlex-S predict, and how many SNPs does it use?
HIrisPlex-S is the integrated SNP panel for eye, hair and skin colour. It uses 41 SNPs in total: the original 6 IrisPlex SNPs (eye), 18 more for hair (HIrisPlex, 24 total) and 17 for skin (the S extension). Prediction is probabilistic, run via the Erasmus MC web tool.
Which RNA class is preferred for body-fluid identification in degraded samples, mRNA or miRNA?
miRNA. MicroRNAs are around 22 nt long and markedly more resistant to RNase degradation than full-length mRNA. mRNA panels (HBB, STATH, PRM1/2, MUC4, MMP7) give stronger specificity in fresh stains, but miRNA panels are favoured for aged or degraded samples.
Is FDP admissible as identification evidence in Indian courts?
No. Indian courts treat FDP outputs as investigative leads, not standalone identification. The BSA 2023 (Section 39) does not specifically address FDP, and the DNA Technology Bill restricts permitted uses of DNA data to identification, missing persons, parentage and disaster-victim ID. Phenotype prediction is not an enumerated purpose.
What is the role of DNA methylation in forensic DNA phenotyping?
DNA methylation at specific CpG sites (ELOVL2, FHL2, KLF14, TRIM59 etc.) changes predictably with age. Forensic age estimation, the epigenetic clock, uses methylation at 5 to 7 such CpGs to predict donor age within ±3 to ±5 years on blood. Part of FDP, but methylation-based rather than SNP-based.

Test yourself on UGC-NET Forensic Science with free, timed mocks.

Practice UGC-NET Forensic Science questions

Found this useful? Pass it along.

Share

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.

Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.