Reference Databases in Wildlife Forensics

A guide to the major reference databases used to identify species, assign geographic origins, and link wildlife trade seizures, covering DNA barcoding libraries, STR databases, trade intelligence systems, and the significant coverage gaps that still exist.

Last updated: 22 Jun 2026

Reference databases in wildlife forensics fall into three functional categories: genetic reference libraries (BOLD Systems, GenBank, ElePhant, RhODIS) that link DNA sequences or profiles to known species and populations; trade intelligence systems (the CITES trade database) that track legal specimen movement; and seizure and legal intelligence platforms (WildCAP, SHERLOC) that connect enforcement records and court outcomes across borders. A forensic identification is only as reliable as the reference it is compared against, and every analyst must be able to state not only the match result but the completeness of the database for the taxon in question. For major traded vertebrates such as African elephants and rhinoceroses, species-specific databases now support individual-level matching; for invertebrates, tropical timber, and many marine species, coverage remains too thin for reliable species-level conclusions.

A wildlife forensic identification is only as strong as the reference it is compared against. A mitochondrial sequence from a bone fragment, a STR profile from a rhino horn, a permit number on a live bird consignment, a seizure record from a port: each of these produces a result only when matched against a database of known values. The databases that exist, what they cover, and where they fail determine what can and cannot be proven in court.

These databases fall into three functional categories. Genetic reference libraries hold sequences or profiles from specimens of known species and, in some cases, known geographic populations. Trade intelligence databases track legal wildlife movement through the permit system and seizure records. Species-specific forensic databases such as RhODIS for African rhinoceroses and ElePhant for elephants combine both functions: they hold individual-level genetic profiles and link those profiles to seizure histories and origin data.

This topic covers the principal databases in each category, what an analyst can obtain from each, how they connect to casework, and the coverage gaps that represent real limits on what wildlife forensics can currently prove in court. A forensic expert who understands these limits can qualify conclusions honestly rather than overstate a match.

By the end of this topic you will be able to:

Explain the three functional categories of wildlife forensic databases and give a named example of each.
Describe the threshold logic applied when interpreting a BOLD Systems COI match, including what identity percentages support species-level versus genus-level conclusions.
Distinguish between what ElePhant and RhODIS respectively provide, and explain why population-level assignment and individual-level matching require different data infrastructures.
Identify the principal enforcement uses of the CITES trade database and state why its reporting gaps matter for forensic conclusions.
Assess the forensic consequences of coverage gaps for invertebrates, tropical timber, and marine species, and explain what a forensic report must include when database completeness is limited.

Key terms

BOLD Systems: Barcode of Life Data System. The primary global repository for DNA barcode sequences (chiefly mitochondrial COI). Used for first-pass species identification by sequence similarity comparison.
RhODIS: Rhino DNA Index System. A forensic-grade STR-profile database for African white and black rhinoceroses maintained in South Africa. Used to link horn seizures to individual animals and range-state populations.
ElePhant: A microsatellite (STR) reference database for African and Asian elephants, used to assign ivory seizures to continental and population-level origin.
CITES trade database: Maintained by UNEP-WCMC, this holds records of all reported legal CITES-listed species trade. Used to detect permit fraud, unusual trade volumes, and laundering of illegal specimens.
WildCAP: Wildlife Contraband Analysis and Profiling database. Logs and cross-references wildlife seizure records across enforcement agencies to identify trafficking patterns.
SHERLOC: UNODC Sharing Electronic Resources and Laws on Crime. The SHERLOC wildlife crime module indexes national laws, case law, and court outcomes on wildlife trafficking to support enforcement and prosecution.

BOLD Systems: barcode-based identification

DNA barcoding standardised around the mitochondrial cytochrome c oxidase I (COI) gene for animals, and around rbcL and matK for plants. The Barcode of Life Data System at the University of Guelph holds more than nine million sequences from more than 300,000 species, making it the single largest reference library for species identification by short-read sequencing. An unknown sequence is submitted to the BOLD Identification Engine, and the system returns the closest matches with percentage identity and a statistical confidence measure.

In casework, BOLD is used as a first-pass screen: a result with 99% or better COI identity to a single species typically supports a species-level identification. Identifications between 95% and 99% may only support genus-level assignment, and anything below that is at best a family match. The analyst's report must state which threshold applied and what the coverage of the database is for the taxon in question, since a high-confidence match to the closest available sequence is not the same as a high-confidence match to the actual source species if the actual species is absent from the database.

GenBank and the broader INSDC

GenBank, maintained by the US National Center for Biotechnology Information, is not a forensic database. It is a general-purpose repository for nucleotide sequences and holds far more sequences than BOLD, but with less curation. Any researcher can deposit any sequence with any species annotation, and errors in species identification or geographic attribution can propagate without systematic correction.

Wildlife forensic analysts use GenBank and its partner databases (EMBL at EBI and DDBJ in Japan, the three forming the International Nucleotide Sequence Database Collaboration) as a complement to BOLD when a taxon has poor barcoding coverage but has published sequences from other gene regions in the scientific literature. For multi-gene analyses or whole-mitogenome approaches, GenBank is often the only resource available. The quality caveat remains: any match from GenBank needs the original accession record checked for specimen voucher information and the depositor's identification credentials.

ElePhant and RhODIS: individual-level forensic databases

For most wildlife species, DNA identification reaches species level and stops. For elephants and African rhinoceroses, the forensic DNA infrastructure extends to the population and, in some cases, individual level, representing what species-specific forensic databases can achieve when there is sufficient institutional investment.

ElePhant holds microsatellite profiles from hundreds of elephants sampled across African and Asian range states, plus profiles from large ivory seizures analysed in research by Samuel Wasser's laboratory at the University of Washington. When a new ivory seizure is profiled, the STR data is compared against the geographic reference populations to assign the seized material to a continental region and, with sufficient data, to a specific range state. This is not a match to a single identified animal; it is a probabilistic assignment to a population based on allele frequency distributions. In the Wasser et al. seizure studies, this approach consistently mapped large-scale ivory flows to central and east African source populations despite ivory having been traded and laundered across multiple countries.

RhODIS was developed in South Africa to address a specific enforcement problem: South Africa has the world's largest population of white rhinos and faces the highest volume of horn poaching. A national database of individual STR profiles from live-sampled, registered animals means that a seized horn can be matched to a known individual, which means it can be linked to a specific reserve, a registered owner, and in cases where the animal was already dead or reported missing, an existing investigation. New profile acquisitions from living animals, from post-mortem sampling after poaching events, and from horn seizures are continuously added.

How RhODIS links a seized horn to an individual animal and its country of origin through STR profiling and population assignment.

CITES trade database and permit intelligence

The CITES trade database, managed by UNEP-WCMC on behalf of the CITES Secretariat, holds records of all legal trade in listed species as reported annually by state parties. By 2025 it contained over 25 million trade records going back to the 1970s. These records cover live animals, plants, and their derivatives, classified by taxon, quantity, trade purpose, and the importing and exporting countries.

For enforcement, the database serves several functions. Permit verification: a permit number accompanying a consignment can be cross-checked against known issued permits to detect forgeries or permits reused across multiple shipments. Trade pattern analysis: an unusually high reported export volume from a country with limited habitat for a species may indicate that wild-caught animals are being laundered through captive-breeding facilities. Anomaly detection: a species that shows near-zero reported trade but shows up repeatedly in seizure records is a signal of unlaundered illegal trade.

WildCAP and SHERLOC: seizure and legal intelligence

Genetic databases establish what a specimen is and where it came from. Seizure and legal databases establish whether a similar specimen was intercepted before, by whom, on what route, and what the judicial outcome was. WildCAP (Wildlife Contraband Analysis and Profiling) was developed as a shared logging tool for wildlife enforcement agencies. An agency logs details of a seized consignment, and investigators in other jurisdictions can query whether the same route, species, or quantity profile appears in other seizure records.

UNODC's SHERLOC platform indexes national legislation, case law, and court outcomes across multiple crime types including wildlife trafficking. The wildlife module gives prosecutors and investigators access to how courts in other jurisdictions have handled similar cases, what legal arguments have succeeded or failed, and what penalties have been applied. This kind of legal intelligence is particularly useful in countries where wildlife crime prosecution is new and courts have little precedent to draw on.

Database	Type	Primary user	Key output
BOLD Systems	Genetic (barcode)	Laboratory analyst	Species ID by COI sequence similarity
GenBank / INSDC	Genetic (multi-gene)	Laboratory analyst	Broader sequence comparison for non-barcoded taxa
ElePhant	Genetic (STR)	Wildlife forensic specialist	Ivory population assignment to range state
RhODIS	Genetic (STR, individual)	Wildlife forensic specialist	Horn-to-animal match and population assignment
CITES trade database	Trade intelligence	Enforcement officer / analyst	Permit verification and trade pattern anomalies
WildCAP	Seizure intelligence	Enforcement officer	Cross-border seizure pattern linking
SHERLOC wildlife module	Legal intelligence	Prosecutor / investigator	Case law, legislation, court outcomes

Coverage gaps and their forensic consequences

Wildlife forensics can currently provide species-level DNA identification for most commonly traded vertebrates. For less-studied taxa, coverage is substantially weaker. Large proportions of traded invertebrates, tropical timber species, and Neotropical fauna have few or no COI reference sequences in BOLD. If the actual source species is absent from the database, the closest match in the database will still return a high percentage identity to its nearest sequenced relative, and an analyst who does not know the database coverage for that taxon may report a false species identification.

Invertebrates and tropical insects: millions of described species with negligible barcode coverage. Some of the highest-value traded invertebrate groups (stag beetles, tarantulas) have patchy representation.
Tropical timber: species-level wood identification by anatomy requires reference atlases that exist for some families but not others. DNA from processed timber is degraded and matches against partial databases.
Marine species: deep-sea invertebrates and many commercially traded fish species from the southern hemisphere have incomplete barcode coverage, relevant for shark fin and sea cucumber trade.
Regional population data: even for species with good species-level coverage, population-level reference sets for geographic assignment are mostly missing outside Africa (for elephants and rhinos) and a few well-funded bird and reptile projects.

The practical consequence is that a forensic expert must always report not only the match result but the coverage completeness for the taxon queried. Saying a sample matches species X at 99% COI identity is a stronger statement if X is a well-sampled bird than if X is the only described species in a poorly sampled tropical genus. Courts and lawyers who understand this distinction will scrutinise it; those who do not may accept a weaker identification as more certain than it is.

Worked example

Ivory seizure population assignment using ElePhant

How a seized ivory tusk gets assigned to a range state without a direct individual match.

A consignment of 20 ivory tusks is seized at a container port. The tusks are unmarked and the shipping documents claim a legal origin. The investigation team sends samples from all 20 tusks to a laboratory with access to the ElePhant reference database. The goal is population assignment: not to find a direct individual match, but to determine which African or Asian range state the elephants came from.

DNA extraction: small drill cores are taken from inside each tusk where DNA preservation is better. Extraction protocols for degraded ivory use modified CTAB methods and additional inhibitor removal steps.
Species confirmation: a species-discriminating PCR distinguishes African savanna elephant (Loxodonta africana), African forest elephant (Loxodonta cyclotis), and Asian elephant (Elephas maximus). This step is required because each has different CITES trade implications.
STR profiling: samples are amplified at the ElePhant marker panel (typically 14-16 microsatellite loci) and fragment-analysed. Allele calls are entered into the database.
Population assignment: the allele frequencies are compared against geographic reference populations in ElePhant using a Bayesian assignment method. The output is a probability distribution across sampled populations. In this example, 17 of the 20 tusks assign with high probability to a central African forest elephant population; the remaining three assign to a west African population.
Reporting: the analyst reports the population assignments with confidence intervals, states the number of reference individuals per population in the database, and notes whether any direct individual matches were found (none here, since these animals were not in the database). The result contradicts the claimed legal origin and supports the prosecution case.

This approach has been validated across multiple large seizure studies. Its limitation is that it works best for African elephants where the reference database is well populated. For Asian elephants and for most other taxa, the reference coverage is insufficient for reliable sub-continental assignment, and the analyst must limit the conclusion to continental origin or species level.

Check your understanding

Question 1 of 4· 0 answered

At what COI sequence identity threshold does BOLD Systems typically support a species-level identification?

Key Takeaways

BOLD Systems is the primary first-pass species identification tool for DNA barcode sequences, but coverage varies widely by taxon and a high-similarity match is only reliable when the database is well populated for the group in question.
ElePhant and RhODIS are species-specific forensic databases that go beyond species ID to provide population-level and individual-level matching for ivory and rhino horn seizures.
The CITES trade database supports enforcement by enabling permit verification and detection of trade anomalies, though reporting gaps mean it is not a complete picture of legal flows.
WildCAP and SHERLOC provide seizure pattern intelligence and legal precedent intelligence, complementing the genetic databases in building a prosecution case.
Coverage gaps are most severe for invertebrates, tropical plants, marine species, and taxa from under-resourced biodiverse regions, and every forensic report must state the database completeness for the taxon being queried.

What is BOLD Systems and why is it used in wildlife forensics?

BOLD Systems (Barcode of Life Data System) is the primary repository for DNA barcoding sequences, mostly the mitochondrial COI gene, from species across the animal and plant kingdoms. In wildlife forensics it acts as the first-pass species identification tool: a barcode sequence from an unknown specimen is queried against BOLD and the closest match, with its percentage similarity, provides a species-level or genus-level identification. Coverage is strong for vertebrates but thin for many invertebrate and plant groups.

What is RhODIS and how is it used in rhino horn cases?

RhODIS is a forensic DNA database for African rhinoceros that holds STR profiles from individual animals sampled across range states. In a horn seizure case, DNA is extracted from the horn, profiled at the RhODIS marker set, and compared against the database to determine whether the individual is already known (linking the horn to a previously registered animal and its country of origin) or is new. Population assignment from SNP panels can narrow an unregistered horn to a range-state even without a direct match.

How does the CITES trade database support enforcement?

The CITES trade database, maintained by UNEP-WCMC, holds records of all legal trade in CITES-listed species as reported by member parties. Enforcement agencies use it to cross-check whether a permit accompanying a shipment matches a real permit number, to identify unusual trade flows that may indicate laundering of illegal specimens into the legal supply chain, and to build intelligence on high-volume trading routes and companies.

What are the main coverage gaps in wildlife forensic databases?

Coverage gaps are largest for invertebrates, tropical plants, marine invertebrates, and species in biodiverse but under-resourced regions. Many threatened species from tropical Africa, Southeast Asia, and Latin America have few or no reference sequences in BOLD or GenBank, and no dedicated STR databases like ElePhant or RhODIS exist for most taxa.

Test yourself on Wildlife Forensics with free, timed mocks.

Practice Wildlife Forensics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.