Blockchain Fundamentals for Investigators

A practical grounding in how blockchain networks record and confirm transactions, covering distributed ledgers, UTXO and account models, and the transparency properties investigators rely on when tracing cryptocurrency.

Last updated: 19 Jun 2026

A blockchain is a distributed ledger in which every transaction is recorded in a cryptographically linked chain of blocks, replicated across thousands of independent nodes. No single party can alter a confirmed record without redoing the computational work for every subsequent block and outpacing the entire honest network simultaneously. Bitcoin uses an Unspent Transaction Output (UTXO) model that preserves the full provenance graph of every coin, while Ethereum uses an account-balance model that resembles a traditional ledger but routes value through programmable smart contracts. These structural differences directly shape which tracing methods work, which heuristics apply, and what legal process is needed to obtain records.

When investigators first encountered Bitcoin transactions as evidence, the instinct was to treat them like bank records with the bank removed. A blockchain does record every transfer in a permanent ledger that no single party can quietly edit, but the mechanics underneath differ enough from traditional finance that misreading them can send an investigation in the wrong direction. How outputs are spent, how nodes reach agreement on validity, and how pseudonymity is constructed and broken all require precise understanding before any tracing tool is useful.

This topic builds the structural knowledge an investigator needs before touching any tracing tool. It covers how a distributed ledger achieves tamper-resistance without a central authority, the two main accounting models (UTXO and account-based) and what each reveals in a money trail, the meaningful distinctions between public and private blockchains, and the confirmation mechanism that makes a recorded transaction reliable evidence. Along the way it flags where the transparency properties that help investigators also create the privacy pressures that bad actors try to exploit.

The clustering heuristics and graph analysis covered in subsequent topics rest on these structural properties.

By the end of this topic you will be able to:

Explain how cryptographic hashing and node replication combine to make a confirmed blockchain transaction tamper-resistant in practice.
Distinguish the UTXO and account-based accounting models and describe what each model reveals or conceals in a money-trail investigation.
Apply the co-spend clustering heuristic to identify likely shared wallet control from Bitcoin transaction inputs.
Identify the confirmation thresholds at which a Bitcoin or Ethereum transaction can be cited as reliable evidence, and explain why.
Describe the investigative access differences between public and private blockchains and the legal process required for each.

Key terms

Distributed ledger: A transaction record replicated across many independent nodes. Each node holds a full or partial copy and validates new entries against the shared rules. No single node is authoritative, so tampering requires controlling a majority of the network.
UTXO (Unspent Transaction Output): Bitcoin's accounting unit. Each UTXO is a discrete chunk of value from a prior transaction's output. Spending it consumes it entirely; change comes back as a new UTXO to an address controlled by the sender. A wallet's balance is the sum of all UTXOs it controls.
Account model: Ethereum's approach, where each address holds a running balance incremented or decremented by transactions. Simpler to reason about but less transparent about the exact provenance of funds than the UTXO graph.
Public blockchain: A permissionless network where anyone can read the ledger, submit transactions, or run a node. Bitcoin and Ethereum are public. The ledger is openly auditable, which is the property that makes blockchain tracing possible.
Private (permissioned) blockchain: A network where participation is restricted to known, approved nodes. Used by banks and supply chains for transaction settlement. Records are not public but are obtainable through legal process directed at the operator.
Block confirmation: The count of blocks added to the chain after the block containing a specific transaction. Higher confirmation counts mean greater computational cost to reverse the transaction, which translates to greater reliability as evidence.

How a distributed ledger achieves tamper-resistance

A traditional bank ledger is authoritative because the bank controls it. Change the bank's database and the record changes. A blockchain achieves something different: the record is authoritative because thousands of independent nodes each hold a copy, continuously cross-check new additions, and reject anything that violates the shared rules. To alter a transaction already on the chain, an attacker would have to rewrite it on every node simultaneously and outrun the honest majority that is still adding new blocks. In practice, for a large public chain, this is computationally and economically prohibitive.

The mechanism tying this together is cryptographic hashing. Each block contains the hash of the previous block. Changing one transaction inside block 500 changes that block's hash, which breaks the link to block 501, which invalidates every subsequent block. To make the altered chain the dominant one, an attacker would have to redo the proof-of-work (or produce valid signatures, in proof-of-stake systems) for every block from 500 onward, faster than the honest network adds new ones. The chain structure makes the past progressively harder to revise, not just harder to access.

Chain of blocks: altering any block breaks every subsequent link.

UTXO model: following discrete outputs

In Bitcoin's UTXO model, a transaction is a declaration: I am spending these specific prior outputs and creating these new outputs. Each output is locked to an address. To spend it, the holder must provide a valid signature unlocking that lock. Once spent, the output is gone permanently; it cannot be spent again (the double-spend problem Bitcoin solved). A new output is created for the recipient, and if the sender is sending less than the full UTXO, a change output goes back to an address the sender controls.

A transaction on a UTXO chain therefore carries explicit provenance: each output can be traced back to the specific prior transaction that created it, and to the addresses that funded that transaction. The trail is a directed acyclic graph running backward through time. Tracing tools walk this graph. When multiple inputs are combined in one transaction, the tool notes that the addresses controlling those inputs were probably controlled by the same entity, because only someone with keys to all of them could sign the transaction. That heuristic is called co-spend clustering.

Feature	UTXO (Bitcoin)	Account model (Ethereum)
Balance representation	Sum of unspent outputs	Single running balance per address
Transaction structure	Inputs consumed, outputs created	From-address decremented, to-address incremented
Change handling	Explicit change output to sender	Remainder stays in sender balance
Tracing provenance	Full input graph visible	Debit-credit history visible
Clustering heuristic	Co-spend of inputs implies shared control	Less direct; needs token-flow analysis

Account model: Ethereum and smart-contract chains

Ethereum and most EVM-compatible chains use an account model. An address has a balance. A transaction subtracts from the sender and adds to the recipient. There is no consumption of specific prior outputs; the chain just maintains a state that records every address's current balance. This is conceptually closer to a traditional bank ledger, which makes it easier for financial investigators to read but strips some of the provenance detail the UTXO graph provides.

The added complexity on Ethereum comes from smart contracts, code deployed at an address that holds and transfers value according to programmable rules. Decentralised exchanges, lending protocols, and token bridges are all implemented as contracts. Tracing funds through a DeFi protocol requires reading contract logic to determine how value entered, transformed, and exited, not simply following peer-to-peer transfers. Tools like Etherscan let you inspect individual contract calls, but understanding what those calls mean requires reading the contract's source code or ABI.

Public versus private blockchains

A public blockchain is permissionless: anyone can download the full node software, sync the ledger, and read every transaction ever recorded. No API key, no account, no legal process required. Bitcoin's ledger from the genesis block in January 2009 to the present is publicly accessible in full. This is why tracing firms can build their products: the raw data is free and complete.

A private blockchain is operated by a known consortium or company. Hyperledger Fabric deployments used by banks for interbank settlement, or supply-chain blockchains used by logistics companies, fall here. The ledger is not public. Getting those records requires a subpoena or court order directed at the known operator, followed by authentication of the exported records. The challenge is not finding the data but establishing its integrity, since the operator could theoretically manipulate their own private chain. Hash-chain integrity checks and independent node logs from consortium partners provide corroboration.

Public vs private blockchain: investigative access paths differ fundamentally.

Transaction confirmation and evidence reliability

When a transaction is broadcast to the Bitcoin network, miners compete to include it in the next block. Once included, the transaction has one confirmation. Each subsequent block adds another confirmation. Because rewriting history would require outpacing the entire honest mining network from that point, the cost of reversal grows roughly exponentially with the confirmation count.

For investigative purposes, a transaction with fifty or a hundred confirmations sitting on a major public chain is, for practical purposes, immutable. Presenting it as evidence does not require the same authentication chain a bank statement would. Block explorer output with timestamp, transaction ID (TXID), input and output addresses, and value, supplemented by a hash-chain verification showing the block is on the main chain, is sufficient for most courts that have addressed the question. The TXID is itself a cryptographic hash of the transaction content, so any alteration of the transaction data would change the TXID and invalidate the record.

0 confirmations (mempool): transaction broadcast but not yet included in a block. Can still be dropped or replaced. Not reliable as final evidence.
1-5 confirmations: included in a block but theoretically reversible if a competing chain branch wins. Sufficient for small-value evidence; treat with caution for high-stakes matters.
6+ confirmations: the conventional Bitcoin finality threshold. Reversal would require a sustained 51% attack, which is economically implausible on Bitcoin. Treat as reliable evidence.
Ethereum finality: post-merge Ethereum uses checkpoint-based finality (Casper FFG). A transaction included in a finalised epoch is cryptographically sealed and cannot be reversed without at least one-third of staked ETH being slashed.

Transparency, pseudonymity, and the investigative balance

Public blockchains present a tension that shapes both investigative opportunity and the countermeasures market. Every transaction is permanently visible to everyone: amounts, addresses, timestamps, and the full graph of prior transactions. This is more transparency than any traditional bank provides. And yet the addresses are pseudonyms. The ledger says that address 1A1zP1... sent 50 BTC to 1BpEi6... at block 170. It does not say those addresses belong to anyone in particular.

The link between address and identity comes from off-chain anchors: exchange KYC records that associate a customer with withdrawal addresses, IP logs captured at broadcast time that tie a transaction to a device, or blockchain explorer accounts linked to an email address. Investigators who obtain these off-chain anchors, through subpoenas, court orders, or undercover operations, can collapse the pseudonymity of a large cluster of addresses at once. Once one address in a cluster is identified, the co-spend and change-address heuristics propagate that identity to every address in the same cluster.

Worked example

Reading a Bitcoin transaction from a block explorer

A single TXID decoded step by step into investigative intelligence.

Suppose a fraud investigation surfaces a Bitcoin address used to receive proceeds. The investigator pulls the address history on a block explorer and finds one incoming transaction and two outgoing transactions. Here is how to read the first outgoing transaction.

Identify the inputs
The transaction has one input: the UTXO received from the incoming transaction. This confirms the entire received amount is being moved. The input section shows the prior TXID and output index that is being consumed.
Read the outputs
Two outputs appear. Output 0 sends a specific amount to an external address. Output 1 (smaller) returns to a new address. The second output is almost certainly change. The wallet generated a fresh address for change to improve privacy, but the co-spend heuristic ties both back to the same controlling entity.
Check confirmation count
The block height field shows which block contains the transaction. Subtracting from the current block height gives the confirmation count. At 200+ confirmations, this record is reliable. Note the block timestamp for your timeline.
Identify the recipient address
The external output address becomes the next hop to trace. Feed it back into the explorer. If this address has activity on a known exchange cluster (identified by the tracing tool's attribution database), submit a preservation order or subpoena to that exchange for the account holder's KYC records.

This is the core investigative loop on a public blockchain. The ledger gives you the graph for free. The work is in mapping nodes in that graph to real-world identities using off-chain records. Blockchain fundamentals tell you where to look; the KYC subpoena tells you who is there.

Check your understanding

Question 1 of 4· 0 answered

In Bitcoin's UTXO model, what happens to a UTXO when it is spent?

Key Takeaways

A distributed ledger's tamper-resistance comes from its replication across independent nodes and the hash-chain structure that makes retroactive alteration computationally prohibitive.
Bitcoin's UTXO model preserves full provenance as a spendable-output graph, enabling co-spend clustering; Ethereum's account model is simpler to read but requires separate attention to token transfers and smart-contract calls.
Public blockchains are directly readable by anyone without legal process; private blockchains require a subpoena directed at the known operator.
Confirmation count determines how reliably a transaction can be cited as evidence; six confirmations on Bitcoin and finalised checkpoints on Ethereum represent practical immutability.
Public chains are transparent about amounts and addresses but pseudonymous about identity; the link to a real person comes from off-chain anchors such as exchange KYC records and IP logs, not from the ledger itself.

What is a distributed ledger and why does it matter for investigations?

A distributed ledger is a copy of the transaction record held simultaneously by every participating node. No single party can quietly alter history without the rest of the network rejecting the change. For investigators, this means the ledger is an unusually durable evidence source that cannot be unilaterally edited by a suspect or an exchange.

What is the difference between a UTXO and an account-based blockchain?

In a UTXO model (Bitcoin), a wallet's balance is the sum of unspent outputs from prior transactions; spending means consuming specific outputs and creating new ones. In an account model (Ethereum), addresses hold a running balance updated with each transaction. UTXO chains preserve a full graph of prior inputs, which aids tracing; account chains show simpler debit-credit flows but can complicate reconstructing a money trail.

Can a private blockchain transaction be subpoenaed like a bank record?

A private blockchain is controlled by a known operator, so its records can be obtained through the same legal process as any corporate database: subpoena, court order, or mutual legal assistance request. The evidentiary challenge shifts from locating the record (it is easy to find) to authenticating it and demonstrating the operator has not altered it, which is where hash-chain integrity proofs become useful.

Does a public blockchain address identify a person?

A Bitcoin or Ethereum address is a pseudonym, not a real-world identity. The link between address and person typically comes from exchange KYC records, IP logs captured at broadcast time, or on-chain behaviour clustering. Without at least one of those off-chain anchors, the address alone names no one.

What does transaction confirmation mean and how does it affect evidence reliability?

A confirmation is one block added on top of the block containing a transaction. With each added block, reversing the transaction becomes computationally more expensive. Six confirmations is the conventional threshold for Bitcoin; Ethereum uses a similar finality concept. For investigative purposes, a transaction buried under dozens of confirmations is effectively immutable and can be cited as a reliable record.

Test yourself on Forensic Accounting and Financial Forensics with free, timed mocks.

Practice Forensic Accounting and Financial Forensics questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.