Logistic Regression (Supervised Fraud Model)

Definition

A classification model trained on historically labelled transactions (fraud vs. legitimate) to estimate the probability that a new transaction is fraudulent. Requires a labelled training set and is interpretable: each feature's contribution to the score is a coefficient.

Related terms

Anomaly Scoring: A numeric score assigned to each entity or transaction based on how different it is from the expected population, derived from multiple...
Explainability: The degree to which a model's output can be explained in terms of its inputs and logic. Logistic regression and decision trees...
Isolation Forest: An unsupervised machine-learning model for anomaly detection. It builds random decision trees and scores each record by the average depth required to...
Network Analysis (Link Analysis): A method that models entities (people, companies, accounts, addresses) as nodes and connections between them (shared attributes, transactions, ownership) as edges, then...
Timeline Reconstruction: The process of ordering digital events from multiple sources into a single chronological account. Requires normalising all timestamps to a common reference...

Explained in

Data Analytics in Fraud InvestigationsA classification model trained on historically labelled transactions (fraud vs. legitimate) to estimate the probability that a new transaction is fraudulent. R...

Logistic Regression (Supervised Fraud Model)

Related terms

Explained in

Your journey to becoming a forensic professional starts here.