Logistic regression (supervised fraud model)
Definition
A classification model trained on historically labelled transactions (fraud vs. legitimate) to estimate the probability that a new transaction is fraudulent. Requires a labelled training set and is interpretable: each feature's contribution to the score is a coefficient.
Related terms
- Anomaly scoring
- A numeric score assigned to each entity or transaction based on how different it is from the expected population, derived from multiple...
- Explainability
- The degree to which a model's output can be explained in terms of its inputs and logic. Logistic regression and decision trees...
- Isolation Forest
- An unsupervised machine-learning model for anomaly detection. It builds random decision trees and scores each record by the average depth required to...
- Network analysis (link analysis)
- A method that models entities (people, companies, accounts, addresses) as nodes and connections between them (shared attributes, transactions, ownership) as edges, then...
- Timeline reconstruction
- The assembly of events from multiple data sources onto a chronological axis to establish the sequence of actions in a fraud: when...
Explained in
- Data Analytics in Fraud InvestigationsA classification model trained on historically labelled transactions (fraud vs. legitimate) to estimate the probability that a new transaction is fraudulent. R...