Skip to content

Detection Sources and Alert Pipelines

Detection sources are the systems that generate security alerts, spanning SIEM platforms, endpoint detection and response tools, intrusion detection systems, cloud-native monitors, and threat intelligence feeds. This topic explains how raw signals from these sources flow through aggregation, correlation, and deduplication before reaching an analyst as an actionable alert.

Last updated:

Share

Detection sources are the systems and services that observe activity across an organisation's environment and generate security alerts when something looks suspicious or malicious. The main categories are Security Information and Event Management (SIEM) platforms, Endpoint Detection and Response (EDR) tools, network Intrusion Detection and Prevention Systems (IDS/IPS), cloud-native monitoring services, and threat intelligence feeds. Each source produces raw signals: log entries, network flow records, process events, or indicator matches. Those raw signals do not reach an analyst directly. Instead they pass through an alert pipeline that aggregates signals from many sources, applies correlation logic to group related events, and deduplicates near-identical alerts before surfacing a single, enriched alert to the analyst queue. The quality of detection in an incident response programme depends as much on how the pipeline processes signals as on the individual sources that generate them.

Security teams in organisations of any size face a fundamental mismatch: the volume of raw events generated by a modern environment far exceeds what analysts can review individually. A mid-sized organisation might generate tens of millions of log entries per day. The pipeline exists to reduce that volume to a manageable set of high-fidelity alerts. Correlation rules group events that belong to the same attack sequence. Deduplication removes redundant copies of the same event. Enrichment adds context from threat intelligence and asset inventories. The result is an alert that contains the essential facts an analyst needs to begin triage without having to reassemble those facts from raw logs.

Detection pipelines vary widely in architecture. A small organisation may rely on a cloud-hosted SIEM that ingests logs from a firewall and a handful of servers. A large enterprise may run a multi-tier architecture in which vendor-specific collectors normalise data before forwarding it to a central platform, where a rules engine and a machine-learning anomaly detector run in parallel, each producing alerts that feed into a case management system. Understanding the components and their roles is essential for any analyst who needs to know where an alert came from, what evidence supports it, and what may have been missed.

By the end of this topic you will be able to:

  • Identify the main categories of detection source (SIEM, EDR, IDS/IPS, cloud-native monitoring, threat intelligence feeds) and describe the type of signal each produces.
  • Explain the stages of an alert pipeline: collection, normalisation, aggregation, correlation, deduplication, and enrichment.
  • Compare signature-based and behaviour-based detection methods and explain the tradeoff between false positives and false negatives for each.
  • Describe how threat intelligence feeds integrate with a SIEM or network sensor to improve detection quality and analyst context.
  • Explain why cloud environments require dedicated monitoring sources and name at least two cloud-native detection services from major providers.
Key terms
SIEM (Security Information and Event Management)
A platform that collects, normalises, and stores log and event data from many sources, then applies correlation rules and anomaly detection to generate security alerts. Examples include Microsoft Sentinel, Splunk Enterprise Security, IBM QRadar, and Elastic SIEM.
EDR (Endpoint Detection and Response)
An agent-based security tool deployed on individual endpoints (workstations, servers, mobile devices) that monitors process execution, file changes, network connections, and registry activity in real time. Examples include CrowdStrike Falcon, Microsoft Defender for Endpoint, and SentinelOne.
IDS/IPS (Intrusion Detection/Prevention System)
Network or host-based systems that inspect traffic or system calls for known attack patterns. An IDS generates alerts without blocking; an IPS sits inline and can drop malicious traffic. The distinction matters for tuning: prevention mode can block legitimate activity if rules are misconfigured.
Alert correlation
The process of grouping multiple related events or alerts into a single higher-level alert representing one attack sequence. A correlation rule might group a failed-login burst followed by a successful login from the same IP as a single brute-force alert rather than hundreds of individual login-failure alerts.
Indicator of Compromise (IOC)
A specific, observable artefact associated with known malicious activity: a file hash, IP address, domain name, URL, registry key, or certificate fingerprint. IOCs are the primary data carried by threat intelligence feeds and are matched against observed activity to generate alerts.
Alert fatigue
The condition in which analysts receive more alerts than they can meaningfully review, leading to delayed responses, dismissed true positives, and reduced confidence in the alert system. Alert fatigue is the main operational risk of a poorly tuned detection pipeline and a leading factor in delayed breach detection.

SIEM: the central aggregation layer

A SIEM receives log and event data from sources across the environment: firewalls, operating systems, directory services, web proxies, email gateways, cloud services, and applications. Its first function is normalisation: each source writes logs in its own format, so the SIEM parses incoming data into a common schema, mapping fields such as source IP, destination IP, username, timestamp, and action to consistent field names regardless of which source generated the record. Normalisation is what makes cross-source correlation possible. Without it, a rule that links a firewall block event to a Windows login failure event cannot be written, because those events use different field names and different timestamp formats in their raw form.

Once data is normalised, the SIEM applies correlation rules. A simple rule might fire an alert when more than ten failed login attempts occur from the same source IP within five minutes. A more complex rule might fire when a failed-VPN-authentication event is followed within thirty minutes by a successful authentication from the same account from a geographically distant IP, which then accesses a file server not previously accessed by that account. These multi-step rules catch attack sequences that no single event would reveal individually.

Modern SIEM platforms supplement rule-based correlation with machine-learning anomaly detection. User and Entity Behaviour Analytics (UEBA) builds a baseline of normal behaviour for each user and device, then generates an alert when observed behaviour departs significantly from that baseline, without requiring a rule that anticipates the specific deviation. UEBA is particularly useful for detecting insider threats and novel attack techniques that signature-based rules would miss, but it requires a sufficient history of baseline data and generates more ambiguous alerts that demand more analyst judgement.

Endpoint detection and response

An EDR agent runs as a privileged process on each endpoint and captures a continuous stream of telemetry: every process that starts or stops, every file created, modified, or deleted, every network connection opened or closed, and every registry key written. This telemetry is transmitted to the EDR's cloud or on-premises backend, where detection logic runs. The volume of EDR telemetry is large enough that the backend applies its own filtering and aggregation before surfacing alerts.

EDR detection logic operates in two modes. Signature-based detection matches process hashes, file hashes, or command-line strings against a database of known malware. It is fast and reliable for known threats but blind to novel variants and living-off-the-land techniques where attackers use legitimate system tools. Behaviour-based detection looks for patterns: a process that injects code into another process, a script interpreter spawned by an Office document, or a service account opening a large number of files in rapid succession. Behaviour-based detection catches novel techniques but generates more false positives, requiring tuning and analyst judgment.

AttributeSignature-based detectionBehaviour-based detection
Coverage of known threatsHighMedium to high
Coverage of novel threatsLowHigh
False positive rateLowMedium to high
Tuning requiredLow (update signatures)Ongoing
Typical useMalware hash blocklistingLiving-off-the-land, insider threat

EDR tools also support live response: an analyst can remotely query a suspicious endpoint for running processes, open network connections, and recently created files without requiring physical access or a separate forensic tool. Some EDR platforms allow the analyst to isolate the endpoint from the network (cutting all connections except the management channel to the EDR backend) directly from the analyst console. This containment capability is important in the early stages of an incident, but the decision to isolate must be made carefully: isolating the wrong system can disrupt operations and, if the attacker has a kill switch, can trigger destructive activity.

Network detection: IDS, IPS, and traffic analysis

Network intrusion detection systems (NIDS) inspect traffic passing through a network segment, typically by receiving a copy of traffic via a switch mirror port or network tap. They compare packet content and connection metadata against a ruleset of known attack signatures. Snort and Suricata are the two most widely deployed open-source NIDS engines; commercial alternatives include Palo Alto Networks Threat Prevention and Cisco Firepower. A NIDS alert identifies the source and destination IP, port, protocol, and the rule that fired, giving an analyst a starting point for investigation.

A network intrusion prevention system (NIPS) is a NIDS placed inline in the traffic path rather than receiving a mirror copy. Because it sits between sender and receiver, it can drop a packet, reset a TCP connection, or block a source IP when a rule fires. This gives the NIPS the ability to stop an attack in real time, but it also means a misconfigured or overly aggressive rule can disrupt legitimate traffic. Most organisations run new rules in detection-only mode for a period of observation before enabling prevention, and they maintain a documented procedure for disabling a prevention rule quickly if it begins blocking legitimate activity.

Network traffic analysis (NTA) tools do not rely on signatures. Instead they establish a baseline of normal communication patterns: which hosts talk to which other hosts, on what protocols, at what volumes and times. Deviations from the baseline, such as an internal host suddenly communicating with an external IP it has never contacted before, or a host sending far more data outbound than it normally does, generate alerts. NTA is particularly effective at detecting command-and-control beaconing and data exfiltration, which may use common protocols and therefore evade signature-based IDS rules.

Cloud-native monitoring and detection

Cloud environments produce activity logs through provider-specific services. AWS records API calls via CloudTrail, network flow data via VPC Flow Logs, and DNS queries via Route 53 Resolver Query Logs. Microsoft Azure records management-plane activity via Azure Activity Log, data-plane events via Azure Resource Logs, and security signals via Microsoft Defender for Cloud. Google Cloud records admin and data access via Cloud Audit Logs. These logs are the raw material for cloud detection, and they must be explicitly enabled and routed to a SIEM or a cloud-native detection service; they are not automatically available.

Cloud providers also offer managed threat detection services that analyse their own telemetry internally and surface pre-built alerts. AWS GuardDuty processes CloudTrail, VPC Flow Logs, and DNS logs to detect threats such as credential compromise, cryptomining, and unusual API activity. Microsoft Defender for Cloud covers Azure, AWS, and GCP resources and produces security recommendations alongside threat alerts. Google Security Command Center provides centralised visibility and threat detection across GCP. These services require no log export or correlation rule authoring; they apply the provider's detection logic to the same telemetry they already hold.

Containerised and serverless environments present additional detection challenges. Containers are ephemeral: a compromised container may execute malicious code and be destroyed before any log is exported. Runtime security tools such as Falco (open-source) and commercial equivalents intercept kernel system calls from within the container host and generate alerts in real time, before the container terminates. Serverless functions similarly generate short-lived execution environments; function-level logging must be explicitly enabled and must be retained outside the function's execution context to survive its termination.

Threat intelligence feeds and IOC matching

Threat intelligence feeds supply structured data about known malicious infrastructure and artefacts. A feed entry might identify an IP address as a known command-and-control server, a domain as a phishing infrastructure host, a file hash as a ransomware executable, or a certificate fingerprint as associated with a known threat actor. When a SIEM rule or a network sensor observes traffic to or from a flagged IP, or when an EDR records execution of a file whose hash appears in the feed, a high-confidence alert fires without requiring a behavioural rule that would have anticipated the specific technique.

Feeds come from multiple sources. Commercial threat intelligence providers such as Recorded Future, Mandiant, and CrowdStrike Intelligence aggregate indicators from their own telemetry, from dark web monitoring, and from human intelligence. Open-source feeds include the Abuse.ch collections (MalwareBazaar, URLhaus, ThreatFox), the Cisco Talos IP reputation list, and feeds shared through sector-specific Information Sharing and Analysis Centers (ISACs) in financial services, healthcare, and critical infrastructure. Government feeds include the US CISA Known Exploited Vulnerabilities catalogue and the UK NCSC's threat indicator feeds shared with qualifying organisations.

The STIX/TAXII standard provides a common format (Structured Threat Information Expression) and transport protocol (Trusted Automated eXchange of Indicator Information) for sharing threat intelligence between organisations and platforms. Most modern SIEM platforms and many EDR tools support native STIX/TAXII ingestion, allowing a feed to be pulled automatically on a schedule and applied to detection rules without manual import. The value of a feed degrades quickly as indicators age: an attacker's command-and-control infrastructure changes frequently, so a feed that is updated daily is more useful than one updated weekly.

The alert pipeline: from raw signal to analyst queue

The path from a raw event to an analyst-ready alert passes through several processing stages. Collection is the first: agents, collectors, and API integrations pull or receive events from sources and forward them to the central platform. Normalisation parses each event into a common schema. Aggregation groups related low-level events before correlation runs; for example, grouping all events from the same source IP and destination host in a five-minute window reduces the number of objects the correlation engine must evaluate. Correlation then applies rules or model-based logic to the aggregated data and produces candidate alerts.

Deduplication removes redundant alerts before they reach the analyst. A single malware execution can trigger the same detection rule multiple times as the same process is logged from multiple sources, or as related child processes fire the same rule. Without deduplication, the analyst sees dozens of identical alerts instead of one. Deduplication logic groups alerts that share the same detection rule, source host, and time window into a single alert with a count. Enrichment then appends context: the asset record for the affected host (its owner, business function, and criticality), geolocation and reputation data for any external IP, and relevant threat intelligence context for any matched IOC.

The output of the pipeline is an alert delivered to a case management or Security Orchestration, Automation, and Response (SOAR) platform, or directly to the analyst queue in the SIEM. Well-designed pipelines also record provenance: which raw events contributed to each alert, what rule or model fired, and what enrichment was applied. This provenance chain is essential for the analyst to understand the full context of an alert and for the incident responder to collect the underlying evidence without retracing the pipeline from scratch. The triage and prioritisation step that follows depends on the quality of the alert the pipeline delivers.

Check your understanding
Question 1 of 4· 0 answered

A SIEM receives login events from an identity provider and firewall block events from a network device, but the two sources use different field names for the source IP address. Which pipeline stage resolves this mismatch?

Key Takeaways

  • Detection sources fall into five main categories: SIEM platforms (broad log aggregation and correlation), EDR tools (per-endpoint process and file telemetry), network IDS/IPS (traffic signature and anomaly detection), cloud-native monitoring services (provider-specific API and resource activity), and threat intelligence feeds (IOC matching and adversary context).
  • An alert pipeline transforms raw events through collection, normalisation, aggregation, correlation, deduplication, and enrichment before delivering a single, self-contained alert to the analyst. Each stage reduces volume and adds context.
  • Signature-based detection is reliable for known threats and produces few false positives; behaviour-based detection catches novel techniques but requires more tuning and analyst judgement to manage false positives.
  • Cloud-native detection services (AWS GuardDuty, Microsoft Defender for Cloud, Google Security Command Center) cover their own provider's telemetry only. Multi-cloud environments need a common SIEM or XDR layer to correlate cross-provider activity and close detection gaps.
  • Alert fatigue is the primary operational risk of a poorly tuned pipeline. Tuning is continuous: rules must be reviewed and refined as the environment changes, and each false positive is a signal that a rule needs adjustment.
What is the difference between a SIEM and an EDR in a detection pipeline?
A SIEM (Security Information and Event Management) platform collects, aggregates, and correlates log and event data from many sources across the environment to detect patterns spanning multiple systems. An EDR (Endpoint Detection and Response) tool runs directly on endpoints, monitors process behaviour and file activity in real time, and can take automated containment actions on that device. In most organisations the EDR feeds its telemetry into the SIEM, which combines it with network, identity, and cloud signals for broader correlation.
What does alert deduplication do in an alert pipeline?
Deduplication identifies multiple alerts that describe the same underlying event and collapses them into a single alert before they reach the analyst queue. Without deduplication, a single malware execution can generate dozens of nearly identical alerts from different detection rules or from the same rule firing on repeated log entries. The result would be alert fatigue: analysts spend time closing duplicates rather than investigating new activity.
How does a threat intelligence feed improve detection quality?
A threat intelligence feed supplies indicators of compromise (IOCs) such as malicious IP addresses, domain names, file hashes, and certificate fingerprints. When a SIEM or network sensor matches observed traffic or file activity against a current feed, it can generate a high-confidence alert even without a behavioural rule that would have caught the same activity. Feeds also carry context about adversary groups and tactics, which helps analysts prioritise and scope an incident more quickly.
Why do cloud environments need their own detection sources rather than relying on the on-premises SIEM alone?
Cloud environments generate activity logs in formats and locations specific to each provider: AWS CloudTrail, Microsoft Azure Monitor Logs, and Google Cloud Audit Logs all record different event schemas that on-premises log collectors were not designed to ingest. Cloud services also expose threat surfaces that have no on-premises equivalent, such as misconfigured storage buckets, overprivileged service accounts, and serverless function abuse. Cloud-native monitoring services like AWS GuardDuty and Microsoft Defender for Cloud understand these surfaces and generate alerts tailored to cloud-specific attack techniques.
What is the difference between a network IDS and a network IPS?
Both a network intrusion detection system (NIDS) and a network intrusion prevention system (NIPS) inspect traffic for known attack patterns. The difference is action: an IDS passively detects and generates an alert, while an IPS sits inline in the traffic path and can drop or reset a connection when a rule fires. The IPS can stop an attack in progress without waiting for human action, but misconfigured prevention rules can block legitimate traffic, so organisations often start with detection-only mode and promote rules to prevention only after tuning.

Test yourself on Incident Response and Management with free, timed mocks.

Practice Incident Response and Management questions

Found this useful? Pass it along.

Share

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.

Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.