Penetration Testing Scope and Audit Interface
A penetration test is a controlled attack commissioned to discover exploitable weaknesses before adversaries do; its findings become evidence for the security audit rather than a substitute for it. This topic explains how organisations commission, scope, and interpret penetration tests, and how auditors consume the results without duplicating forensic or incident-response activities.
Last updated:
A penetration test is a time-bounded, authorised simulation of an attack against an organisation's systems, networks, or applications. Testers use the same techniques as real adversaries but operate under a written contract that defines what they may target, what they may do, and what they must not touch. The output is a report documenting discovered vulnerabilities, their exploitability, the evidence gathered, and recommended remediations. For the security audit function, that report is a primary source of technical evidence: it shows not just whether a control exists on paper, but whether it can be bypassed in practice.
Penetration testing and security auditing serve different purposes and must not be conflated. An audit evaluates whether controls are designed correctly and operating as intended, drawing on policy documents, configuration reviews, interviews, and technical tests. A penetration test focuses exclusively on finding weaknesses that an attacker could exploit. The audit uses penetration test results as evidence; the penetration test does not substitute for the broader audit process. Organisations that treat a clean penetration test report as proof of compliance are making a governance error: a test scoped to the external perimeter says nothing about insider threats, physical controls, or process failures.
Compliance regimes have different positions on penetration testing. PCI-DSS (Payment Card Industry Data Security Standard) requires annual penetration tests and tests after significant changes. ISO/IEC 27001 does not mandate penetration testing but lists it as a control option in Annex A (control 8.8 in the 2022 edition covers management of technical vulnerabilities; 8.29 covers security testing in development). The NIST Cybersecurity Framework lists penetration testing under the Identify and Protect functions. India's Digital Personal Data Protection Act 2023, the EU General Data Protection Regulation, and HIPAA in the United States do not prescribe penetration testing by name but require demonstrable technical and organisational measures, for which penetration test evidence is commonly used. Auditors working across jurisdictions must understand what each regime actually requires versus what it merely implies.
By the end of this topic you will be able to:
- Describe the four main penetration test types (black-box, white-box, grey-box, red team) and select the appropriate type for a given audit objective.
- Draft the key elements of a rules-of-engagement document and explain why each element is legally and operationally necessary.
- Read a penetration test report executive summary and findings list, map findings to control framework requirements, and triage them by severity for remediation planning.
- Explain the boundary between a penetration test and an incident-response engagement, and describe the escalation path when a tester discovers evidence of a prior compromise.
- Describe how penetration test findings feed into the risk register and the audit report without duplicating forensic or incident-response activities.
- Rules of engagement (RoE)
- The written contract or pre-test agreement that defines the authorised scope, permitted techniques, excluded systems, test window, escalation contacts, and emergency stop criteria for a penetration test. Without a valid RoE, a penetration test may constitute unauthorised computer access under applicable law.
- Scope creep
- The unintended expansion of a penetration test beyond the agreed boundaries, either because testers follow a vulnerability chain into an out-of-scope system or because the client adds targets mid-engagement without a formal scope change. Scope creep creates legal exposure and invalidates the original test design.
- Common Vulnerability Scoring System (CVSS)
- A standardised scoring framework that rates vulnerability severity on a 0-10 scale using base metrics (attack vector, complexity, privileges required, user interaction, impact on confidentiality, integrity, and availability), temporal metrics, and environmental metrics. CVSS scores appear in penetration test reports as a common severity reference.
- Red team exercise
- A full-scope adversary simulation in which a team of testers uses the full range of attack techniques (technical, social engineering, and physical) over an extended period, typically without most internal staff knowing the exercise is underway. More comprehensive than a standard penetration test; used to test detection and response capability as well as preventive controls.
- Attestation letter
- A formal document issued by a qualified assessor, such as a PCI Qualified Security Assessor (QSA) or an ISO 27001 certification body, confirming that an organisation has met the requirements of a particular standard or compliance regime. Auditors should not substitute a penetration test report for an attestation letter when the applicable regime requires one.
- Remediation verification test
- A targeted re-test conducted after an organisation has applied fixes to vulnerabilities identified in the original penetration test. The re-test confirms that the specific findings are resolved and that the fix has not introduced new weaknesses. Distinct from a full penetration test and typically lower cost.
Test types and when to use each
The choice of penetration test type determines what the results can and cannot tell an auditor. Four types are in common use, and their differences matter when mapping findings to compliance requirements.
| Test type | Information given to tester | Best suited for | Audit use case |
|---|---|---|---|
| Black-box | Target names only; no credentials, architecture, or source code | Simulating an external unauthenticated attacker | Perimeter exposure; regulatory requirements specifying external testing (e.g., PCI-DSS Requirement 11.4) |
| White-box | Full documentation: architecture, source code, credentials, network maps | Finding implementation flaws efficiently; code-level security review | Pre-deployment testing; internal audit of development practices |
| Grey-box | Partial information: IP ranges, user-level credentials, basic architecture overview | Simulating an attacker with some foothold (phished credential, insider) | Most common commercial arrangement; balances coverage and realism |
| Red team | No information; full adversary simulation including social engineering and physical access attempts | Testing detection and response capability over an extended period | Mature security programmes; regulatory exercises (e.g., TIBER-EU for financial sector) |
Auditors should match the test type to the audit objective before commissioning. A black-box test scoped to the internet-facing perimeter will not find a misconfigured internal database server. A white-box test of the web application layer will not reveal whether the network perimeter can be breached. Specifying the wrong test type is the most common commissioning error, and it produces findings that satisfy neither the compliance requirement nor the organisation's actual risk question.
Commissioning a penetration test: the rules of engagement
Every penetration test must be preceded by a rules-of-engagement document signed by the asset owner or their authorised delegate. This is not a formality: without written authorisation, the tester's activities may constitute unauthorised access under the UK Computer Misuse Act 1990, the US Computer Fraud and Abuse Act, India's Information Technology Act 2000, or equivalent statutes in other jurisdictions. The document must be in place before any test activity begins, including reconnaissance.
A complete rules-of-engagement document covers the following elements. Scope defines the IP ranges, domain names, application URLs, or physical locations that are in scope. Out-of-scope assets must be listed explicitly; ambiguity defaults to out-of-scope. Permitted techniques list what the tester may do: network scanning, web application attacks, credential brute-forcing, social engineering, physical access attempts. Each category that is not listed is excluded. The test window specifies start and end dates and times, which matters for operational systems where testing during business hours creates service-disruption risk. Communication protocol defines who the tester contacts if they find a critical vulnerability that poses immediate risk, and who they contact in an emergency. The emergency stop condition defines what triggers an immediate halt: commonly, discovery of an active breach by a third party, or any action that would cause data loss. Data handling rules cover what the tester may retain, how long, and how it must be destroyed after the report is delivered.
Auditors reviewing a penetration test engagement should verify that a signed RoE exists and that its scope matches the systems the audit needs to assess. A test conducted under an inadequate RoE produces findings that cannot be relied upon as audit evidence because the test may not have been conducted lawfully or consistently.
Reading and interpreting a penetration test report
Penetration test reports follow a broadly standard structure, though format varies by vendor. The sections an auditor must read carefully are: the executive summary, the methodology statement, the findings list with severity ratings, the evidence appendix, and the remediation recommendations. The cover page and table of contents are irrelevant to the audit.
The executive summary states the overall risk posture in non-technical language, summarises the number of findings by severity, and highlights any critical findings that require immediate attention. An auditor should read this section with one question in mind: does the overall posture stated here match the risk level in the organisation's risk register? If the risk register records a network perimeter risk as low and the penetration test found three critical external vulnerabilities, that discrepancy is itself an audit finding.
Each finding in the body of the report should include: a title, a severity rating (Critical, High, Medium, Low, Informational, often accompanied by a CVSS score), a description of the vulnerability, evidence (screenshots, HTTP request and response captures, command outputs), the impact if exploited, and a recommendation. An auditor maps each finding to the relevant control: for ISO/IEC 27001, this means the relevant Annex A control; for PCI-DSS, the relevant requirement number; for NIST CSF, the relevant subcategory. If a finding maps to a control that the organisation has stated is fully implemented, the finding is evidence that the control is ineffective, which is a compliance gap.
Mapping findings to control frameworks and compliance regimes
A penetration test report is raw technical evidence. Its value to the audit function depends on how findings are mapped to the control framework the organisation has adopted. Without this mapping, the report is a list of technical problems without governance context; with it, the report becomes evidence that specific controls are absent, degraded, or bypassable.
For organisations certified or certifying to ISO/IEC 27001, each finding should be mapped to the relevant Annex A control or to the Statement of Applicability. A finding of unpatched systems maps to control 8.8 (management of technical vulnerabilities). A finding of weak authentication maps to control 8.5 (secure authentication). If the Statement of Applicability marks a control as implemented and a penetration test finding demonstrates it is ineffective, the ISMS has a nonconformity that must be recorded and remediated before the next certification audit.
For PCI-DSS environments, the mapping is prescribed. Requirement 11.4 requires internal and external penetration testing at least annually, and after significant changes. Requirement 11.4.4 requires that exploitable vulnerabilities found are corrected, and a re-test confirms correction. Requirement 11.4.5 requires that penetration testing includes testing of network segmentation controls if the organisation uses segmentation to reduce cardholder data environment scope. An auditor reviewing PCI-DSS compliance must verify that the test covered these specific elements; a test that omitted segmentation testing is incomplete regardless of its findings.
For organisations subject to India's Digital Personal Data Protection Act 2023, the EU GDPR, or the UK GDPR, penetration test findings involving personal data systems carry direct regulatory significance. A finding that personal data is accessible without authorisation may constitute a personal data breach requiring notification under Article 33 of the EU GDPR (72-hour notification window to the supervisory authority) or the equivalent provision of the DPDP Act 2023. Auditors must assess whether any finding triggers a reporting obligation and escalate if so, independently of the remediation timeline.
The boundary between penetration testing and incident response
Penetration testers occasionally discover evidence that a system has already been compromised by a third party: unfamiliar user accounts, unexpected outbound connections, malware artifacts, or data exfiltration indicators. When this happens, the engagement has moved into incident-response territory, and the tester must stop, preserve evidence, and notify the designated contact immediately. Continuing the penetration test in the presence of an active compromise risks destroying forensic evidence, interfering with an incident investigation, and creating legal complications about who conducted what activity on the system.
The rules-of-engagement document should define this escalation path explicitly. The tester's obligation is to: stop all activity on the affected system, document what they observed and when, notify the designated contact (not a general help desk), and await instruction. The incident-response team then assesses the situation and decides whether the penetration test continues on other in-scope systems while the compromise is investigated. Auditors should verify that the RoE contains this escalation clause before approving a test. See Incident Response and Management for the full incident-response framework that would take over in this scenario.
Penetration testing is also not digital forensics. Penetration testers are not trained evidence collectors; they are trained to find and exploit weaknesses. If a penetration test produces output that may be used in criminal or civil proceedings (for example, because a finding reveals criminal conduct by an insider), the chain of custody for that evidence is almost certainly broken. Auditors should advise that forensic-grade evidence collection requires a separate engagement by qualified digital forensic practitioners following established collection procedures. The distinction matters across jurisdictions: in India, the Bharatiya Sakshya Adhiniyam 2023 governs the admissibility of electronic records; in the EU and US, digital evidence standards vary by proceeding type.
Remediation, verification, and closing the audit loop
Finding a vulnerability is not the end of the process. The audit function must track remediation of each finding to closure. A structured remediation workflow has four steps: triage (prioritise findings by severity and business impact), assign (allocate each finding to an owner with a target date), fix (implement the remediation), and verify (confirm the fix works, either through a targeted re-test or, for lower-severity findings, through a configuration review or log analysis).
Remediation verification is not optional for critical and high findings. An organisation that marks a critical finding as remediated without a re-test has an unverified assumption in its risk register. For PCI-DSS, Requirement 11.4.4 mandates re-testing after remediation. For ISO/IEC 27001, the internal audit programme should schedule a follow-up review. For general good practice, a re-test by the same vendor is efficient because they already understand the vulnerability; however, a re-test by a different party provides additional assurance that the fix is complete.
Some findings cannot be remediated immediately, typically because they require architectural changes, vendor patches that are not yet available, or budget approval cycles. These findings must be accepted as residual risk through a formal risk acceptance process, documented in the risk register with the accepting owner's name and review date, and not simply marked as closed. An auditor who finds unacknowledged open findings from a prior penetration test, findings that were neither remediated nor formally accepted, has discovered a control failure in the risk management process itself, separate from the technical vulnerability.
An organisation's Statement of Applicability records ISO 27001 Annex A control 8.5 (secure authentication) as fully implemented. A penetration test then finds that the web application accepts passwords of four characters with no complexity requirement. What is the correct audit conclusion?
Key Takeaways
- Penetration testing and security auditing are complementary, not interchangeable: the audit uses penetration test findings as technical evidence that controls are effective or not, but a clean penetration test report does not equal compliance with any regime.
- Every penetration test must be preceded by a signed rules-of-engagement document that specifies scope, permitted techniques, excluded systems, test window, emergency contacts, and escalation conditions; without it, the test may not be lawful under applicable computer access statutes.
- Auditors must map each finding to the relevant control framework requirement (ISO 27001 Annex A, PCI-DSS requirement number, NIST CSF subcategory) to convert raw technical findings into governance evidence; a finding that contradicts the Statement of Applicability or risk register is itself a compliance gap.
- When a tester discovers evidence of an active third-party compromise, the penetration test stops immediately on the affected system: the tester documents and notifies, and the incident-response function takes over; penetration testers are not forensic evidence collectors and their work does not produce forensic-grade evidence for legal proceedings.
- Critical and High findings must be verified closed through a targeted re-test, not through developer self-attestation; findings that cannot be remediated promptly must be formally accepted as residual risk in the risk register with a named owner and review date.
What is the difference between a penetration test and a security audit?
What should a rules-of-engagement document contain?
How should an auditor read a penetration test report?
What is the difference between a black-box and a white-box penetration test?
When does a penetration test finding become a forensic or incident-response matter?
Test yourself on Information Security Audit and Compliance with free, timed mocks.
Practice Information Security Audit and Compliance questionsSpotted an error in this page? Report a correction or read our editorial standards.