Penetration Testing Scope and Audit Interface

A penetration test is a controlled attack commissioned to discover exploitable weaknesses before adversaries do; its findings become evidence for the security audit rather than a substitute for it. This topic explains how organisations commission, scope, and interpret penetration tests, and how auditors consume the results without duplicating forensic or incident-response activities.

Last updated: 24 Jun 2026

A penetration test is a time-bounded, authorised simulation of an attack against an organisation's systems, networks, or applications. Testers use the same techniques as real adversaries but operate under a written contract that defines what they may target, what they may do, and what they must not touch. The output is a report documenting discovered vulnerabilities, their exploitability, the evidence gathered, and recommended remediations. For the security audit function, that report is a primary source of technical evidence: it shows not just whether a control exists on paper, but whether it can be bypassed in practice.

Penetration testing and security auditing serve different purposes and must not be conflated. An audit evaluates whether controls are designed correctly and operating as intended, drawing on policy documents, configuration reviews, interviews, and technical tests. A penetration test focuses exclusively on finding weaknesses that an attacker could exploit. The audit uses penetration test results as evidence; the penetration test does not substitute for the broader audit process. Organisations that treat a clean penetration test report as proof of compliance are making a governance error: a test scoped to the external perimeter says nothing about insider threats, physical controls, or process failures.

Compliance regimes have different positions on penetration testing. PCI-DSS (Payment Card Industry Data Security Standard) requires annual penetration tests and tests after significant changes. ISO/IEC 27001 does not mandate penetration testing but lists it as a control option in Annex A (control 8.8 in the 2022 edition covers management of technical vulnerabilities; 8.29 covers security testing in development). The NIST Cybersecurity Framework lists penetration testing under the Identify and Protect functions. India's Digital Personal Data Protection Act 2023, the EU General Data Protection Regulation, and HIPAA in the United States do not prescribe penetration testing by name but require demonstrable technical and organisational measures, for which penetration test evidence is commonly used. Auditors working across jurisdictions must understand what each regime actually requires versus what it merely implies.

By the end of this topic you will be able to:

Describe the four main penetration test types (black-box, white-box, grey-box, red team) and select the appropriate type for a given audit objective.
Draft the key elements of a rules-of-engagement document and explain why each element is legally and operationally necessary.
Read a penetration test report executive summary and findings list, map findings to control framework requirements, and triage them by severity for remediation planning.
Explain the boundary between a penetration test and an incident-response engagement, and describe the escalation path when a tester discovers evidence of a prior compromise.
Describe how penetration test findings feed into the risk register and the audit report without duplicating forensic or incident-response activities.

Key terms

Rules of engagement (RoE): The written contract or pre-test agreement that defines the authorised scope, permitted techniques, excluded systems, test window, escalation contacts, and emergency stop criteria for a penetration test. Without a valid RoE, a penetration test may constitute unauthorised computer access under applicable law.
Scope creep: The unintended expansion of a penetration test beyond the agreed boundaries, either because testers follow a vulnerability chain into an out-of-scope system or because the client adds targets mid-engagement without a formal scope change. Scope creep creates legal exposure and invalidates the original test design.
Common Vulnerability Scoring System (CVSS): A standardised scoring framework that rates vulnerability severity on a 0-10 scale using base metrics (attack vector, complexity, privileges required, user interaction, impact on confidentiality, integrity, and availability), temporal metrics, and environmental metrics. CVSS scores appear in penetration test reports as a common severity reference.
Red team exercise: A full-scope adversary simulation in which a team of testers uses the full range of attack techniques (technical, social engineering, and physical) over an extended period, typically without most internal staff knowing the exercise is underway. More comprehensive than a standard penetration test; used to test detection and response capability as well as preventive controls.
Attestation letter: A formal document issued by a qualified assessor, such as a PCI Qualified Security Assessor (QSA) or an ISO 27001 certification body, confirming that an organisation has met the requirements of a particular standard or compliance regime. Auditors should not substitute a penetration test report for an attestation letter when the applicable regime requires one.
Remediation verification test: A targeted re-test conducted after an organisation has applied fixes to vulnerabilities identified in the original penetration test. The re-test confirms that the specific findings are resolved and that the fix has not introduced new weaknesses. Distinct from a full penetration test and typically lower cost.

Test types and when to use each

The choice of penetration test type determines what the results can and cannot tell an auditor. Four types are in common use, and their differences matter when mapping findings to compliance requirements.

Test type	Information given to tester	Best suited for	Audit use case
Black-box	Target names only; no credentials, architecture, or source code	Simulating an external unauthenticated attacker	Perimeter exposure; regulatory requirements specifying external testing (e.g., PCI-DSS Requirement 11.4)
White-box	Full documentation: architecture, source code, credentials, network maps	Finding implementation flaws efficiently; code-level security review	Pre-deployment testing; internal audit of development practices
Grey-box	Partial information: IP ranges, user-level credentials, basic architecture overview	Simulating an attacker with some foothold (phished credential, insider)	Most common commercial arrangement; balances coverage and realism
Red team	No information; full adversary simulation including social engineering and physical access attempts	Testing detection and response capability over an extended period	Mature security programmes; regulatory exercises (e.g., TIBER-EU for financial sector)

Auditors should match the test type to the audit objective before commissioning. A black-box test scoped to the internet-facing perimeter will not find a misconfigured internal database server. A white-box test of the web application layer will not reveal whether the network perimeter can be breached. Specifying the wrong test type is the most common commissioning error, and it produces findings that satisfy neither the compliance requirement nor the organisation's actual risk question.

Commissioning a penetration test: the rules of engagement

Every penetration test must be preceded by a rules-of-engagement document signed by the asset owner or their authorised delegate. This is not a formality: without written authorisation, the tester's activities may constitute unauthorised access under the UK Computer Misuse Act 1990, the US Computer Fraud and Abuse Act, India's Information Technology Act 2000, or equivalent statutes in other jurisdictions. The document must be in place before any test activity begins, including reconnaissance.

A complete rules-of-engagement document covers the following elements. Scope defines the IP ranges, domain names, application URLs, or physical locations that are in scope. Out-of-scope assets must be listed explicitly; ambiguity defaults to out-of-scope. Permitted techniques list what the tester may do: network scanning, web application attacks, credential brute-forcing, social engineering, physical access attempts. Each category that is not listed is excluded. The test window specifies start and end dates and times, which matters for operational systems where testing during business hours creates service-disruption risk. Communication protocol defines who the tester contacts if they find a critical vulnerability that poses immediate risk, and who they contact in an emergency. The emergency stop condition defines what triggers an immediate halt: commonly, discovery of an active breach by a third party, or any action that would cause data loss. Data handling rules cover what the tester may retain, how long, and how it must be destroyed after the report is delivered.

Auditors reviewing a penetration test engagement should verify that a signed RoE exists and that its scope matches the systems the audit needs to assess. A test conducted under an inadequate RoE produces findings that cannot be relied upon as audit evidence because the test may not have been conducted lawfully or consistently.

Reading and interpreting a penetration test report

Penetration test reports follow a broadly standard structure, though format varies by vendor. The sections an auditor must read carefully are: the executive summary, the methodology statement, the findings list with severity ratings, the evidence appendix, and the remediation recommendations. The cover page and table of contents are irrelevant to the audit.

The executive summary states the overall risk posture in non-technical language, summarises the number of findings by severity, and highlights any critical findings that require immediate attention. An auditor should read this section with one question in mind: does the overall posture stated here match the risk level in the organisation's risk register? If the risk register records a network perimeter risk as low and the penetration test found three critical external vulnerabilities, that discrepancy is itself an audit finding.

Each finding in the body of the report should include: a title, a severity rating (Critical, High, Medium, Low, Informational, often accompanied by a CVSS score), a description of the vulnerability, evidence (screenshots, HTTP request and response captures, command outputs), the impact if exploited, and a recommendation. An auditor maps each finding to the relevant control: for ISO/IEC 27001, this means the relevant Annex A control; for PCI-DSS, the relevant requirement number; for NIST CSF, the relevant subcategory. If a finding maps to a control that the organisation has stated is fully implemented, the finding is evidence that the control is ineffective, which is a compliance gap.

Mapping findings to control frameworks and compliance regimes

A penetration test report is raw technical evidence. Its value to the audit function depends on how findings are mapped to the control framework the organisation has adopted. Without this mapping, the report is a list of technical problems without governance context; with it, the report becomes evidence that specific controls are absent, degraded, or bypassable.

For organisations certified or certifying to ISO/IEC 27001, each finding should be mapped to the relevant Annex A control or to the Statement of Applicability. A finding of unpatched systems maps to control 8.8 (management of technical vulnerabilities). A finding of weak authentication maps to control 8.5 (secure authentication). If the Statement of Applicability marks a control as implemented and a penetration test finding demonstrates it is ineffective, the ISMS has a nonconformity that must be recorded and remediated before the next certification audit.

For PCI-DSS environments, the mapping is prescribed. Requirement 11.4 requires internal and external penetration testing at least annually, and after significant changes. Requirement 11.4.4 requires that exploitable vulnerabilities found are corrected, and a re-test confirms correction. Requirement 11.4.5 requires that penetration testing includes testing of network segmentation controls if the organisation uses segmentation to reduce cardholder data environment scope. An auditor reviewing PCI-DSS compliance must verify that the test covered these specific elements; a test that omitted segmentation testing is incomplete regardless of its findings.

For organisations subject to India's Digital Personal Data Protection Act 2023, the EU GDPR, or the UK GDPR, penetration test findings involving personal data systems carry direct regulatory significance. A finding that personal data is accessible without authorisation may constitute a personal data breach requiring notification under Article 33 of the EU GDPR (72-hour notification window to the supervisory authority) or the equivalent provision of the DPDP Act 2023. Auditors must assess whether any finding triggers a reporting obligation and escalate if so, independently of the remediation timeline.

The boundary between penetration testing and incident response

Penetration testers occasionally discover evidence that a system has already been compromised by a third party: unfamiliar user accounts, unexpected outbound connections, malware artifacts, or data exfiltration indicators. When this happens, the engagement has moved into incident-response territory, and the tester must stop, preserve evidence, and notify the designated contact immediately. Continuing the penetration test in the presence of an active compromise risks destroying forensic evidence, interfering with an incident investigation, and creating legal complications about who conducted what activity on the system.

The rules-of-engagement document should define this escalation path explicitly. The tester's obligation is to: stop all activity on the affected system, document what they observed and when, notify the designated contact (not a general help desk), and await instruction. The incident-response team then assesses the situation and decides whether the penetration test continues on other in-scope systems while the compromise is investigated. Auditors should verify that the RoE contains this escalation clause before approving a test. See Incident Response and Management for the full incident-response framework that would take over in this scenario.

Penetration testing is also not digital forensics. Penetration testers are not trained evidence collectors; they are trained to find and exploit weaknesses. If a penetration test produces output that may be used in criminal or civil proceedings (for example, because a finding reveals criminal conduct by an insider), the chain of custody for that evidence is almost certainly broken. Auditors should advise that forensic-grade evidence collection requires a separate engagement by qualified digital forensic practitioners following established collection procedures. The distinction matters across jurisdictions: in India, the Bharatiya Sakshya Adhiniyam 2023 governs the admissibility of electronic records; in the EU and US, digital evidence standards vary by proceeding type.

Remediation, verification, and closing the audit loop

Finding a vulnerability is not the end of the process. The audit function must track remediation of each finding to closure. A structured remediation workflow has four steps: triage (prioritise findings by severity and business impact), assign (allocate each finding to an owner with a target date), fix (implement the remediation), and verify (confirm the fix works, either through a targeted re-test or, for lower-severity findings, through a configuration review or log analysis).

Remediation verification is not optional for critical and high findings. An organisation that marks a critical finding as remediated without a re-test has an unverified assumption in its risk register. For PCI-DSS, Requirement 11.4.4 mandates re-testing after remediation. For ISO/IEC 27001, the internal audit programme should schedule a follow-up review. For general good practice, a re-test by the same vendor is efficient because they already understand the vulnerability; however, a re-test by a different party provides additional assurance that the fix is complete.

Some findings cannot be remediated immediately, typically because they require architectural changes, vendor patches that are not yet available, or budget approval cycles. These findings must be accepted as residual risk through a formal risk acceptance process, documented in the risk register with the accepting owner's name and review date, and not simply marked as closed. An auditor who finds unacknowledged open findings from a prior penetration test, findings that were neither remediated nor formally accepted, has discovered a control failure in the risk management process itself, separate from the technical vulnerability.

Worked example

Scoping and consuming a penetration test for a PCI-DSS assessment

Following a grey-box penetration test from commissioning through findings triage to audit closure for an e-commerce organisation in scope for PCI-DSS.

An e-commerce company processes card payments through a hosted payment page. The cardholder data environment (CDE) consists of three web servers, two application servers, a payment gateway integration, and the network segment connecting them. The internal audit team commissions an annual penetration test to meet PCI-DSS Requirement 11.4. Here is the sequence from commissioning to audit closure.

Scope definition. The audit team lists the in-scope assets: the three web servers by IP address and hostname, the two application servers, the payment gateway integration endpoint, and the network segment. They explicitly exclude the corporate office network (not part of the CDE) and the cloud infrastructure management console (not in scope for this test; covered by a separate cloud security review). The test type is grey-box: the tester receives the IP ranges and a standard user account but no admin credentials or architecture diagrams.
Rules of engagement signed. The RoE is signed by the CTO (asset owner), specifies a two-week test window on weekday evenings to avoid peak traffic, lists the vendor's test IP addresses for firewall whitelisting, names the Head of Security as the emergency contact, and includes the escalation clause: if the tester finds evidence of a third-party compromise, they stop, document, and call the Head of Security immediately.
Test conducted. The vendor conducts the test and delivers a report. The report contains: two Critical findings (SQL injection on the checkout endpoint allowing database access; unpatched Apache with a known remote-code-execution CVE), three High findings (weak TLS configuration, missing HTTP security headers, verbose error messages disclosing server version), and four Medium or Low findings.
Audit mapping. The internal auditor maps each finding to the PCI-DSS requirement. The SQL injection finding maps to Requirement 6.2.4 (software development practices) and Requirement 6.3.1 (security vulnerabilities). The unpatched Apache maps to Requirement 6.3.3 (all system components protected from known vulnerabilities). The auditor also notes that the SQL injection finding may constitute a personal data breach under GDPR because customer order data is accessible; this is escalated to the Data Protection Officer the same day.
Remediation tracking. The two Critical findings are assigned to the development team and infrastructure team respectively, with a seven-day remediation target. The three High findings are assigned a fourteen-day target. All findings are entered into the risk register with owner, due date, and current status. The four Medium and Low findings are scheduled for the next sprint cycle with formal risk acceptance for any not resolved within thirty days.
Re-test and closure. After seventeen days, the vendor conducts a targeted re-test of the two Critical and three High findings. All five are confirmed resolved. The internal auditor updates the risk register, marks the findings as closed with re-test evidence attached, and includes the penetration test report and re-test confirmation in the PCI-DSS evidence package for the Qualified Security Assessor review.

Check your understanding

Question 1 of 4· 0 answered

An organisation's Statement of Applicability records ISO 27001 Annex A control 8.5 (secure authentication) as fully implemented. A penetration test then finds that the web application accepts passwords of four characters with no complexity requirement. What is the correct audit conclusion?

Key Takeaways

Penetration testing and security auditing are complementary, not interchangeable: the audit uses penetration test findings as technical evidence that controls are effective or not, but a clean penetration test report does not equal compliance with any regime.
Every penetration test must be preceded by a signed rules-of-engagement document that specifies scope, permitted techniques, excluded systems, test window, emergency contacts, and escalation conditions; without it, the test may not be lawful under applicable computer access statutes.
Auditors must map each finding to the relevant control framework requirement (ISO 27001 Annex A, PCI-DSS requirement number, NIST CSF subcategory) to convert raw technical findings into governance evidence; a finding that contradicts the Statement of Applicability or risk register is itself a compliance gap.
When a tester discovers evidence of an active third-party compromise, the penetration test stops immediately on the affected system: the tester documents and notifies, and the incident-response function takes over; penetration testers are not forensic evidence collectors and their work does not produce forensic-grade evidence for legal proceedings.
Critical and High findings must be verified closed through a targeted re-test, not through developer self-attestation; findings that cannot be remediated promptly must be formally accepted as residual risk in the risk register with a named owner and review date.

What is the difference between a penetration test and a security audit?

A penetration test is a controlled attack that discovers exploitable weaknesses by simulating adversary techniques. A security audit evaluates whether controls exist and operate as designed, using evidence from multiple sources including penetration test reports. The two activities are complementary: pen test findings provide technical evidence that auditors use to assess control effectiveness, but neither replaces the other.

What should a rules-of-engagement document contain?

Rules of engagement define the authorised scope (IP ranges, domains, application URLs), permitted techniques (network, web application, social engineering), excluded systems, test window, communication protocols, emergency stop criteria, and legal authorisation from the asset owner. Without this document, a penetration test carries legal risk for both the tester and the commissioning organisation.

How should an auditor read a penetration test report?

Start with the executive summary to understand scope and overall risk posture, then move to the findings list sorted by severity. For each finding, examine the evidence (screenshots, request/response captures), the root cause, and the recommended remediation. Map each finding to the relevant control in the applicable framework (ISO 27001 Annex A, NIST CSF, PCI-DSS requirement) before assessing whether the control is absent, ineffective, or bypassed.

What is the difference between a black-box and a white-box penetration test?

In a black-box test, the tester receives no prior knowledge of the target environment and works from the same position as an external attacker. In a white-box test, the tester receives full documentation: architecture diagrams, source code, credentials, and network maps. White-box testing is more thorough and cost-efficient for finding implementation flaws; black-box testing better simulates external threat scenarios. Grey-box tests provide partial information, which is the most common commercial arrangement.

When does a penetration test finding become a forensic or incident-response matter?

A penetration test finding becomes an incident-response matter if the tester discovers evidence of a prior or ongoing compromise that is outside the test scope. The rules of engagement should define the reporting obligation: the tester halts the test, notifies the designated contact, and preserves evidence. The incident-response team then takes over. Auditors should verify that the rules of engagement include this escalation path before approving a test.

Test yourself on Information Security Audit and Compliance with free, timed mocks.

Practice Information Security Audit and Compliance questions

Found this useful? Pass it along.

Spotted an error in this page? Report a correction or read our editorial standards.

Key Takeaways

Your journey to becoming a forensic professional starts here.