Skip to content

Operating Systems, Boot Process and File Systems

UEFI to MFT to APFS snapshots, MAC timestamps, file slack and shadow copies, mapped to digital forensics and BSA 2023 Section 63 admissibility.

Last updated:

Share

An operating system manages process scheduling, memory, file systems, I/O, and access control, and exposes each of these functions as forensic evidence: prefetch entries, swap files, file-system journals, USB installation logs, and login records respectively. The boot sequence runs from UEFI firmware through Secure Boot and TPM 2.0 measurements to the bootloader and kernel, with each stage producing its own durable artifact. File systems are transactional ledgers with their own journals, timestamp models, and deletion semantics: on NTFS a deleted file's MFT record typically survives, on ext4 the inode is cleared but journal extents may remain, and on APFS deletion can be reversed through a snapshot the OS still holds. Examiners routinely recover data from hidden zones including slack space, the page file, the hibernation file, the Recycle Bin, and Volume Shadow Copies that users had no knowledge of.

An operating system is the layer that schedules processes, manages RAM and virtual memory, owns the file system, and brokers every I/O request from user-space to hardware. The forensic examiner sees the OS through its persistent artifacts: partition tables (MBR or GPT), file system structures (FAT, exFAT, NTFS, ext2/3/4, HFS+, APFS), the journal that records changes-in-flight, and the metadata each file carries (MAC timestamps, attributes, permissions). The boot process starts in firmware (BIOS or UEFI), is gated by Secure Boot and TPM 2.0 measurements on modern Windows and Linux installs, and hands off to a bootloader that loads the kernel into RAM. Every step writes evidence: UEFI variables, bootloader logs, kernel ring buffer, file-system journal entries. A trained examiner reads them in that order.

Key takeaways

  • The boot sequence runs from UEFI firmware through Secure Boot and TPM 2.0 measurements to the bootloader and kernel, with each step writing its own evidence: UEFI variables, bootloader logs, and the kernel ring buffer.
  • On NTFS a deleted file's MFT entry typically still exists, on ext4 the inode is cleared but extents may survive in the journal, and on APFS deletion may be reversible through a snapshot the OS still holds.
  • The file system is a transactional ledger with its own journal, its own time model, and its own definition of deleted, not simply a container for files.
  • Hidden zones including slack space, the page file, the hibernation file, the Recycle Bin, and Volume Shadow Copies on Windows hold data the user believes is gone and are a routine source of forensic recovery.
  • An OS has five jobs the examiner must name without hesitation: process scheduling, memory management with virtual memory and paging, file-system management, I/O management, and security and access control.

The point the working examiner sees, and the contrarian point most candidates miss, is that the file system is not just "where files live." It is a transactional ledger with its own journal, its own time model, and its own definition of "deleted." On NTFS, a deleted file's MFT entry typically still exists; on ext4, the inode is cleared but extents may remain in the journal; on APFS, the deletion may be reversible through a snapshot the OS still has. The hidden zones (slack space, page file, hibernation file, $Recycle.Bin, Volume Shadow Copy on Windows; swap and ~/.Trash on Linux and macOS) hold data the user thinks is gone. Hardware (covered in Computer Hardware Fundamentals) only sets the upper bound. The OS decides what is actually there.

By the end of this topic you will be able to:

  • Name the five OS functions and identify the specific forensic artifact each one produces.
  • Trace the six boot stages from POST to user-space init and state which artifact each stage writes and whether it survives a reboot.
  • Explain the on-disk structure and deletion behavior of FAT32, exFAT, NTFS, ext4, HFS+, and APFS, and select the correct recovery approach for each.
  • Distinguish file slack, RAM slack, and drive slack, and apply that distinction when enumerating recoverable data surfaces on a seized drive.
  • Describe the evidentiary significance of NTFS $MFT, $LogFile, $UsnJrnl, Volume Shadow Copies, pagefile.sys, and hiberfil.sys in a timeline reconstruction.
Key terms
Kernel space vs user space
Kernel space is the privileged execution mode where the OS schedules processes, drives hardware and enforces memory protection. User space is the unprivileged mode where applications run. A user-space process must syscall into the kernel for every file or network operation.
MBR vs GPT
Master Boot Record is the legacy partition table at LBA 0, limited to 4 primary partitions and 2 TiB. GUID Partition Table is the UEFI-era replacement, supporting around 128 partitions, GUID identifiers, a backup at the end of the disk, and CRC checksums.
Journal (file-system)
A circular log of file-system changes that allows recovery after a crash. NTFS keeps $LogFile, ext3/ext4 keep a dedicated journal block, APFS uses copy-on-write rather than a classical journal.
MAC timestamps
Modified, Accessed, and Changed or Created times. Different file systems use the letters differently: on POSIX it is mtime, atime, ctime (change time); on NTFS it is Modified, Accessed, Created, and MFT-Modified.
File slack
The unused space at the tail of the last cluster of a file. It can contain remnants of whatever previously occupied that cluster, including parts of older deleted files. Examined directly with Autopsy or EnCase.
Volume Shadow Copy
Windows' snapshot mechanism (VSS). Creates point-in-time copies of files and folders. Examiners find shadow copies particularly useful because users rarely know the mechanism exists and Windows often preserves data the user believed was overwritten.

What the OS does, and the OS taxonomy an examiner is expected to know

The OS has five jobs an examiner is expected to name without hesitation: process scheduling, memory management (including virtual memory and paging), file-system management, I/O management, and security or access control. Each job produces forensic artifacts. Process scheduling writes prefetch entries and event-log records (covered later under Windows Forensic Artifacts). Memory management writes a page file or swap file that the examiner can image. File-system management writes the journal entries the lab will replay. I/O management writes device installation logs (USB sticks plugged in, printers attached). Security or access control writes login records.

The kernel-versus-user-space distinction matters because anti-forensic and rootkit techniques live or die on whether they can run kernel-mode code. A kernel rootkit can hide files from a user-mode tool like Windows Explorer; it cannot hide them from a raw disk image read offline. This is the central justification for the dead-disk imaging workflow in Indian forensic SOPs.

The five OS families a digital forensic examiner sees in Indian field practice:

  • Windows. Dominant on government and corporate desktops, the bulk of laptop fleet. Versions 10 and 11 are the working population in 2026; Windows 7 and XP still appear on legacy industrial control kit. NTFS is the default file system.
  • Linux. Server-room dominant; rising on developer laptops. Ubuntu and RHEL derivatives are common; ext4 is the default file system. Indian government cloud (MeghRaj) infrastructure is heavily Linux.
  • macOS. Workstation and creative-shop population, smaller in India than in the West but growing in tier-1 cities. APFS replaced HFS+ as the default for SSDs in 2017 with macOS High Sierra; HDDs and Fusion Drives continued to use HFS+ until later releases.
  • Android. The mobile bulk. Linux kernel, custom user space, ext4 or F2FS on the data partition. Covered in detail in Module 6.
  • iOS. Apple's mobile OS, BSD-derived kernel, APFS file system. Acquisition is its own specialty.

The boot process: POST to kernel

The boot process runs from power-on to a usable OS across six distinct stages. Each stage writes its own evidence, and knowing the stages in order allows the examiner to identify the correct artifact for a given timeline question.

PC boot sequence from firmware to user space. Four labelled stages with the key artifact each produces and whether that artif
PC boot sequence from firmware to user space. Four labelled stages with the key artifact each produces and whether that artifact survives a reboot. UEFI boot log and registry hives survive; RAM contents and kernel ring buffer do not.
  1. POST and firmware init
    Power-On Self Test inside BIOS or UEFI. Inventories DRAM, runs CPU microcode, enumerates PCIe and USB. UEFI writes a boot log to NVRAM that survives power-off; older BIOS does not.
  2. Firmware boot device selection
    UEFI reads its boot-order list from NVRAM and picks the first available device. With Secure Boot on, the firmware verifies the bootloader signature against keys in the UEFI key store before handover.
  3. Partition table read
    Firmware reads LBA 0. If the disk is MBR, the firmware executes the 446-byte boot code in the MBR. If the disk is GPT, the firmware reads the EFI System Partition directly and looks for a signed EFI binary listed in NVRAM. The partition table itself is forensic evidence.
  4. Bootloader execution
    On Windows, bootmgr.efi reads the BCD (Boot Configuration Data) hive and loads winload.efi. On Linux, GRUB or systemd-boot reads its config (grub.cfg or loader.conf) and loads the chosen vmlinuz and initramfs. On macOS, boot.efi handles selection and unlock. Each bootloader writes its own log.
  5. Kernel load
    The kernel image is loaded into RAM, decompressed if needed, the early console comes up, and the kernel mounts the root file system. The Linux ring buffer (dmesg) starts collecting messages here. Windows starts populating the System event log.
  6. Userspace init and login
    Windows starts the Session Manager (smss.exe), then csrss.exe and winlogon.exe. Linux starts PID 1 (systemd on modern distros). The Windows Security event log starts recording 4624 logon events. From this point the running OS owns the disk.

Secure Boot is the firmware-level signature check on the bootloader. It is on by default on Indian-market Windows 11 OEM installs because Microsoft mandates it. TPM 2.0 is a discrete or firmware-resident security chip that holds keys and measurements (PCRs, Platform Configuration Registers) that record the hash of every boot component. BitLocker on Windows binds the volume master key to specific PCR values, which is why a tampered bootloader breaks BitLocker auto-unlock and why TPM 2.0 status is a recordable artifact at seizure.

The 2024 CERT-In advisory on a UEFI vulnerability in a major OEM's firmware (representative of a recurring class of issues) is the kind of artifact NFSU SoCS lectures use to make the point that the boot chain itself is a forensic surface, not just the OS that sits on top of it. A compromised UEFI can survive a full disk wipe and reinstall.

MBR vs GPT, and what each tells the examiner

When an imaged disk reaches the forensic workstation, the first byte range read is the partition table. On an MBR disk, that is LBA 0: 446 bytes of boot code, then a 64-byte partition table (4 entries of 16 bytes each), then the 0x55AA signature. On a GPT disk, LBA 0 holds a protective MBR (a single dummy entry covering the disk) for backwards compatibility, then LBA 1 holds the GPT header, and a backup GPT sits at the end of the disk.

PropertyMBRGPT
Year established1983 (IBM PC DOS 2.0)2006 (UEFI 2.0)
Disk size limit2 TiBAround 9.4 ZB
Primary partitions4 (more via extended)Around 128 by default
Partition identifier1-byte type code16-byte GUID
Header integrityNo checksumCRC32 on header and entries
Backup tableNoneMandatory backup at end of disk
Firmware pairingLegacy BIOS, also bootable on UEFI in CSM modeUEFI (UEFI 2.x required for boot)
Forensic recoverability of damaged tableDifficult, single copyEasier, backup at end of disk lets the examiner reconstruct

For the examiner, GPT is the friendlier table because the backup at the end of the disk lets a damaged primary be reconstructed. MBR's single copy means a deliberate or accidental overwrite of LBA 0 can wipe the partition layout entirely, and the examiner has to fall back on file-carving (covered in Data Recovery and File Carving).

A common reference question: "A seized 1 TB SSD shows no partitions in Autopsy. The hex viewer shows LBA 0 is zeroed out for the first 512 bytes. What is the immediate next step?" The expected answer is to inspect the end of the disk for a GPT backup header (the disk may have been GPT, with only the primary header wiped); if the disk was MBR, fall back to carving from signatures.

File systems in depth: FAT, exFAT, NTFS, ext, APFS

Each major file system has a distinct on-disk structure, journal mechanism, and deletion behavior. These differences determine which tools apply, what data is recoverable, and what can be presented as evidence.

File-system layout comparison across four major file systems. Each row shows the on-disk structure: boot sector or MFT locati
File-system layout comparison across four major file systems. Each row shows the on-disk structure: boot sector or MFT location, the metadata structure, and the data cluster area. The platform and journal method for each is annotated at the right.

FAT12 / FAT16 / FAT32

The original DOS file system, still used on small USB sticks, SD cards, and the EFI System Partition on every UEFI Windows install. The structure is a Boot Sector, two copies of the File Allocation Table, the Root Directory (on FAT12 / FAT16) or a regular cluster chain (on FAT32), and the data area. Deleted files are marked by replacing the first character of the directory entry with 0xE5; the rest of the entry, including the starting cluster, is preserved. This is why FAT is the textbook "easy to recover" file system: the directory entry tells the examiner where the file was, and the data clusters are often intact until reused.

exFAT

Microsoft's extension of FAT for large flash media (SDXC cards, large USB drives). Same general structure as FAT32, no journaling, supports files over 4 GiB. exFAT is the default file system for SDXC cards. From a forensic recovery standpoint it behaves like FAT.

NTFS

The Windows production file system since Windows NT 3.1. Built around a single special file, the Master File Table ($MFT), which holds one record per file on the volume. Each MFT record is 1024 bytes by default, and contains a set of typed attributes: $STANDARD_INFORMATION (timestamps, attributes), $FILE_NAME (name, parent reference, a second copy of timestamps), $DATA (the file content, either resident in the MFT record for small files or as cluster runs), $INDEX_ROOT and $INDEX_ALLOCATION (for directories, organised as B-trees). The journal lives in $LogFile. The $MFT is also self-describing: record 0 of the MFT is itself, record 5 is the root directory, record 6 is the bitmap of which records are in use, record 8 is the bad-cluster file. An examiner can reconstruct most of an NTFS volume from $MFT alone if the data clusters are intact.

ext2 / ext3 / ext4

The Linux production file systems. ext2 was the original, ext3 added journaling, ext4 added extents, the htree directory index, larger files and volumes, and more accurate timestamps. The structure is built around inodes: every file is one inode, and the inode holds the file's metadata and a list of extents (in ext4) or block pointers (in ext2/3). The journal lives in a reserved inode (typically inode 8). On deletion, the inode is cleared, but the directory entry may still hold the inode number for a window, and the journal often preserves the inode contents long enough for extundelete, ext4magic or the debugfs lsdel command to find it.

HFS+ and APFS

HFS+ was Mac's file system from 1998 until 2017. APFS replaced it across iOS, iPadOS, watchOS and macOS High Sierra and later. APFS is copy-on-write: every write to a file is to a new extent, and the metadata is updated to point at the new extent, leaving the old extent in place until garbage collection. APFS snapshots are a first-class feature: a snapshot freezes the metadata at a point in time and the snapshot can be mounted read-only later. Time Machine on modern macOS uses APFS snapshots. For the examiner this means a freshly imaged APFS volume often contains several days of past file-system state in snapshots, and a "deleted" file may be readable from a snapshot even after the user has emptied Trash.

File systemDefault onJournal / COWDeletion artifactForensic recovery surface
FAT32 / exFATUSB sticks, SD cards, EFI System PartitionNoneDirectory entry first byte 0xE5, rest preservedDirectory entry + data clusters until reuse
NTFSWindows C:$LogFile journalMFT record marked unused, $FILE_NAME and $DATA usually intact$MFT, $LogFile, $UsnJrnl, Volume Shadow Copy
ext4Linux /Dedicated journal inode 8Inode cleared, dirent may persist brieflyJournal replay (extundelete, ext4magic), block scanning
APFSmacOS, iOSCopy-on-write, no classical journalOld extents survive until garbage collection; snapshots may hold prior stateAPFS snapshots, Time Machine local snapshots, container metadata
HFS+Pre-2017 macOS, legacy iPodsOptional journalCatalog file entry marked unusedCatalog file + journal replay

File allocation, slack, and metadata

File allocation on every modern OS happens in clusters (NTFS, FAT) or blocks (ext, APFS). A cluster is a fixed group of sectors. On a 4 KiB-cluster NTFS volume, a 100-byte file occupies one cluster of 4096 bytes, of which only the first 100 are the file's contents. The remaining 3996 bytes are slack, and they hold whatever was in that cluster the last time it was used.

Slack space has three named regions the examiner must distinguish:

  • File slack. The unused space from the end of the file to the end of the last cluster. On a 4 KiB cluster volume holding a 5000-byte file, the file uses two clusters (8192 bytes); the last 3192 bytes are file slack.
  • RAM slack. The portion of file slack from the end of the file to the end of the sector. The OS pads writes to the sector boundary with bytes from RAM (historically), which means RAM slack can hold data the kernel had recently touched. On modern Windows, RAM slack is typically zeroed, but legacy images still show meaningful content.
  • Drive slack. The portion of file slack from the end of the sector to the end of the cluster. Drive slack contains whatever was previously on those sectors and is the most common location for older deleted-file fragments.

The forensic implication: a file system that reports a 100-byte file is sitting on, in the case of a 4 KiB cluster, just under 4 KiB of potentially evidentiary slack. Multiplied across a 1 TB drive with millions of files, slack is a major recoverable surface, and Autopsy, FTK and EnCase all expose slack space as a first-class data source.

Metadata: timestamps, attributes, permissions

Every file carries metadata, and the metadata is what an examiner reasons about for timeline questions.

  • MAC timestamps. On POSIX (Linux, macOS, BSD) the standard set is mtime (last modify), atime (last access), ctime (last metadata change). On NTFS the set is Modified, Accessed, Created (also called "MFT-Modified" for the fourth, internal time). APFS adds a "first added" time on top of POSIX.
  • File attributes. System, Hidden, Read-Only, Archive on NTFS and FAT (the legacy DOS attribute set). On NTFS there is also Encrypted, Compressed, and reparse-point.
  • Permissions. POSIX permissions on ext and APFS (owner / group / other read / write / execute, plus setuid / setgid / sticky). NTFS uses Access Control Lists, where each ACL entry binds a SID to an allow-or-deny mask covering read, write, execute, modify, delete, take ownership and more.
  • Extended attributes. xattrs on Linux, ADS (Alternate Data Streams) on NTFS, finder info on HFS+ and APFS. Each is a place where data can hide outside the visible file contents.

Hidden data zones: slack, swap, hibernation, Recycle Bin, shadow copies

The hidden data zones on a typical Windows workstation, in the order an Indian examiner usually checks them:

  • $Recycle.Bin. Per-user directory at the volume root. Each deleted item is a paired $I file (metadata: original path, deletion timestamp, original size) and $R file (contents). Until the user empties the Recycle Bin, the file is fully intact under a renamed path.
  • Volume Shadow Copy Service (VSS). Windows' snapshot system. By default, Windows creates shadow copies before major updates and on a schedule. Mounting a shadow copy as a virtual volume often reveals files the current state has overwritten. vssadmin list shadows enumerates them on a live system; on a dead image, Autopsy and libvshadow expose them.
  • Page file (pagefile.sys). The on-disk overflow for RAM. Holds pages the OS swapped out. The page file can contain fragments of process memory, including decrypted document content, browser sessions and chat data, that never made it to a deliberately written file. Examined with bulk-extractor and grep across the raw image.
  • Hibernation file (hiberfil.sys). A compressed image of RAM written when the machine hibernates. Bigger than the page file and contains the kernel state at the moment of hibernation. Volatility can parse hiberfil.sys as if it were a RAM dump.
  • File and drive slack. As covered in Section 5.
  • Alternate Data Streams (ADS). NTFS feature that lets a file carry a named secondary data stream (file.txt:hidden.exe). Common anti-forensic location for malware. Autopsy and EnCase enumerate ADS by default.

On Linux, the equivalent set is the swap partition (raw block device, often /dev/sda2 or a swapfile, parseable with strings and bulk-extractor), /tmp and /var/tmp (often cleared at boot, but not always), ~/.Trash or ~/.local/share/Trash, journal blocks in ext4, and snapper or timeshift snapshots on btrfs systems.

On macOS, the relevant zones are the swap files in /private/var/vm/, the ~/.Trash, the APFS snapshots accessible via tmutil listlocalsnapshots, and the Keychain (which is itself an SQLite database with a binary-format envelope).

For deep recovery from these zones, the next topic to read is Data Recovery, File Carving and Deleted / Encrypted Content, which covers the tools and the signature databases. The courtroom side of presenting recovered data is covered in Bharatiya Sakshya Adhiniyam: Forensic Evidence in Court.

Practice
Question 1 of 5· 0 answered

On a Windows 11 NTFS volume, a 100-byte text file is created on an 8 KiB-cluster volume. What is the total amount of slack space associated with this file?

Frequently asked questions

What is the difference between MBR and GPT, and which one will an examiner see in 2026?
MBR is the legacy partition table at LBA 0, limited to four primary partitions and 2 TiB. GPT is the UEFI-era replacement, with around 128 partitions, GUID identifiers, CRC checksums on its headers, and a mandatory backup table at the end of the disk. In 2026, every new Indian-market Windows 11 OEM laptop ships GPT. MBR still appears on legacy government desktops, on small USB sticks, and on installer media in legacy BIOS mode.
What does the NTFS Master File Table actually contain?
The $MFT holds one 1024-byte record per file on the volume. Each record contains a set of typed attributes: $STANDARD_INFORMATION (timestamps, file attributes), $FILE_NAME (name and parent reference), $DATA (the file's content, either resident in the record for small files or as cluster runs), and $INDEX_ROOT / $INDEX_ALLOCATION for directories. The $MFT also describes itself: record 0 is the $MFT, record 5 is the root directory. An examiner can reconstruct most of an NTFS volume from $MFT alone.
What are the three kinds of slack space and why do they matter forensically?
File slack is the unused space from the end of the file to the end of its last cluster. RAM slack is the portion from the end of the file to the end of the sector, historically padded with RAM contents. Drive slack is the portion from the end of the sector to the end of the cluster, and holds whatever was previously stored there. All three are recoverable surfaces; Autopsy, FTK and EnCase expose them as first-class data sources, and they routinely yield deleted-file fragments that the user had no idea were still readable.
Why does APFS make some forensic tasks easier and others harder than HFS+?
Easier because APFS snapshots and Time Machine local snapshots preserve point-in-time copies of the file system, so a recently 'deleted' file is often readable from a snapshot. Harder because APFS is copy-on-write and uses a complex container model with multiple volumes sharing free space, which means traditional file-carving and inode-walking approaches need APFS-aware tools. Magnet AXIOM, Cellebrite Inspector and the open-source apfs-fuse are the working set in 2026.
How does Secure Boot affect a forensic examination?
Secure Boot is the UEFI feature that verifies the bootloader signature against keys in the firmware key store before handing over control. For the examiner it has two implications. First, it constrains the live-boot forensic tools that will run on the seized machine: an unsigned forensic Linux USB will be refused at boot unless Secure Boot is disabled, which is itself a recorded change. Second, the Secure Boot keys and the recorded PCR measurements in the TPM are themselves artifacts: a system whose Secure Boot was disabled and re-enabled leaves a trail.
What is the practical difference between atime on Linux and last-access on Windows NTFS?
On Linux, atime is updated every time a file is read, unless the volume is mounted with noatime or relatime. On Windows NTFS since Vista, NTFS last-access updates are disabled by default for performance, controlled by the NtfsDisableLastAccessUpdate registry value. An examiner who treats 'last accessed' as a uniform field across the two systems will draw the wrong conclusions. Always note the OS and its mount or registry configuration before reasoning about atime.
Which Indian case is the leading authority on the admissibility of computer-generated records, and how does it apply to a file-system image?
Anvar P V v P K Basheer (2014) is the foundational Supreme Court authority. It held that electronic records require a certificate under what was Section 65B of the Evidence Act, now Section 63 of the Bharatiya Sakshya Adhiniyam 2023. A file-system image is an electronic record and the Section 63 certificate has to describe the imaging tool, the hash algorithm, the resulting digest, and the operator's identity. Arjun Panditrao Khotkar v Kailash Kushanrao Gorantyal (2020) is the follow-on that clarified when the certificate must be produced. The courtroom side is covered at /topics/crime-scene-management/bharatiya-sakshya-adhiniyam-forensic-evidence-in-court.

Test yourself on Digital Forensics with free, timed mocks.

Practice Digital Forensics questions

Found this useful? Pass it along.

Share

Spotted an error in this page? Report a correction or read our editorial standards.

Your journey to becoming a forensic professional starts here.

Practice with mock tests, learn from structured notes, and get your questions answered by a global forensic community, all in one place.