T The Triage ManualTechnical Guides for IT Emergencies
v1.0 · 12 domains · 173 entries · Updated 2026-05-30

Diagnostic playbooks for IT engineers, written like an engineer briefing a colleague at 2am.

Every entry is sourced from real incidents, grounded with verbatim source spans, and reviewed by a senior engineer before it gets published. No filler, no AI fluff, no untraceable claims.

Domains

22 entries

Active Directory

Domain controller recovery, FSMO seizure, Kerberos and secure-channel failures, replication faults, GPO, SYSVOL/DFSR — the backbone of Windows identity infrast…

7 entries

Exchange & Mail Flow

Exchange Online and on-premises mail flow failures, mailbox database recovery, federation trust breaks, hybrid mail routing, and outbound delivery disruptions.

7 entries

Microsoft 365 & Collaboration

Entra ID / Conditional Access lockouts, Azure AD Connect sync failures, Teams connectivity, Intune enrollment, and Microsoft 365 Backup restore operations.

8 entries

Virtualisation & Storage

Hyper-V host crashes and VM recovery, VMware ESXi PSOD and host disconnection, RAID array degradation, and storage subsystem performance collapse.

21 entries

Windows Server

Windows Server boot failures, OS performance degradation, NTFS permissions, RDS CAL exhaustion, licensing and activation, cumulative update failures, in-place…

15 entries

Network Infrastructure

Switching, VLAN misconfiguration, STP storms, DNS failures, Wi-Fi client drops, router/gateway loss, WAN circuit outages, and firewall policy issues.

11 entries

Remote Access & VPN

Site-to-site and remote-access VPN failures — IPSec SA negotiation, split tunnelling, routing conflicts, overlapping subnets, OpenVPN throughput, and Cisco ASA…

15 entries

Cyber Incident Response

Ransomware containment, breach triage, BEC, credential compromise, forensic preservation, and nation-state TTPs — the first hours decide outcomes and notificat…

7 entries

Backup & Recovery

Veeam / Datto / BCDR appliance failures, restore verification, bare-metal recovery, item-level restores, and backup job certificate mismatches.

38 entries

Endpoint & Device Management

Intune and MECM policy failures, Windows Update and WSUS patch deployment, Autopilot provisioning, BitLocker/encryption issues, and device compliance remediati…

9 entries

Cloud & Hybrid Infrastructure

Azure IaaS VM failures, Site Recovery health, hybrid VPN Gateway and ExpressRoute outages, Azure Backup, AWS EC2 connectivity, and Kubernetes control-plane cer…

13 entries

PKI & Certificate Management

Expired TLS/SSL certificates, ADCS enrollment failures, certificate chain trust, OCSP/CRL unreachability, NDES/SCEP for mobile, code-signing blocks, and Let's…

Severity legend