T The Triage ManualTechnical Guides for IT Emergencies
P1 · Cloud & Hybrid Infrastructure

Azure VM Unresponsive or Failed to Start — Allocation or Boot Failure

An Azure virtual machine is unreachable via RDP/SSH, stuck in a failed or deallocated state, or refusing to boot due to OS-level errors. Resolution path depends on whether the failure is an Azure allocation constraint, OS corruption, or network/NSG misconfiguration.

Indicators

Likely causes

Diagnostic steps

  1. Azure portal > VM > Activity log — identify exact error code, timestamp and initiating principal
  2. Enable and check Boot Diagnostics: VM > Diagnostics > Boot diagnostics > view screenshot and serial console output
  3. If allocation failure: fully stop/deallocate the VM (not just restart), then start again — forces reallocation to an available host
  4. If OS boot failure: use Azure VM Repair: az vm repair create -g <rg> -n <vm> --repair-username admin — attaches OS disk to a recovery VM for offline repair
  5. Verify NSG inbound rules allow port 3389 (RDP) or 22 (SSH) on both the VM NIC and subnet NSG; check Effective Security Rules
  6. Check Azure Service Health for region-specific incidents: portal.azure.com > Service Health > Health alerts

Resolution path

Prevention

Tools

azureazure-vmiaasallocationboot-failurecloudrdp