T The Triage ManualTechnical Guides for IT Emergencies
P2 · Cloud & Hybrid Infrastructure

Azure Backup Vault — Restore Failed or RPO Breach

Azure Backup restore job fails, backup job stops completing, or the Recovery Point Objective (RPO) has been breached because no recent recovery points exist. Affects Azure VMs, SQL Server in Azure, Azure Files, and on-premises workloads protected via MARS/MABS agents.

Indicators

Likely causes

Diagnostic steps

  1. Recovery Services vault > Backup jobs — find the failed job, click it, expand 'Error details' — note the error code (e.g., UserErrorVmNotInDesirableState, ExtensionFailedInVMAccessExtension)
  2. Check VM agent status: Azure portal > VM > Settings > Extensions — verify 'MicrosoftAzureRecoveryServices' or 'VMSnapshot' extension is healthy
  3. For encrypted VM restores: verify the vault's MSI has 'Key Vault Administrator' or appropriate Get/List permissions on the Key Vault
  4. Check restore target: ensure target VM size exists in target region, storage account is in same region as vault, and no resource locks block creation
  5. For MARS agent: check agent version in the vault > Backup infrastructure > confirm minimum supported version; update if outdated
  6. Verify backup policy RPO settings: vault > Backup policies — confirm schedule and retention match SLA expectations

Resolution path

Prevention

Tools

azureazure-backupbackuprecoveryrpomarsvmclouddisaster-recovery