T The Triage ManualTechnical Guides for IT Emergencies
P1 · Virtualisation & Storage

RAID array degraded or failed

One or more disks have failed in a RAID set; array is degraded, rebuilding, or offline. Goal: avoid a second-disk failure during rebuild.

Indicators

Likely causes

Diagnostic steps

  1. Identify controller and array state via vendor tool (perccli, ssacli, storcli) — never reseat or replace blindly
  2. Verify backup currency BEFORE touching the array — last successful job timestamp, restore-test status
  3. Pull SMART/predictive-failure data on every member disk; refuse to rebuild onto a disk reporting reallocated sectors
  4. Reduce I/O load during rebuild — pause non-critical VMs, postpone backup jobs
  5. If array is offline due to multi-disk failure: STOP. Do not initialise, do not 'force online' without a copy / image of every member disk
  6. For unrecoverable arrays — engage data-recovery specialist with raw disk images, not the live disks

Resolution path

Prevention

Tools

References

raidstoragediskcontrollerbackup