• When RAID 6 (ADG) is used, two drives can fail simultaneously (and be replaced simultaneously)
without data loss.
• If the of
fl
ine drive is a spare, the degraded drive can be replaced.
•
Do not remove a failed second hard drive from an array until the
fi
rst failed or missing hard drive has
been replaced and the rebuild process is complete. (When the rebuild is complete, the online LED
on the front of the hard drive stops blinking.)
Exceptions:
• In RAID 6 (ADG) con
fi
gurations, any two drives in the array can be replaced simultaneously.
• In RAID1+0 con
fi
gurations, any drives that are not mirrored to other removed or failed drives can be
simultaneously replaced of
fl
ine without data loss.
Automatic data recovery (rebuild)
When you replace a hard drive in an array, the controller uses the fault-tolerance information on the
remaining drives in the array to reconstruct the missing data (the data that was originally on the replaced
drive) and write it to the replacement drive. This process is called automatic data recovery, or rebuild. If
fault tolerance is compromised, this data cannot be reconstructed and is likely to be permanently lost.
If another hard drive in the array fails while fault tolerance is unavailable during rebuild, a fatal system
error may occur, and all data on the array is then lost. In exceptional cases, however, failure of another
drive need not lead to a fatal system error. These exceptions include:
•
Failure after activation of a spare drive.
•
Failure of a drive that is not mirrored to any other failed drives (in a RAID 1+0 con
fi
guration).
•
Failure of a second drive in a RAID ADG con
fi
guration.
Time required for a rebuild
The time required for a rebuild varies considerably, depending on several factors:
•
The priority that the rebuild is given over normal I/O operations (you can change the priority setting
through the Array Con
fi
guration Utility (ACU) or MSA Command Line Interface (MSA-CLI).
•
The amount of I/O activity during the rebuild operation
•
The rotational speed of the hard drives
•
The availability of drive cache
•
The brand, model, and age of the drives
•
The amount of unused capacity on the drives
•
The number of drives in the array (for RAID 5 and RAID ADG)
Allow approximately 15 minutes per gigabyte for the rebuild process to be completed. This
fi
gure is
conservative, and newer drive models usually require less time to rebuild.
System performance is affected during the rebuild, and the system is unprotected against further drive
failure until the rebuild has
fi
nished. Therefore, replace drives during periods of low activity when possible.
CAUTION:
If the Online LED of the replacement drive stops blinking and the amber Fault LED glows, or if other drive
LEDs in the array go out, the replacement drive has failed and is producing unrecoverable disk errors.
Remove and replace the failed replacement drive.
When automatic data recovery has
fi
nished, the Online LED of the replacement drive stops blinking
and begins to glow steadily.
If ADR process aborts, restart the storage system and allow ADR to begin again. If ADR fails again,
back up all data on the system, do a surface analysis (using your diagnostics utility), and restore the
data from backup.
100
Hard drive failures and faulted LUNs
Summary of Contents for AD510A - StorageWorks Modular Smart Array 1500 cs 2U Fibre Channel SAN Attach Controller Shelf Hard Drive
Page 8: ...8 ...
Page 12: ...12 About this guide ...
Page 18: ...18 Specifications ...
Page 60: ...60 LCD panel and message descriptions ...
Page 96: ...96 Capacity expansion and extension ...
Page 102: ...102 Hard drive failures and faulted LUNs ...
Page 108: ...108 SCSI hard drive firmware ...