
Recognizing and recovering from hard drive failures and
faulted LUNs
The purpose of fault-tolerant array con
fi
gurations is to protect against data loss due to hard drive failure.
Each RAID con
fi
guration has inherent limitations on the number of hard drive failures that it can tolerate.
If the fault-tolerance level of a particular LUN or array con
fi
guration is exceeded, the array will be locked
from any further I/O. This protection is designed to preserve the integrity of the local drive, but does
require manual intervention to recover or re-enable the LUN.
Although controller
fi
rmware is designed to protect against normal hard drive failure, it is imperative that
you perform the correct actions to recover from a hard drive failure without inadvertently introducing any
additional hard drive failures.
Included sections:
• Recognizing hard drive failure
• Compromised fault tolerance
• Recovering from compromised fault tolerance (enabling failed LUNs)
• Automatic data recovery (rebuild)
• Replacing a hard drive
Recognizing hard drive failure
LEDs on the front of each hard drive are visible from the front of the external storage unit. When a hard
drive is con
fi
gured as a part of an array and attached to a powered-on controller, the status of the hard
drive can be determined from the illumination pattern of these LEDs.
For detailed descriptions of the various LED combinations, see
Hard drive LEDs
.
Other ways to determine that a hard drive has failed include the following:
•
LEDs on the storage system chassis illuminate amber if failed hard drives are inside. (However, this
LED also illuminates when other problems occur, such as when a fan or a redundant power supply
fails, or when the system overheats.)
•
LEDs on the hard drives illuminate amber if a hard drive has failed or is a member of a faulted LUN.
•
Front-panel LCD display messages list faulted LUNs and failed hard drives whenever the system is
restarted, as long as the controller detects one or more good hard drives.
•
The ACU and SMU represent faulted LUNs and failed drives with distinctive icons.
•
HP-SIM can detect failed hard drives.
•
ADU lists all failed hard drives.
For more information on troubleshooting hard drive problems, see the
HP ProLiant servers troubleshooting
guide
.
Effects of hard drive failure
When a hard drive fails, all logical drives that are in the same array are affected. Each logical drive in an
array may be using a different fault-tolerance method, so each logical drive can be affected differently.
•
RAID 0 con
fi
gurations cannot tolerate hard drive failure. If any physical hard drive in the array
fails, all non-fault-tolerant (RAID 0) LUNs in the same array also are failed.
•
RAID 1 and RAID 1+0 con
fi
gurations can tolerate multiple hard drive failures, as long as none of
the failed hard drives are mirrored to one another.
•
RAID 5 con
fi
gurations can tolerate one hard drive failure.
•
RAID 6 con
fi
gurations can tolerate simultaneous failure of two hard drives in the array.
1510i Modular Smart Array installation and user guide
93
Содержание StorageWorks 1510i - Modular Smart Array
Страница 8: ...8 ...
Страница 58: ...58 Installation ...
Страница 76: ...76 Configuration ...
Страница 104: ...104 Operation and management ...
Страница 140: ...140 Regulatory compliance and safety ...
Страница 152: ...152 MSA1510i worksheets ...