•
RAID 6 con
fi
gurations can tolerate simultaneous failure of two hard drives in the array.
Compromised fault tolerance
Each RAID con
fi
guration has inherent limitations on the number of physical hard drive failures that it can
tolerate. If more hard drives fail than the fault-tolerance method allows, fault tolerance is compromised.
When the MSA determines that the fault tolerance of a LUN is compromised, the LUN is taken of
fl
ine
and subsequent I/O requests are rejected. This is designed to protect the integrity of the LUN, but does
require manual intervention to recover or re-enable the LUN. You are likely to lose data, although it
can sometimes be recovered.
Common causes of compromised fault tolerance include:
•
More hard drives fail than the LUN can tolerate.
For example, in a RAID 5 array, if a hard drive in an array fails while another drive in the array is
being rebuilt. If the array has no online spare, any logical drives in this array that are con
fi
gured with
RAID 5 fault tolerance will fail.
•
A SCSI cable could be broken or disconnected.
•
A temporary loss of power.
For example, if both power supplies are inappropriately connected to the same power source and that
power source it interrupted, fault tolerance may be compromised.
Recovering from compromised fault tolerance (enabling failed
LUNs)
If fault tolerance is compromised, inserting replacement hard drives does not improve the condition of the
logical unit. The procedure to re-enable or accept a LUN that is unresponsive is performed in the Array
Con
fi
guration Utility (ACU) or the MSA Command Line Interface (MSA-CLI).
1.
Stop all I/O activity.
2.
Turn off the system as described in
Removing power from the MSA
.
3.
Check for loose, dirty, broken, or bent cabling and connectors on all devices.
4.
Remove and then reinsert all hard drives and controllers.
CAUTION:
Data can be lost if the hard drives are not
fi
rmly reseated.
5.
Turn the system on as described in
Applying power to the MSA
.
NOTE:
In some cases, a marginal hard drive might work again for long enough to allow you
to make copies of important
fi
les.
6.
If using the MSA LCD panel:
a.
If one of the following messages are displayed on the MSA array controller LCD front panel, an
issue was found with one or more con
fi
gured LUNs that may result in data loss, so all of the
hard drives in the LUNs have been disabled. Press the right push button to re-enable the LUNs.
02 ENABLE VOLUME <n>?
'<'=NO, '>'=YES
04 ENABLE VOLUMES ? '<'=NO, '>'=YES
98
Hard drive failures and faulted LUNs
Summary of Contents for AD510A - StorageWorks Modular Smart Array 1500 cs 2U Fibre Channel SAN Attach Controller Shelf Hard Drive
Page 8: ...8 ...
Page 12: ...12 About this guide ...
Page 18: ...18 Specifications ...
Page 60: ...60 LCD panel and message descriptions ...
Page 96: ...96 Capacity expansion and extension ...
Page 102: ...102 Hard drive failures and faulted LUNs ...
Page 108: ...108 SCSI hard drive firmware ...