Troubleshooting
96-30065-001 Rev. A0
DataDirect Networks EF4024 FC RAID System Setup Guide | 33
5.2.2.1
Via RAIDar
RAIDar uses health icons to indicate the status of the system and its components. Possible
states are OK, Degraded, Fault, and Unknown. RAIDar enables you to monitor the health of
the system and its components. If any component has a problem, the system health will be
Degraded, Fault, or Unknown. Use RAIDar’s GUI to further identify each component that has
a problem, and follow actions in the component Health Recommendations field to resolve
the problem.
5.2.2.2
Via CLUI
As an alternative to using RAIDar, you can run the
show system
command in the CLUI to
view the health of the system and its components. If any component has a problem, the
system health will be Degraded, Fault, or Unknown. The failed components will be listed as
Unhealthy Components. Follow the recommended actions in the component Health
Recommendation field to resolve the problem.
5.2.2.3
Monitor Event Notification
With event notification configured and enabled, you can view event logs to monitor the
health of the system and its components. If a message tells you to check whether an event
has been logged, or to view information about an event in the log, you can do so using either
RAIDar or the CLUI. Using RAIDar, you would view the event log and then click on the event
message to see details about that event. Using the CLUI, you would run the
show events
detail
command (with additional parameters to filter the output) to see the details of an
event.
5.2.2.4
View the Enclosure LEDs
You can look at the LEDs on the hardware (see
Appendix A
) to identify component status.
If a problem prevents access to either RAIDar or the CLUI, this is the only option available.
However, monitoring and management are often done at a management console using
storage management interfaces, rather than relying on line-of-sight to LEDs of racked
hardware components.
5.2.3
Performing Basic Steps
You can use any of the available options described above in performing the basic steps
comprising the fault isolation methodology.
5.2.3.1
Gather Fault Information
When a fault occurs, it is important to gather as much information as possible. Doing so will
help you determine the correct action needed to remedy the fault.
Begin by reviewing the reported fault:
• Is the fault related to an internal data path or an external data path?
• Is the fault related to a hardware component such as a disk drive module, controller
module, or power supply unit?
By isolating the fault to
one
of the components within the storage system, you will be able
to determine the necessary corrective action more quickly.