1-12
Sun Netra X4450 Server Service Manual • August 2008
ILOM enables you to remotely run diagnostics that would otherwise require
physical proximity to the server’s serial port. You can also configure ILOM to send
email alerts of hardware failures, hardware warnings, and other events related to the
server or to ILOM.
Faults detected by ILOM, POST, and the Solaris Predictive Self Healing (PSH)
technology are forwarded to ILOM for fault handling. In the event of a system fault,
ILOM ensures that the Fault Indicator is lit, the FRU ID PROMs are updated, the
fault is logged, and alerts are displayed (faulty FRUs are identified in fault messages
using the FRU name).
The service processor detects when a fault is no longer present and clears the fault in
several ways:
■
Fault recovery – The system automatically detects that the fault condition is no
longer present. ILOM extinguishes the Service Required Indicator and updates
the FRU’s PROM, indicating that the fault is no longer present.
■
Fault repair – The fault has been repaired by human intervention. In most cases,
the service processor detects the repair and extinguishes the Service Required
Indicator. If the service processor does not perform these actions, you must
perform these tasks manually.
The service processor also detects the removal of a FRU, in many cases even if the
FRU is removed while the service processor is powered off (that is, if the system
power cables are unplugged during service procedures). This situation enables
ILOM to know that a fault, diagnosed to a specific FRU, has been repaired.
Note –
ILOM does not automatically detect hard drive replacement.
Many environmental faults can automatically recover. A temperature that is
exceeding a threshold might return to normal limits. An unplugged power supply
can be plugged in, and so on. Recovery of environmental faults is automatically
detected. Recovery events are reported using one of two forms:
■
fru
at
location
is OK.
■
sensor
at
location
is within normal range.
Environmental faults can be repaired through hot-removal of the faulty FRU. FRU
removal is automatically detected by the environmental monitoring, and all faults
associated with the removed FRU are cleared. The message for that case, and the
alert sent for all FRU removals is:
fru
at
location
has been removed.
There is no ILOM command to manually repair an environmental fault.