1-2
Sun Netra X4450 Server Service Manual • August 2008
1.2
Server Diagnostics Overview
There are a variety of diagnostic tools, commands, and indicators you can use to
monitor and troubleshoot a server:
■
Status indicators
– These indicators provide a quick visual notification of the
status of the server and of some of the FRUs.
■
Fault management architecture
– FMA provides simplified fault diagnostics
through use of the
/var/adm/messages
file, the
fmdump
command, and a Sun
Microsystems web site.
■
ILOM firmware
– This system firmware runs on the service processor. In
addition to providing the interface between the hardware and OS, ILOM also
tracks and reports the health of key server components. ILOM works closely with
POST and Solaris™ Predictive Self Healing technology to keep the system up and
running even when there is a faulty component.
■
Power-on self-test (POST)
– POST performs diagnostics on system components
upon system reset to ensure the integrity of those components. POST is
configurable and works with ILOM to take faulty components offline if needed.
■
Solaris Predictive Self-Healing (PSH)
– This technology continuously monitors
the health of the CPU and memory, and works with ILOM to take a faulty
component offline if needed. The Predictive Self Healing technology enables
Sun™ systems to accurately predict component failures and mitigate many
serious problems before they occur.
■
Log files and console messages
– These provide the standard Solaris Operating
System (OS) log files and investigative commands that can be accessed and
displayed on the device of your choice.
The LEDs, ILOM, Solaris PSH, and many of the log files and console messages are
integrated. For example, a fault detected by the Solaris software will display the
fault, log it, pass information to ILOM where it is logged, and, depending on the
fault, might light one or more LEDs.