This soft copy for use by IBM employees only.
The format of this file is considerably different between the SP Switch
and the High Performance Switch. This in part reflects the difference
between the two switches in the way they detect and handle switch
faults.
dtbx.trace
This provides traces from switch diagnostics.
dtbx_failed.trace
This is created if any of the switch diagnostics fail. It is basically the
same as the dtbx.trace file, with the addition of error messages. If the
diagnostics run clean, this file will
not be created.
daemon.stderr
General error messages. For example, if some nodes could not get
initialized when the
Estart
was issued, you may see the following:
Processing Estart. The following node(s) could not be initialized:
sp21n03
In this case the node in question (sp21n03) was not initialized because its
Worm daemon was not running.
daemon.stdout
This log is a detailed account of the switch initialization process. Most
switch problems do not require analysis. However, in some
circumstances it may prove useful to diagnose the problem.
There are also many normal, informational switch messages that you will
see in the error report. For instance, when you issue an
Estart
, you will
see a switch fault in the error report.
Expect one of each of these messages for each
Estart
command:
ERROR_ID TIMESTAMP T CL Res Name
ERROR_Description
34FFBE83 0502140496 T H
Worm
HPS Fault - detected by switch chip
C3189234 0502135796 T H
Worm
HPS Fault - not isolated
In addition to these log files, there is always a possibility that errors will be
recorded in the error report (
errpt). It is important to note the time of the failure
in order to correlate messages in the error report
errpt to the error.
For many problems, looking in the log files does not provide sufficient
information to solve the problem.
In the directory /usr/lpp/ssp/css, there are a few helpful tools, such as:
css_dump
This will format trace entries relating to the cssdd. To run the command,
issue
css_dump > /tmp/css_dump.out &
. You will find the most recent
entries at the bottom of the output file. This information is helpful in
conditions where the
css driver code hangs for unknown reasons. This
command should be run on the primary node and on any of the failing
nodes.
Chapter 4. The Switch
115
Summary of Contents for RS/6000 SP
Page 2: ......
Page 14: ...This soft copy for use by IBM employees only xii SP PD Guide...
Page 16: ...This soft copy for use by IBM employees only xiv SP PD Guide...
Page 106: ...This soft copy for use by IBM employees only 86 SP PD Guide...
Page 178: ...This soft copy for use by IBM employees only 158 SP PD Guide...
Page 214: ...This soft copy for use by IBM employees only 194 SP PD Guide...
Page 248: ...This soft copy for use by IBM employees only 228 SP PD Guide...
Page 290: ...This soft copy for use by IBM employees only 270 SP PD Guide...
Page 292: ...This soft copy for use by IBM employees only 272 SP PD Guide...
Page 300: ...This soft copy for use by IBM employees only 280 SP PD Guide...
Page 304: ...This soft copy for use by IBM employees only 284 SP PD Guide...
Page 308: ...This soft copy for use by IBM employees only 288 SP PD Guide...
Page 310: ...This soft copy for use by IBM employees only 290 SP PD Guide...
Page 316: ...IBML This soft copy for use by IBM employees only Printed in U S A SG24 4778 00...