Customer Messaging Policy
•
Only light a diagnostic LED for memory DIMM errors when isolation is to a specific memory
DIMM. If any uncertainty about a specific DIMM, then point customer to the SEL for any
action and do not light the suspect DIMM CRU LED on the System Insight Display.
•
For configuration style errors, for example, no DIMMs installed in 0A and 0B, follow the
HP ProLiant policy of lighting all of the CRU LEDs on the diagnostic LED panel for all of
the DIMMs that are missing.
•
No diagnostic messages are reported for single-byte errors that are corrected in both zx2
caches and DIMMs during corrected platform error (CPE) events. Diagnostic messages are
reported for CPE events when thresholds are exceeded for both single-byte and double byte
errors; all fatal memory subsystem errors cause global MCA events.
•
PDT logs for all double byte errors are permanent; single byte errors are initially logged as
transient errors. If the server logs 2 single byte errors within 24 hours, then upgrade them
to permanent in the PDT.
Table 5-17 Memory Subsystem Events That Light System Insight Display LEDs
Notes
Source
Cause
Sample IPMI Events
Diagnostic
LED(s)
Light all DIMM
LEDs in rank 0 of
cell 0
SFW
No DIMMs installed (in
rank 0 of cell 0)
Type E0h, 208d:04d
MEM_NO_DIMMS_INSTALLED
DIMMs
Either EEPROM is
misprogrammed
SFW
A DIMM has a serial
presence detect (SPD)
Type E0h, 172d:04d
MEM_DIMM_SPD_CHECKSUM
DIMMs
or this DIMM is
incompatible
EEPROM with a bad
checksum
Memory rank is
about to fail or
WIN
Agent
This memory rank is
correcting too many
single-bit errors
Type E0h, 4652d:26d
WIN_AGT_PREDICT_MEM_FAIL
DIMMs
environmental
conditions are
causing more
errors than usual
Table 5-18 Memory Subsystem Events That May Light System Insight Display LEDs
Notes
Source
Cause
Sample IPMI Events
Diagnostic
LED(s)
The failing DIMM
rank is
deallocated
SFW
Detected that an SDRAM
is failing on the DIMM
Type E0h, 4000d:26d
MEM_CHIPSPARE_DEALLOC_RANK
DIMMs
SFW
DIMM type is not
compatible with current
DIMMs for this platform
Type E0h, 174d:26d
MEM_DIMM_TYPE_INCOMPATIBLE
DIMMs
SFW
Detected a fatal error in
DIMM serial presence
detect (SPD)
Type E0h, 173d:26d
MEM_DIMM_SPD_FATAL
DIMMs
Troubleshooting rx2660 SBA
The rx2660 server shares a common I/O backplane that supports a total of three PCIe/PCI-X slots.
The System Bus Adapter (SBA) logic within the zx2 chip of a rx2660 server uses 16 rope interfaces
to support up to eight Lower Bus Adapter (LBA) chips. Each LBA chip interfaces with the SBA
in the zx2 chip through one or multiple rope interfaces, as follows.
CPU/Memory/SBA
129