Processor Installation Order
For a minimally loaded server blade, one processor module must be installed in processor slot 0.
See
Figure 17 (page 34)
for the processor slots. Install a processor of the same version into processor
slot 1, then 2, and then 3 (if purchased).
Processor Module Behaviors
All physical processors become functional after server power is applied. Each processor is in a
race to fetch their instructions from their processor instruction and data caches to complete early
self test and rendezvous.
It is the processor cache controller logic that issues cache line fetches from PDH/physical shared
memory, when a requested cache line is not within its instruction or data cache. Cache line fetches
are transferred over the McKinley bus, between processors and PDH/physical shared memory.
Local machine check abort (MCA) events cause one IPF processor module to fail, while the other
IPF processor module continues operating. Double-bit data cache errors in any physical processor
core causes a Global MCA event, that causes all IPF processor modules to fail and reboot the
operating system.
Customer Messaging Policy
No diagnostic messages are reported for single-bit errors, that are corrected in both instruction
and data caches, during corrected machine check (CMC) events to any physical processor core.
Diagnostic messages are reported for CMC events, when thresholds are exceeded for single-bit
errors; fatal processor errors cause global/local MCA events.
Troubleshooting Memory
The memory controller logic in the zx2 chip supports two physical ranks, that hold four DIMMs
each.
DIMMs installed in groups of four are known as a quads, and must be the same size and
configuration.
DIMM Installation Order
For a minimally loaded server, four equal-size memory DIMMs must be installed into rank 0 slots
0A through 0D. The next four DIMMs are loaded into slots 1A through 1D, then 2A through 2D,
then 3A through 3D, then 4A through 4D, then 5A through 5D.
Memory Subsystem Behaviors
All server blades with zx2 chips provide error detection and correction of all memory DIMM
single-bit errors, and error detection of most multibit errors within a 128 byte cache line.
The zx1 chip provides memory DIMM error correction for up to 4 bytes of a 128 byte cache line,
during cache line misses initiated by processor cache controllers, and by Direct Memory Access
(DMA) operations, initiated by I/O devices. This feature is called chip sparing, as 1 of 72 total
DRAMs in any memory pair can fail without any loss of server blade performance.
Customer Messaging Policy
PDT logs for all double bit errors are permanent; single bit errors are initially logged as transient
errors. If the server logs two single bit errors within 24 hours, then it upgrades them to permanent
status in the PDT.
Troubleshooting System Bus Adapter
Each server blade’s system bus adapter (SBA) supports core I/O, SAS, LAN, and FibreChannel
functions. The System Bus Adapter (SBA) logic within the zx2 chip of a server blade uses 6 of 8
106 Troubleshooting