DIMM Fault Handling
DIMM Fault Handling
A variety of features play a role in how the memory subsystem is configured and how memory
faults are handled. Understanding the underlying features helps you identify and repair memory
problems.
The following server features manage memory faults:
■
POST
– By default, POST runs when the server is powered on.
For CEs, POST forwards the error to the PSH daemon for error handling. If an
uncorrectable memory fault is detected, POST displays the fault with the device name of the
faulty DIMMs, and logs the fault. POST then disables the faulty DIMMs. Depending on the
memory configuration and the location of the faulty DIMM, POST disables half of physical
memory in the server, or half the physical memory and half the processor threads. When
this offlining process occurs in normal operation, you must replace the faulty DIMMs based
on the fault message and enable the disabled DIMMs with the Oracle ILOM command
set
device
component_state=enabled
where
device
is the name of the DIMM being enabled.
■
PSH technology
– The Oracle PSH feature uses the Fault Manager daemon (
fmd
) to watch
for various kinds of faults. When a fault occurs, the fault is assigned a UUID and logged.
PSH reports the fault and suggests a replacement for the DIMMs associated with the fault.
If you suspect the server has a memory problem, run the Oracle ILOM
show faulty
command.
This command lists memory faults and identifies the DIMM modules associated with the fault.
Related Information
■
■
“Understanding DIMM Configurations” on page 69
■
■
“DIMM Configuration Errors” on page 72
Identifying Faulty DIMMs
You can identify faulty DIMMs using the following methods:
■
“Determine Which DIMM Is Faulty (Oracle ILOM)” on page 75
■
“Determine Which DIMM Is Faulty (PSH)” on page 75
74
SPARC T7-4 Server Service Manual • May 2017
Содержание SPARC T7-4
Страница 1: ...SPARC T7 4 Server Service Manual Part No E54994 07 May 2017 ...
Страница 2: ......
Страница 10: ...10 SPARC T7 4 Server Service Manual May 2017 ...
Страница 12: ...12 SPARC T7 4 Server Service Manual May 2017 ...
Страница 86: ...86 SPARC T7 4 Server Service Manual May 2017 ...
Страница 98: ...98 SPARC T7 4 Server Service Manual May 2017 ...
Страница 110: ...110 SPARC T7 4 Server Service Manual May 2017 ...
Страница 124: ...124 SPARC T7 4 Server Service Manual May 2017 ...
Страница 141: ...Verify the Battery Related Information Replace the Battery on page 137 Servicing the Battery 141 ...
Страница 142: ...142 SPARC T7 4 Server Service Manual May 2017 ...
Страница 164: ...164 SPARC T7 4 Server Service Manual May 2017 ...
Страница 175: ...Remove a PCIe Card 2 Unlatch and open the PCIe card carrier top cover Servicing PCIe Cards 175 ...
Страница 192: ...192 SPARC T7 4 Server Service Manual May 2017 ...
Страница 200: ...200 SPARC T7 4 Server Service Manual May 2017 ...
Страница 208: ...208 SPARC T7 4 Server Service Manual May 2017 ...