21
Table 5-1 mcemonitor command
Item name
Description
Per page status corrected error over threshold:
Shows result of Memory Page Offline.
100000: offline-failed
Indicates that offlining failed for 0x10000 page of
memory address.
10000000: offline
Indicates 0x10000 page of memory address was
offlined.
Per page status uncorrected error:
Show Memory Page that uncorrected error occurred.
CPU errors
Shows CPU fault information.
CPUx/corey
Shows fault information of CPU core.
CPU x
:
Indicates physical CPU socket number (x).
corey
:
Indicates CPU core number (y).
corrected errors:
Shows number of occurrence of correctable errors.
x total
Indicates that errors occurred x times.
uncorrected errors:
Shows number of occurrence of uncorrectable errors.
CPU1/uncore
Fault information of CPU Uncore.
Per CPU status corrected error over threshold:
Shows result of CPU Offline.
/sys/devices/system/cpu5 offline-failed
Indicates that offlining logical processor 5 failed.
/sys/devices/system/cpu15 offline
Indicates that offlining logical processor 5 succeeded.
/sys/devices/system/cpu16 online
Indicates that offlined logical processor 16 is returned
to online by user.
Note: This CPU was made offline due to failure. Do not
make it online.