Table 3-1 mcemonitor configuration file (core-ce-action)
Setting in mcemonitor.conf    Description
core-ce-action = soft         Collects logs and takes the CPU core offline if the CPU error
                              count exceeds the threshold value. (Default)
core-ce-action = account      Collects logs but does not take the CPU core offline even if
                              the CPU error count exceeds the threshold value.
core-ce-action = off          Neither collects logs nor takes the CPU core offline.
Table 3-2 mcemonitor configuration file (memory-ce-action)
Setting in mcemonitor.conf    Description
memory-ce-action = soft       Collects logs and takes the memory page offline if the memory
                              error count exceeds the threshold value. (Default)
                              The process running on the affected memory is moved to
                              other memory.
memory-ce-action = off        Neither collects logs nor takes the memory page offline.
The system must be rebooted after the configuration file is modified for the change to take effect.
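As a quick check before rebooting, the setting that will apply can be read back from the file. The following is a minimal sketch, not part of mcemonitor itself: the function name get_ce_action is hypothetical, the path to mcemonitor.conf must be supplied by the caller, and it assumes that the last "key = value" line wins if a key appears more than once, which this manual does not state. When the key is absent, it prints "soft", the default given in Tables 3-1 and 3-2.

```shell
#!/bin/sh
# Hypothetical helper: print the value configured for a *-ce-action key.
# $1 = path to mcemonitor.conf (supplied by the caller), $2 = key name.
get_ce_action() {
    conf_file="$1"
    key="$2"
    # Extract the value from lines of the form "key = value".
    # Assumption: if the key is repeated, the last occurrence is used.
    value=$(sed -n "s/^[[:space:]]*${key}[[:space:]]*=[[:space:]]*\([a-z]*\)[[:space:]]*\$/\1/p" "$conf_file" | tail -n 1)
    # Fall back to the documented default when the key is not set.
    echo "${value:-soft}"
}
```

For example, get_ce_action followed by the path to mcemonitor.conf and core-ce-action prints soft, account, or off.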
3.3.3 Disabling CMCI
With the RHEL 6.6 kernel 2.6.32-504.23.4.el6.x86_64, it has been reported that frequent occurrence of
CMCI (Corrected Machine Check Interrupt), which notifies the operating system of a detected
correctable error, may cause a system panic.
To change the error detection mode from "interrupt mode" to "polling mode", add
"mce=no_cmci" to the kernel line in "/boot/efi/EFI/redhat/grub.conf".
The system must be rebooted after the configuration file is modified.
title Red Hat Enterprise Linux Server (2.6.32-504.23.4.el6.x86_64)
root (hd0,0)
kernel /vmlinuz-2.6.32-504.23.4.el6.x86_64 ro root=/dev/mapper/VolGroup00-LogVol00 rd_LVM_LV=VolGroup00/LogVol00 rd_NO_LUKS nomodeset rd_NO_MD rhgb quiet crashkernel=256M KEYBOARDTYPE=pc KEYTABLE=jp106 LANG=ja_JP.UTF-8 rd_NO_DM mce=no_cmci
initrd /initramfs-2.6.32-504.23.4.el6.x86_64.img
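After the reboot, it can be useful to confirm that the running kernel actually received the option. The following is a minimal sketch: the function name has_no_cmci is hypothetical, and the optional file argument exists only so that the check can be tried against a sample file. It only verifies that "mce=no_cmci" is present on the kernel command line, not how the kernel acted on it.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the kernel command line contains mce=no_cmci.
# $1 = file to inspect (defaults to /proc/cmdline of the running kernel).
has_no_cmci() {
    cmdline_file="${1:-/proc/cmdline}"
    # Split the command line into one option per line and match the whole word.
    tr ' ' '\n' < "$cmdline_file" | grep -qx 'mce=no_cmci'
}

if has_no_cmci; then
    echo "mce=no_cmci is set: polling mode requested"
else
    echo "mce=no_cmci not found on the kernel command line"
fi
```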
3.3.4 Disabling kdump restart on udev triggered by logical processor offline
To disable the rule, add # at the beginning of the following line in the /etc/udev/rules.d/98-kexec.rules file, as shown here.
#SUBSYSTEM=="cpu", ACTION=="offline", PROGRAM="/etc/init.d/kdump restart"
Reload the udev rules after modifying the file.
udevadm control --reload-rules
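The edit can also be scripted. The following is a minimal sketch under stated assumptions: the function name disable_cpu_offline_rule is hypothetical, it assumes the rule line begins exactly with SUBSYSTEM=="cpu", ACTION=="offline" as shown above, and it keeps a copy of the original file with a .bak suffix. Lines that are already commented out are left unchanged, so running it twice is harmless.

```shell
#!/bin/sh
# Hypothetical helper: comment out the kdump-restart rule for CPU offline events.
# $1 = path to the udev rules file to modify.
disable_cpu_offline_rule() {
    rules_file="$1"
    # Keep a backup of the original file next to it.
    cp -p "$rules_file" "$rules_file.bak" || return 1
    # Prefix '#' only on uncommented lines that start with the cpu/offline rule.
    sed '/^SUBSYSTEM=="cpu", ACTION=="offline"/ s/^/#/' "$rules_file.bak" > "$rules_file"
}
```

It would typically be run as root with /etc/udev/rules.d/98-kexec.rules as the argument, followed by udevadm control --reload-rules.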
Note
kdump is restarted when capmonitor executes the script upon completion of Core
Offline. Place the script file to be used after Core Offline as described in
"3.3.5 Script file to be executed after Core Offline".