105
Chapter 4
Error Reporting and Handling
This section defines how errors are handled. Also discussed is the role of the BIOS in error handling and the
interaction between the BIOS, platform hardware, and server management firmware with regard to error
handling. In addition, error-logging techniques are described and beep codes and POST messages are
defined.
Fault Resilient Booting (FRB)
Fault Resilient Booting (FRB) is a set of BIOS and BMC algorithms and hardware support that allow a
multiprocessor system to boot in case of failure of the bootstrap processor (BSP) under certain conditions.
With on-board platform instrumentation, should a processor failure be detected during POST, the mBMC does
not have the ability to disable the failed or failing processor. Therefore the system may or may not continue to
boot. An FRB-2 error will be logged in the System Event Log (SEL) and an error will be displayed at POST.
FRB2 is a BIOS-based algorithm that uses the mBMC IPMI watchdog timer to protect against BIOS hangs
during the POST process
FRB1 – BSP Self-Test Failures
The BIOS provides an FRB1 timer. Early in POST, the BIOS checks the Built-in Self Test (BIST) results of the
BSP. If the BSP fails BIST, the BIOS will notify the user that the BIST failed; no processors will be disabled.
The BIST failure is displayed during POST and an error is logged to the SEL.
FRB2 –
BSP POST Failures
A second timer (FRB2) is set to several minutes by BIOS and is designed to guarantee that the system
completes POST. The FRB2 timer is enabled just before the FRB3 timer is disabled to prevent any
“unprotected” window of time. Near the end of POST, the BIOS disables the FRB2 timer. If the system contains
more than 1 GB of memory and the user chooses to test every DWORD of memory, the watchdog timer is
extended before the extended memory test starts, because the memory test can exceed the timer duration.
The BIOS will also disable the watchdog timer before prompting the user for a boot password. If the system
hangs during POST, before the BIOS disables the FRB2 timer, the appropriate event will be logged in the
System Event Log (SEL), and displayed to the user.
Options are provided by the BIOS to control the policy applied to FRB2 failures. These options are not
supported by the Server Board SE7320VP2, and mBMC does not support the option to disable the BSP.
FRB3 – BSP Reset Failures
The BIOS and firmware provide a feature to guarantee that the system boots, even if one or more processors
fail during POST. The mBMC contains one watchdog timer that can be configured to reset the system upon
time-out. The first timer (FRB3) starts counting down whenever the system comes out of hard reset. If the
BSP successfully resets and begins executing, the BIOS disables the FRB-3 timer in the mBMC and the
system continues executing POST. If the timer expires because of the BSP’s failure to fetch or execute BIOS
code, the mBMC resets the system and changes the bootstrap processor until the BIOS successfully disables
the FRB3 timer. The mBMC generates beep codes on the system speaker if it fails to find a good processor. It
will continue to cycle until it finds a good processor. The process of cycling through all the processors is
repeated upon system reset or power cycle. Soft resets do not affect the FRB3 timer. The duration of the
FRB3 timer is set by system firmware. The mBMC supports the algorithm described above, but does not
disable the processor and the failure will be logged as an FRB2 failure.
Watchdog Timer - Operating System Load Failures
The OS Watchdog Timer feature is designed to allow watchdog timer protection of the operating system load
process. This is done in conjunction with an operating system-present device driver or application that will
disable the watchdog timer once the operating system has successfully loaded. If the operating system load
process fails, the mBMC will reset the system.
The BIOS shall disable the OS Watchdog Timer before handing control to the OS loader if it is determined to
be booting from removable media or the BIOS cannot determine the media type.
If the BIOS is going to boot to a known hard drive, it will read a user option for the OS Watchdog Timer for HDD
Boots. If this is disabled, the BIOS will ensure the watchdog timer is disabled and boot. Otherwise the BIOS
Содержание Altos R510
Страница 6: ...VI ...
Страница 13: ...4 Chapter 1 System Block Diagram ...
Страница 68: ...61 Chapter 3 B A TP01301 ...
Страница 72: ...65 Chapter 3 ...
Страница 91: ...Chapter 3 84 TP01167 B C D F A E F E D B C ...
Страница 138: ...131 Chapter 5 ...
Страница 143: ...151 Appendix C ...