Advanced System Diagnostics and Troubleshooting Guide
37
4
Software Exception Handling
This chapter describes the software exception handling features built into Extreme hardware and
software products to detect and respond to problems to maximize switch reliability and availability.
This chapter contains the following sections:
•
Overview of Software Exception Handling Features on page 37
•
Configuring System Recovery Actions on page 40
•
Configuring Reboot Loop Protection on page 43
•
Dumping the “i” Series Switch System Memory on page 45
Overview of Software Exception Handling Features
In the context of using the Extreme Advanced System Diagnostics—either manually or automatically,
there are several things you must keep in mind that can affect the operation of the diagnostics and/or
the reliable operation of the switch itself:
•
System watchdog behavior
•
System software exception recovery behavior (configuration options)
•
Redundant MSM behavior (and failover, in BlackDiamond systems)
System Watchdog Behavior
The system watchdog is a system self-reliancy diagnostic mechanism to monitor the CPU and ensure
that it does not become trapped in a processing loop.
In normal operation, the system’s normal task processing periodically resets the watchdog timer and
restarts it, maintaining uninterrupted system operation. But if the watchdog timer expires before the
normal task processing restarts it, the system is presumed to be malfunctioning and the watchdog
expiry triggers a reboot of the master MSM.
Depending on the persistence of an error and the system recovery actions configured in the
configure
sys-recovery-level
command (reboot, shutdown, system dump, or—in the case of BlackDiamond
systems equipped with redundant MSMs—MSM failover), the reboot might cause the system to
perform the configured system recovery actions.
Содержание ExtremeWare Version 7.8
Страница 8: ...8 Advanced System Diagnostics and Troubleshooting Guide Contents...
Страница 14: ...14 Advanced System Diagnostics and Troubleshooting Guide Introduction...
Страница 24: ...24 Advanced System Diagnostics and Troubleshooting Guide i Series Switch Hardware Architecture...
Страница 48: ...48 Advanced System Diagnostics and Troubleshooting Guide Software Exception Handling...
Страница 102: ...102 Advanced System Diagnostics and Troubleshooting Guide Additional Diagnostics Tools...
Страница 110: ...110 Advanced System Diagnostics and Troubleshooting Guide Troubleshooting Guidelines...
Страница 114: ...114 Advanced System Diagnostics and Troubleshooting Guide Limited Operation Mode and Minimal Operation Mode...
Страница 120: ...120 Advanced System Diagnostics and Troubleshooting Guide Index...