Advanced System Diagnostics and Troubleshooting Guide
Software Exception Handling
Configuring System Recovery Actions
ExtremeWare provides a user-configurable system recovery software diagnostic tool whose main
function is to monitor the system boot processes. If an error occurs during the POST, the system enters
a fail-safe mode that allows the network or system administrator to view logs and troubleshoot the
fault.
The system recovery behavior can also be configured to control how the system responds if a
software exception error occurs during normal runtime operation. Although exception errors are
extremely rare, they are possible, and task exceptions can lead to unpredictable network behavior.
For this reason, it is important to plan for such a situation and configure the system to react
appropriately: several configurable variables determine the action the system takes when an
exception error occurs.
Related Commands
configure sys-recovery-level
configure system-dump
Configuring System Recovery Actions on “i” Series Switches
To specify a system recovery scheme for “i” series switches when a software exception occurs, use this
command:
configure sys-recovery-level [none | [all | critical] [msm-failover | reboot |
shutdown | system-dump [maintenance-mode | msm-failover | reboot | shutdown]]]
where:
none
No recovery action is taken when a software exception occurs (no system shutdown or
reboot, etc.). This is the default action.
all
If any task exception occurs, ExtremeWare logs an error in the system log and
automatically initiates the configured response action, selected from among the
following options:
• msm-failover: (BlackDiamond switches only.) If there is a software exception on
the master MSM, the slave MSM takes over control of the switch.
• reboot: Reboots the switch.
• shutdown: Shuts down the switch (see “Usage Notes,” below).
• system-dump: (Available only on switches with Ethernet management ports.) This
option should be used only with assistance from TAC. Initiates a memory dump
transfer to a remote TFTP dump server, followed by one of the following completion
actions: msm-failover, reboot, shutdown, or maintenance-mode.
Three of the completion actions under the system-dump option (msm-failover,
reboot, and shutdown) are the same as described above. The remaining
completion action for the system-dump option, maintenance-mode, leaves the
switch in whatever state the dump transfer puts it in. Some subsystems might not
work correctly, or at all, after a system dump.
These four system dump completion actions specify the action to be taken when the
system dump is complete. These actions occur whether or not the system dump
was successful.
critical
If a critical task exception occurs, ExtremeWare logs an error in the system log and
automatically initiates the configured response action, selected from among the
configuration options described for all, above. Critical tasks include the tBGTask,
tNetTask, tEdpTask, and tEsrpTask.
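
The keyword combinations and the behavior described above can be summarized in a small model. The following Python sketch is illustrative only and is not part of ExtremeWare: the helper names build_command and recovery_steps and the step label "log-error" are hypothetical. It assumes that the square brackets in the command syntax mark a required choice (so system-dump must be followed by a completion action), and it treats the four critical tasks named above as the full set, although the text says only that critical tasks "include" them.

```python
from typing import List, Optional

# Critical tasks named in the text; the guide says "include", so this set
# may not be exhaustive (assumption).
CRITICAL_TASKS = {"tBGTask", "tNetTask", "tEdpTask", "tEsrpTask"}

RESPONSE_ACTIONS = ("msm-failover", "reboot", "shutdown", "system-dump")
DUMP_COMPLETIONS = ("maintenance-mode", "msm-failover", "reboot", "shutdown")


def build_command(level: str, action: Optional[str] = None,
                  completion: Optional[str] = None) -> str:
    """Validate one keyword combination and return the CLI string (hypothetical helper)."""
    if level == "none":
        if action or completion:
            raise ValueError("'none' takes no further keywords")
        return "configure sys-recovery-level none"
    if level not in ("all", "critical"):
        raise ValueError(f"unknown level: {level}")
    if action not in RESPONSE_ACTIONS:
        raise ValueError(f"unknown response action: {action}")
    if action == "system-dump":
        # Assumption: a completion action is required after system-dump.
        if completion not in DUMP_COMPLETIONS:
            raise ValueError(f"unknown completion action: {completion}")
        return f"configure sys-recovery-level {level} system-dump {completion}"
    if completion:
        raise ValueError("a completion action applies only to system-dump")
    return f"configure sys-recovery-level {level} {action}"


def recovery_steps(level: str, action: Optional[str],
                   completion: Optional[str], task: str) -> List[str]:
    """Return the ordered steps described for an exception in `task`."""
    build_command(level, action, completion)  # reject invalid combinations
    if level == "none":
        # No recovery action; the text does not say whether an error is
        # logged in this case, so no step is modeled.
        return []
    if level == "critical" and task not in CRITICAL_TASKS:
        return []
    steps = ["log-error"]
    if action == "system-dump":
        # The completion action runs whether or not the dump succeeded.
        steps += ["system-dump", completion]
    else:
        steps.append(action)
    return steps
```

For example, build_command("critical", "reboot") yields the string configure sys-recovery-level critical reboot, and recovery_steps("critical", "reboot", None, "tNetTask") yields the log step followed by reboot, while the same configuration produces no steps for a task outside the critical set.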