System Health Checks: A Diagnostics Suite
Advanced System Diagnostics and Troubleshooting Guide
55
Automatic Mode.
Automatic mode for initiating a memory scan is set up when the system health
check
auto-recovery
option is enabled (see
“System (CPU and Backplane) Health Check” on page 70
).
When system health checks fail at the specified frequency, packet memory is invoked automatically.
Automatic mode status is listed in the “sys-health-check” field of the display for the
show switch
command.
When
auto-recovery
is configured, an automated background polling task checks every 20 seconds to
determine whether any fabric checksums have occurred. Three consecutive samples must be corrupted
for any module to invoke autoscan.
CAUTION
If the automatic mode is invoked—regardless of the “i” series platform type or number of errors—there
is an initial period where the device is taken offline so that the scan can be run.
The ExtremeWare diagnostics suite provides packet memory checking capabilities on “
i
” series Summit,
Alpine, and BlackDiamond systems at four levels:
•
Manually, as a subset of the extended system diagnostic, through the command:
run diagnostics extended
•
Manually, through the command:
run diagnostics packet-memory
These two options are available on “
i
” series Summit, Alpine, and BlackDiamond systems, and are
described in
“Runtime Diagnostics on “i” Series Systems” on page 57
.
•
Automatically, as a background task under the global system health check umbrella, as configured
in the commands:
enable sys-health-check
configure sys-health-check auto-recovery <number of tries> [offline | online]
(BlackDiamond)
configure sys-health-check alarm-level auto-recovery [offline | online] (Alpine or Summit)
This option is available on “
i
” series Summit, Alpine, and BlackDiamond systems, and is described
in
“System (CPU and Backplane) Health Check” on page 70
.
•
Automatically, on a per-slot basis, to scan and check the health of a specific BlackDiamond module,
as configured in the command:
configure packet-mem-scan-recovery-mode
This option is available on BlackDiamond systems only, and is described in
“Per-Slot Packet Memory
Scan on BlackDiamond Switches” on page 67
.
The Role of Processes to Monitor System Operation
When you are in the process of implementing the ExtremeWare diagnostics, keep in mind the software
fault recovery features built into Extreme hardware and software products to detect and respond to
problems to maximize switch reliability and availability. The System-Watchdog,
System-Recovery-Mode, and Reboot-Loop-Protection functions ensure that the switch can not only pass
all POST test diagnostics, but also verify that all processes continue to perform properly during runtime
operation. For more information, see
Chapter 4
,
“Software Exception Handling”
.
Содержание ExtremeWare Version 7.8
Страница 8: ...8 Advanced System Diagnostics and Troubleshooting Guide Contents...
Страница 14: ...14 Advanced System Diagnostics and Troubleshooting Guide Introduction...
Страница 24: ...24 Advanced System Diagnostics and Troubleshooting Guide i Series Switch Hardware Architecture...
Страница 48: ...48 Advanced System Diagnostics and Troubleshooting Guide Software Exception Handling...
Страница 102: ...102 Advanced System Diagnostics and Troubleshooting Guide Additional Diagnostics Tools...
Страница 110: ...110 Advanced System Diagnostics and Troubleshooting Guide Troubleshooting Guidelines...
Страница 114: ...114 Advanced System Diagnostics and Troubleshooting Guide Limited Operation Mode and Minimal Operation Mode...
Страница 120: ...120 Advanced System Diagnostics and Troubleshooting Guide Index...