Automatic Packet Memory Scan (via sys-health-check)
Advanced System Diagnostics and Troubleshooting Guide
65
During the memory scan, the CPU utilization is high and mostly dedicated to executing the
diagnostics—as is normal for running any diagnostic on the modules. During this time, other network
activities where this system is expected to be a timely participant could be adversely affected, for
example, in networks making use of STP and OSPF.
The alarm-level option of the global system health check facility does not attempt to diagnose a
suspected module; instead, it simply logs a message at a specified level.
The auto-recovery option
does
attempt to diagnose and recover a failed module a configured number of
times. You should plan carefully before you use this command option. If you enable the system health
check facility on the switch and configure the auto-recovery option to use the offline auto-recovery
action, once a module failure is suspected, the system removes the module from service and performs
extended diagnostics. If the number of auto-recovery attempts exceeds the configured threshold, the
system removes the module from service. The module is permanently marked “down,” is left in a
non-operational state, and cannot be used in a system running ExtremeWare 6.2.2 or later. A log
message indicating this will be posted to the system log.
NOTE
Keep in mind that the behavior described above is configurable by the user, and that you can enable
the system health check facility on the switch and configure the auto-recovery option to use the online
auto-recovery action, which will keep a suspect module online regardless of the number of errors
detected.
Example log messages for modules taken offline:
01/31/2005 01:16.40 <CRIT:SYST> Sys-health-check [ACTION] (PBUS checksum)
(CARD_HWFAIL_PBUS_CHKSUM_EDP_ERROR) slot 3
01/31/2005 01:16.40 <INFO:SYST> Card in slot 1 is off line
01/31/2005 01:16.40 <INFO:SYST> card.c 2035: Set card 1 to Non-operational
01/31/2005 01:16.40 <INFO:SYST> Card in slot 2 is off line
01/31/2005 01:16.44 <INFO:SYST> card.c 2035: Set card 2 to Non-operational
01/31/2005 01:16.44 <INFO:SYST> Card in slot 3 is off line
01/31/2005 01:16.46 <INFO:SYST> card.c 2035: Set card 3 to Non-operational
01/31/2005 01:16.46 <INFO:SYST> Card in slot 4 is off line
01/31/2005 01:16.46 <INFO:SYST> card.c 2035: Set card 4 to Non-operational
Содержание ExtremeWare Version 7.8
Страница 8: ...8 Advanced System Diagnostics and Troubleshooting Guide Contents...
Страница 14: ...14 Advanced System Diagnostics and Troubleshooting Guide Introduction...
Страница 24: ...24 Advanced System Diagnostics and Troubleshooting Guide i Series Switch Hardware Architecture...
Страница 48: ...48 Advanced System Diagnostics and Troubleshooting Guide Software Exception Handling...
Страница 102: ...102 Advanced System Diagnostics and Troubleshooting Guide Additional Diagnostics Tools...
Страница 110: ...110 Advanced System Diagnostics and Troubleshooting Guide Troubleshooting Guidelines...
Страница 114: ...114 Advanced System Diagnostics and Troubleshooting Guide Limited Operation Mode and Minimal Operation Mode...
Страница 120: ...120 Advanced System Diagnostics and Troubleshooting Guide Index...