background image

 

 

H3C S9820 Switch Series  

Troubleshooting Guide 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Document version: 6W100-20190815 

 

Copyright © 2019 New H3C Technologies Co., Ltd. All rights reserved. 
No part of this manual may be reproduced or transmitted in any form or by any means without prior written consent of New 
H3C Technologies Co., Ltd. 
Except for the trademarks of New H3C Technologies Co., Ltd., any trademarks that may be mentioned in this document are 
the property of their respective owners. 
The information in this document is subject to change without notice 

Содержание S9820 Series

Страница 1: ...is manual may be reproduced or transmitted in any form or by any means without prior written consent of New H3C Technologies Co Ltd Except for the trademarks of New H3C Technologies Co Ltd any trademarks that may be mentioned in this document are the property of their respective owners The information in this document is subject to change without notice ...

Страница 2: ...lure with an error message 10 Symptom 10 Solution 10 ACL application failure without an error message 10 Symptom 10 Troubleshooting flowchart 11 Solution 11 Related commands 12 Troubleshooting IRF 13 IRF fabric setup failure 13 Symptom 13 Troubleshooting flowchart 14 Solution 15 Related commands 17 Troubleshooting Ethernet link aggregation 17 Link aggregation failure 17 Symptom 17 Troubleshooting ...

Страница 3: ...ailure 32 Symptom 32 Troubleshooting flowchart 32 Solution 33 Related commands 33 Troubleshooting system management 34 High CPU utilization 34 Symptom 34 Troubleshooting flowchart 34 Solution 35 High memory utilization 36 Symptom 36 Troubleshooting flowchart 37 Solution 37 Related commands 38 Troubleshooting other problems 38 Layer 2 forwarding failure 38 Symptom 38 Troubleshooting flowchart 39 So...

Страница 4: ...collect system and configuration information including Symptom time of failure and configuration Network topology information including the network diagram port connections and points of failure Common log messages and diagnostic log information For more information about collecting this information see Collecting log and operating information Physical evidence of failure Photos of the hardware St...

Страница 5: ...hovers have occurred You must collect log files from all member devices To more easily identify log files from different member devices use two directories to save the log files from each member device and include member IDs in the directory names Table 1 Log and operating information Category File name format Log file directory Content Common log logfile log flash logfile Command execution events...

Страница 6: ... messages from the diagnostic log file buffer to a diagnostic log file By default the diagnostic log file is saved in the diagfile directory of the flash memory on each member device Sysname diagnostic logfile save The contents in the diagnostic log file buffer have been saved to the file flash diagfile diagfile log 2 Identify the diagnostic log file on each member device Display the diagnostic lo...

Страница 7: ...Y save N display Y N 2 At the prompt choose to save or display operating statistics To save operating statistics enter y at the prompt and then specify the destination file name Save or display diagnostic information Y save N display Y N y Please input the file name tar gz flash diag_Sysname_20160101 000704 tar gz Diagnostic information is outputting to flash diag_Sysname_20160101 000704 tar gz Pl...

Страница 8: ...9 53 UTC Tue 01 01 2016 More Troubleshooting hardware This section provides troubleshooting information for common hardware problems NOTE This section describes how to troubleshoot unexpected switch reboot power module failure and fan tray failure To troubleshoot ports see Troubleshooting ports Unexpected switch reboot Symptom The switch reboots unexpectedly when it is operating ...

Страница 9: ...nnot access the CLI go to step 2 2 Verify that the system software image on the switch is correct Connect to the switch through the console port and restart the switch If BootWare reports that a CRC error has occurred or that no system software image is available perform the following steps Unexpected switch reboot Can you access the CLI Collect operating information Reload the system software ima...

Страница 10: ...r command to verify that the power module has changed to Normal state If the power module remains in Absent state replace the power module 3 When the power module is in Fault state do the following a Verify that the power module is connected to the power source securely If it has been disconnected from the power source connect the power source to it b Determine whether the power module is in high ...

Страница 11: ...securely Then execute the display power command to verify that the power module has changed to Normal state If the power module remains in Absent state go to step b b Remove and install the power module into an empty power module slot Then execute the display power command to verify that the power module has changed to Normal state in the new slot If the power module remains in Absent state go to ...

Страница 12: ...nstall the fan tray to make sure the fan tray is installed securely Then execute the display fan command to verify that the fan tray has changed to Normal state If the fan tray remains in Absent state replace the fan tray 3 Execute the display environment command to display temperature information If the temperature continues to rise put your hand at the air outlet to feel if air is being expelled...

Страница 13: ...on provides troubleshooting information for common problems with ACLs ACL application failure with an error message Symptom The system fails to apply a packet filter or an ACL based QoS policy to the hardware It also displays the Reason Not enough hardware resource message Solution To resolve the problem 1 Execute the display qos acl resource command and then check the Remaining field for ACL reso...

Страница 14: ... is configured correctly a Use one of the following commands to check the QoS policy for configuration errors depending on the policy application destination Destination Command Interface display qos policy interface VLAN display qos vlan policy Global display qos policy global Control plane display qos policy control plane slot slot number b If the QoS policy does not contain a class behavior ass...

Страница 15: ...3 If the problem persists contact H3C Support ACL used in a packet filter To resolve the problem when the ACL is used in a packet filter 1 Verify that the packet filter is configured correctly Execute the display packet filter command to check whether the packet filter is configured correctly If there are any configuration errors reconfigure the packet filter If there is no configuration error go ...

Страница 16: ...control plane display qos policy global Displays information about global QoS policies display qos policy interface Displays information about the QoS policies applied to an interface or to all interfaces display qos policy user defined Displays user defined QoS policies display qos vlan policy Displays information about QoS policies applied to VLANs display traffic classifier user defined Display...

Страница 17: ...o Yes Resolved No Yes Yes No Resolved No Yes No Yes Correct transceiver modules or cables used Resolved No Yes Yes No Upgrade software to the same version in the fabric Resolved Yes No No Yes Same software version IRF links up Resolved No Yes Yes No Replace with correct cables or transceiver modules Bring up the IRF links or activate IRF port configuration IRF setup failure Number of members fewer...

Страница 18: ...rts are IRF capable You can use QSFP28 ports on the front panel of the S9820 64H switch as IRF physical interfaces The 10 GE or 25 GE breakout interfaces of a QSFP28 port cannot be used as IRF physical interfaces If there are binding errors reconfigure the IRF port bindings c Verify that the IRF physical interfaces are correct connected When you connect two neighboring IRF members you must connect...

Страница 19: ...and to identify the software version of each member device b Upgrade the software of all member devices to the same version NOTE Typically the irf auto update enable command can automatically synchronize a member device with the software version of the master device However the synchronization might fail when the gap between the software versions is too large For more information see the release n...

Страница 20: ...isplay irf link Displays IRF link information Use this command to verify that each IRF port has a minimum of one physical interface in up state display irf topology Displays the IRF fabric topology including the member IDs IRF port state and adjacencies of IRF ports display version Displays system version information irf port configuration active Activates IRF configuration on IRF ports Troublesho...

Страница 21: ...ure 4 Troubleshooting link aggregation failure Solution To resolve the problem 1 Verify that all physical connections are correct You can verify the physical connections against your network plan 2 Verify that all member ports are up ...

Страница 22: ...erface command on the peer device to display the configurations of the peer member ports b Configure the peer member ports to make sure the peer ports have the same operational key and attribute configurations as the peer port of the reference port 6 Verify that the number of member ports in the aggregation group does not exceed the configured maximum number of Selected ports a Execute the link ag...

Страница 23: ...etailed information about the aggregation groups that correspond to the existing aggregate interfaces link aggregation selected port maximum Configure the maximum number of Selected ports allowed in an aggregation group Troubleshooting ports This section provides troubleshooting information for common port problems A QSFP28 fiber port fails to come up Symptom A QSFP28 fiber port fails to come up ...

Страница 24: ...x command to set the rate and duplex mode for the port 2 Verify that the speed and duplex mode of the local port match the speed and duplex mode of the transceiver module No Yes No Yes No Yes Yes No No Yes Yes No No Yes Yes No No Yes Yes No A port failed to come up Contact the support Resolved Speed duplex match on local and peer port Speed duplex match on transceiver module and port Local peer po...

Страница 25: ...isplays None if no error has occurred The device displays alarms if the transceiver module has failed or if the type of the transceiver module does not match the port type b Use an optical power meter to verify that the Tx power and Rx power of the transceiver module are stable and are within the correct range c Execute the display transceiver interface command to verify that the local transceiver...

Страница 26: ...nsceiver information Transceiver Type 40G_BASE_LR4_QSFP_PLUS Connector Type LC Wavelength nm 1301 Transfer Distance km 10 SMF Digital Diagnostic Monitoring YES Vendor Name H3C Ordering Name QSFP 40G LR4 WDM1300 If the vendor name field does not display H3C replace the transceiver module with an H3C transceiver module If the vendor name field displays H3C perform the following tasks Execute the dis...

Страница 27: ...diagnosis function Sysname display transceiver diagnosis interface hundredgige 1 0 1 The transceiver does not support this function Troubleshooting flowchart Figure 7 Troubleshooting digital diagnosis failure on a transceiver module Solution To resolve the problem 1 Verify that the transceiver module is an H3C transceiver module Execute the display transceiver interface command to view the vendor ...

Страница 28: ...H3C Support to verify that the transceiver module supports the digital diagnosis function Sysname display transceiver interface hundredgige 1 0 1 HundredGigE 1 0 1 transceiver information Transceiver Type 40G_BASE_LR4_QSFP_PLUS Connector Type LC Wavelength nm 1301 Transfer Distance km 10 SMF Digital Diagnostic Monitoring YES Vendor Name H3C Ordering Name QSFP 40G LR4 WDM1300 3 If the problem persi...

Страница 29: ...ics changes more clearly b Use the display interface command to display the incoming packet statistics and outgoing packet statistics of the port c Determine the type of error frames that are accumulating 2 If the port is a fiber port verify that the optical power of the transceiver module is operating correctly a Use the display transceiver diagnosis interface command to view the present measured...

Страница 30: ...dium into a new port that is operating correctly If error frames still exist replace the link medium 5 Verify that the port is operating correctly If the port is a copper port connect the port directly to a PC If the port is a fiber port replace the transceiver module in the port If error frames do not exist troubleshoot the remaining possible points of failure on the transmission path The trouble...

Страница 31: ...command to clear the packet statistics of the port This command resets all packet counters to 0 so that you can view the statistics changes more clearly b Use the display interface command to verify that the number of incoming packets is accumulating c Verify that the number of error frames is not accumulating If the number of error frames is accumulating remove the errors For more information see...

Страница 32: ...f the port is in an aggregation group use the display link aggregation summary command to verify that the status of the port is Selected If the status of the port is Unselected the port cannot send or receive data packets Determine the reasons why the port becomes Unselected for example the attribute configurations of the port are different from the reference port Modify the attribute configuratio...

Страница 33: ...ors For more information see Error frames for example CRC errors on a port 3 Verify that the port configurations do not affect packet sending a Use the display interface brief command to verify that the port configurations are correct The port configurations include the duplex mode speed port type and VLAN configurations of the ports at both ends of the link If configuration errors exist modify th...

Страница 34: ...link medium 5 Verify that the port is operating correctly If the port is a copper port connect the port directly to a PC If the port is a fiber port replace the transceiver module in the port If the port can send packets troubleshoot the remaining possible points of failure on the transmission path The troubleshooting process is beyond the scope of this document 6 If the problem persists contact H...

Страница 35: ...s cannot set up an EBGP or IBGP neighbor relationship Troubleshooting flowchart Figure 11 Troubleshooting EBGP or IBGP neighbor relationship setup failure BGP settings correct Yes Yes No Modify BGP settings End Yes No BGP neighbor relationship setup failure Do BGP neighbors have connectivity Remove link failure or modify routing configuration Problem resolved No No Yes Problem resolved Contact the...

Страница 36: ...elow the critical alarm threshold in the last 10 minutes c If the memory usage stays above the critical alarm threshold contact Hewlett Packard Enterprise Support 4 Perform the following tasks to collect information and contact Hewlett Packard Enterprise Support a Execute the debugging bgp event command to view possible causes for neighbor relationship setup failure such as connection setup errors...

Страница 37: ...g bgp open Enables BGP OPEN message debugging debugging tcp packet Enables TCP packet debugging Troubleshooting system management This section provides troubleshooting information for common system management problems High CPU utilization Symptom The sustained CPU utilization of the device is over 80 Troubleshooting flowchart Figure 12 Troubleshooting high CPU utilization Identify the job that has...

Страница 38: ...0 0 migration 3 13 0 0 0 0 0 0 ksoftirqd 3 14 0 0 0 0 0 0 watchdog 3 15 0 0 0 0 0 0 migration 4 16 0 0 0 0 0 0 ksoftirqd 4 17 0 0 0 0 0 0 watchdog 4 18 0 0 0 0 0 0 migration 5 19 0 0 0 0 0 0 ksoftirqd 5 20 0 0 0 0 0 0 watchdog 5 21 0 0 0 0 0 0 migration 6 More The output shows the average CPU usage values of jobs for the last 5 seconds 1 minute and 5 minutes Typically the average CPU usage of a jo...

Страница 39: ...n 4 of 5 Kernel stack 80480754 schedule 0x954 0x1250 8028f720 watchdog 0xb0 0x410 802656d0 kthread 0x130 0x140 8021d730 kernel_thread_helper 0x10 0x20 Iteration 5 of 5 Kernel stack 80480754 schedule 0x954 0x1250 8028f720 watchdog 0xb0 0x410 802656d0 kthread 0x130 0x140 8021d730 kernel_thread_helper 0x10 0x20 3 Save the information displayed in the previous steps 4 Contact Hewlett Packard Enterpris...

Страница 40: ...ryCache 0 0 656 0 23 4 0 0 MFW_FsCache 2 39 768 0 39 8 1 1 biovec 64 0 0 96 8 30 1 0 0 cfq_io_context 0 0 52 0 42 1 0 0 ARP_Static_Entry_Cachep 0 0 432 0 34 4 0 0 LFIB_IlmEntryCache 0 0 80 0 34 1 0 0 LFIB_NhlfeCacheCache 0 0 536 0 28 4 0 0 jffs2_i 11 92 52 4 46 1 2 2 pktpcb 1 26 576 0 26 4 1 1 shmem_inode_cache 515 650 256 8 25 2 24 26 kmalloc 256 0 0 1936 0 16 8 0 0 MFW_FsCache 0 42 4096 0 7 8 0 ...

Страница 41: ... contact Hewlett Packard Enterprise Support You might lose critical diagnostic information if you reboot the device Related commands This section lists the commands that you might use for troubleshooting system management Command Description display cpu usage Displays the current CPU usage statistics display memory Displays memory usage statistics display process cpu Displays the CPU usage statist...

Страница 42: ...blem 1 Verify that no error packets have been received on the local port a Execute the display interface command and check for error packets Sysname display interface hundredgige 1 0 32 HundredGigE1 0 32 current state UP Line protocol state UP IP Packet Frame Type PKTFMT_ETHNT_2 Hardware Address 000f e200 002b ...

Страница 43: ...fic correctly you can determine that the hardware of the local port fails In this event you must replace the local port with a correctly operating port Transceiver module fiber or twisted pair failure To test and resolve such a failure replace the transceiver module fiber or twisted pair with a good one Inconsistent configurations Verify that the configurations including speed and duplex mode of t...

Страница 44: ...o no 0x2a 16 down 1 HGE1 0 15 0 62 ce30 no no 0x2b 16 down 1 HGE1 0 16 0 65 ce31 no no 0x30 16 down 1 HGE1 0 17 0 68 ce32 no no 0x31 16 down 2 HGE1 0 18 0 71 ce33 no no 0x36 16 down 2 HGE1 0 19 0 72 ce34 no no 0x37 16 down 2 HGE1 0 20 0 75 ce35 no no 0x3c 16 down 2 HGE1 0 21 0 76 ce36 no no 0x3d 16 down 2 HGE1 0 22 0 79 ce37 no no 0x42 16 down 2 HGE1 0 23 0 80 ce38 no no 0x43 16 down 2 HGE1 0 24 0...

Страница 45: ...o 0x72 16 down 0 HGE1 0 39 0 13 ce6 no no 0x73 16 down 0 HGE1 0 40 0 16 ce7 no no 0x78 16 down 0 HGE1 0 41 0 17 ce8 no no 0x79 16 down 0 HGE1 0 42 0 20 ce9 no no 0x7e 16 down 0 HGE1 0 43 0 21 ce10 no no 0x7f 16 down 0 HGE1 0 44 0 24 ce11 no no 0x84 16 down 0 HGE1 0 45 0 25 ce12 no no 0x85 16 down 0 HGE1 0 46 0 28 ce13 no no 0x8a 16 down 0 HGE1 0 47 0 29 ce14 no no 0x8b 16 down 0 HGE1 0 48 0 32 ce1...

Страница 46: ... GE0 0 1 0 137 no no 0xc0 16 down 0 The output shows that HundredGigE 1 0 1 is associated with chip port ce8 Execute the bcm slot 1 chip 0 show c ce16 command to check the RDBGC and TDBGC fields for Rx and Tx dropped packet statistics respectively The statistics displayed were generated between the last and the current execution of the command To view the change in dropped packet statistics execut...

Страница 47: ...hether the port is configured with the portal authentication Packets of users that fail to pass the portal authentication will be dropped by the port Use the display portal interface command to display the portal configuration information of the specified VLAN interface Determine whether the portal authentication can be disabled based on the network conditions To disable the portal authentication ...

Страница 48: ...rmation rate CIR and the committed burst size CBS are appropriate To adjust the CIR and CBS values execute the qos lr inbound outbound cir committed information rate cbs committed burst size command Storm suppression Execute the display this command in Ethernet interface view to display the configuration of storm suppression Storm suppression includes broadcast suppression multicast suppression an...

Страница 49: ...qos policy global Displays information about global QoS policies display qos policy interface Displays information about the QoS policies applied to an interface or all interfaces display qos queue statistics interface Displays traffic statistics collected for an interface on a per queue basis display qos vlan policy Displays information about QoS policies applied to VLANs display smart link group...

Страница 50: ...ries execute the arp static command to configure static ARP entries b Execute the display mac address command to verify that the output interfaces in the MAC address entries and ARP entries are the same If the output interfaces are different execute the reset arp command to clear the ARP entries Then the switch can learn ARP entries again 3 Verify that ND entries are correct if Layer 3 forwarding ...

Страница 51: ...Execute the display fib command to verify that the output interfaces in the FIB entries and route entries are the same If the output interfaces are not the same execute the reset command to clear the route entries Then the switch can learn route entries again 5 If the problem persists contact Hewlett Packard Enterprise Support Related commands This section lists the commands that you might use for...

Страница 52: ...Dyn Swi Hash AC Lmax 0 ROOT 0 0 0 0 3000 S On SMAC 0 1 ISIS 0 0 0 0 2000 D On SMAC 8 2 ESIS 0 0 0 0 600 S On SMAC 8 3 CLNP 0 0 0 0 1000 S On SMAC 8 4 VRRP 0 0 0 0 2000 S On SMAC 8 5 UNKNOWN_IPV4MC 0 0 0 0 600 S On SMAC 8 6 UNKNOWN_IPV6MC 0 0 0 0 600 S On SMAC 8 7 IPV4_MC_RIP 0 0 0 0 1000 D On SMAC 8 8 IPV4_BC_RIP 0 0 0 0 1000 D On SMAC 8 Layer 2 packet loss occurs Layer 3 packet loss occurs Troubl...

Страница 53: ...50 4 If the problem persists contact Hewlett Packard Enterprise Support When you contact Hewlett Packard Enterprise Support provide diagnostic information if software related packet loss occurred ...

Отзывы: