background image

vii

Using This Documentation

This service manual provides detailed procedures that describe the service of the Sun
Network QDR InfiniBand Gateway Switch from Oracle. This document is written for
technicians, system administrators, and users who have advanced experience
servicing InfiniBand fabric hardware.

“Product Notes” on page vii

“Related Documentation” on page vii

“Feedback” on page viii

“Access to Oracle Support” on page viii

Product Notes

For late-breaking information and known issues about this product, refer to the
product notes at:

http://docs.oracle.com/cd/E36256_01

Related Documentation

Documentation

Links

Sun Network QDR InfiniBand Gateway Switch
Firmware Version 2.1

http://docs.oracle.com/cd/E36256_01

Summary of Contents for Sun Network QDR InfiniBand Gateway Switch

Page 1: ...Sun Network QDR InfiniBand Gateway Switch Service Manual for Firmware Version 2 1 Part No E36262 01 March 2013 Revision A ...

Page 2: ...és sous licence et soumis à des restrictions d utilisation et de divulgation Sauf disposition de votre contrat de licence ou de la loi vous ne pouvez pas copier reproduire traduire diffuser modifier breveter transmettre distribuer exposer exécuter publier ou afficher le logiciel même partiellement sous quelque forme et par quelque procédé que ce soit Par ailleurs il est interdit de procéder à tout...

Page 3: ...tus LEDs 6 Check Fan Status LEDs 7 Managing Faulty Components 7 Display Faulty Components fault_state 8 Display Faulty Components SP faultmgmt 9 Clear a Fault Manually 10 Clearable Fault Targets 11 Identify Faults in the Oracle ILOM Event Log 12 Determining the Alarm State of a Component or System 13 Display the General Alarm State of Systems and Components 14 System Alarm Targets 15 Component Ala...

Page 4: ...r 24 Temperature Sensor Values 24 Temperature Out of Range 25 Evaluating a Speed Sensor Alarm 26 Evaluate a Speed Sensor 26 Speed Sensor Values 27 Speed Out of Range 27 Evaluating a State Sensor Alarm 29 Evaluate a State Sensor 29 State Sensor Alarm Conditions 30 Evaluating a Presence Sensor Alarm 30 Evaluate a Presence Sensor 31 Presence Sensor Alarm Conditions 31 Evaluating an Indicator State 32...

Page 5: ...3 Identify the Power Supply 43 Inspect the Power Supply Hardware 45 Inspect the Power Supply Connectors 45 Power Off a Power Supply 46 Remove a Power Supply 47 Install a Power Supply 49 Power On a Power Supply 51 Servicing Fans 55 Determine If a Fan Is Faulty 55 Inspecting a Fan 57 Identify the Fan 57 Inspect the Fan Hardware 58 Inspect the Fan Connector 59 Remove a Fan 60 Install a Fan 61 Servici...

Page 6: ... 2 1 March 2013 Inspect the Data Cable Hardware 67 Inspect the Data Cable Connectors or Transceivers 67 Remove a Data Cable 68 Install a Data Cable 72 Servicing the Battery 75 Determine If the Battery Is Faulty 75 Remove the Gateway From the Rack 77 Replace the Battery 78 Index 85 ...

Page 7: ...ced experience servicing InfiniBand fabric hardware Product Notes on page vii Related Documentation on page vii Feedback on page viii Access to Oracle Support on page viii Product Notes For late breaking information and known issues about this product refer to the product notes at http docs oracle com cd E36256_01 Related Documentation Documentation Links Sun Network QDR InfiniBand Gateway Switch ...

Page 8: ...s have access to electronic support through My Oracle Support For information visit http www oracle com pls topic lookup ctx acc id info or http www oracle com pls topic lookup ctx acc id trs visit if you are hearing impaired Oracle Solaris 11 OS http www oracle com goto Solaris11 docs Oracle Integrated Lights Out Manager ILOM 3 0 http docs oracle com cd E19860 01 All Oracle products http docs ora...

Page 9: ...1 Servicing Fans on page 55 Servicing Data Cables on page 65 Servicing the Battery on page 75 Interpreting Status LEDs Use these topics to interpret LEDs to determine if a component has failed Front Panel LEDs on page 2 Rear Panel LEDs on page 3 Description Links Investigate whether there is a fault condition Interpreting Status LEDs on page 1 Managing Faulty Components on page 7 Identify Faults i...

Page 10: ... LEDs on page 1 Managing Faulty Components on page 7 Identify Faults in the Oracle ILOM Event Log on page 12 Determining the Alarm State of a Component or System on page 13 Evaluating Sensor Alarms on page 17 Accessing CLI Prompts on page 34 Front Panel LEDs No LED Link 1 Power supply AC LED Check Power Supply Status LEDs on page 6 2 Power supply Attention LED Check Power Supply Status LEDs on pag...

Page 11: ...nformation Front Panel LEDs on page 2 Check Chassis Status LEDs on page 4 Check NET MGT Port Status LEDs on page 4 Check Link Status LEDs on page 5 Check Power Supply Status LEDs on page 6 Check Fan Status LEDs on page 7 No LED Link 1 NET MGT status LEDs Check NET MGT Port Status LEDs on page 4 2 InfiniBand link status LEDs Check Link Status LEDs on page 5 3 Ethernet link status LEDs Check Link St...

Page 12: ...ort Status LEDs on page 4 Check Link Status LEDs on page 5 Check Power Supply Status LEDs on page 6 Check Fan Status LEDs on page 7 Check NET MGT Port Status LEDs The NET MGT port status LEDs are located on the NET MGT connector of the rear panel See Rear Panel LEDs on page 3 1 Visually inspect the NET status LEDs 2 Compare what you see to this table Glyph Location Name Color State and Meaning Top...

Page 13: ...ink status LEDs are located at the data cable connectors of the rear panel See Rear Panel LEDs on page 3 1 Visually inspect the link status LEDs 2 Compare what you see for a particular link to this table 3 If the Link LED flashes there might be a problem with the data cable See Servicing Data Cables on page 65 Name Location Color State and Meaning Link speed Left Amber or green Amber on 100BASE T ...

Page 14: ...use of a thermal or overcurrent condition signified by the amber Attention LED lighting remove the respective power cord from the chassis Allow the power supply to completely cool for at least 15 minutes A shorter cooling time might cause damage to the power supply when the power cord is reattached If the Attention LED lights amber upon reattaching the power cord replace the power supply 3 If the ...

Page 15: ... the fan status LEDs 2 If the LED is lit there is a fault with that fan See Servicing Fans on page 55 Related Information Front Panel LEDs on page 2 Rear Panel LEDs on page 3 Check Chassis Status LEDs on page 4 Check NET MGT Port Status LEDs on page 4 Check Link Status LEDs on page 5 Check Power Supply Status LEDs on page 6 Managing Faulty Components If Oracle ILOM has detected a fault with a comp...

Page 16: ...n page 35 2 Display the fault state of components 3 Look in the Value column for Faulted 4 Look in the same row under the Target column to find the Oracle ILOM target of the faulty component For example SYS FAN2 5 Identify the component that has faulted and might need to be replaced See Clearable Fault Targets on page 11 Related Information Display Faulty Components SP faultmgmt on page 9 Clear a ...

Page 17: ...get of the faulty component Note If there are several faulty components then their respective targets are listed with increasing target sequence numbers Note If no number is displayed there are no faulty components For example 3 Display details of the fault where x is the target sequence number starting at 0 show d targets SP faultmgmt SP faultmgmt Targets x faulted_target show d targets SP faultm...

Page 18: ...age 10 Clearable Fault Targets on page 11 Clear a Fault Manually If Oracle ILOM detects a fault and consequential component replacement Oracle ILOM automatically clears the fault However you can manually clear the fault after replacing the component if necessary 1 Access the Oracle ILOM CLI See Access the Oracle ILOM CLI NET MGT Port on page 35 show SP faultmgmt 0 faults 0 SP faultmgmt 0 faults 0 ...

Page 19: ...aulty Components SP faultmgmt on page 9 Clear a Fault Manually on page 10 Identify Faults in the Oracle ILOM Event Log on page 12 Related Information Display Faulty Components fault_state on page 8 Display Faulty Components SP faultmgmt on page 9 set target clear_fault_action true set SYS PSU0 clear_fault_action true Are you sure you want to clear SYS PSU0 y n y Set clear_fault_action to true Comp...

Page 20: ...he faulty components in the output The Oracle ILOM targets of the faulty components follow the word component For example show SP logs event list Class class Type type show SP logs event list Class Fault show SP logs event list Class Fault Event ID Date Time Class Type Severity 18820 Tue Sep 25 13 44 56 2012 Fault Fault critical Fault detected at time Tue Sep 25 13 44 56 2012 The suspect component...

Page 21: ... in slot 0 was now functional Continuing up the output Event ID 18820 on September 25 indicated that a critical fault occurred again in the component with Oracle ILOM target SYS PSU0 4 Depending on the severity of the fault replace the component See Clearable Fault Targets on page 11 for servicing links Related Information Interpreting Status LEDs on page 1 Managing Faulty Components on page 7 Det...

Page 22: ... General Alarm State of Systems and Components 1 Access the Oracle ILOM CLI See Access the Oracle ILOM CLI NET MGT Port on page 35 2 Type where target is from the tables in System Alarm Targets on page 15 and Component Alarm Targets on page 15 For example to display the general alarm state of fan 1 type 3 Compare the value displayed to the alarm states See Oracle ILOM Target Alarm States on page 1...

Page 23: ... 14 Component Alarm Targets on page 15 Oracle ILOM Target Alarm States on page 16 Component Alarm Targets This table lists components or sensors that have the ability to report an alarm and their Oracle ILOM targets Use these targets for the procedure Display the General Alarm State of Systems and Components on page 14 System Target Cooling system SYS COOLING_ATTN Signal cable monitoring SYS CABLE...

Page 24: ...CB alarm SYS MB V_ECB 3 3v main voltage alarm SYS MB V_3 3VMainOK 5v alarm SYS MB V_5VOK 1 0v alarm SYS MB V_1 0VOK I4 switch chip voltage alarm SYS MB V_I41 2VOK 2 5 v alarm SYS MB V_2 5VOK Digital power alarm SYS MB V_V1P2DIG Analog power alarm SYS MB V_V1P2ANG BridgeX chip voltage alarm SYS MB V_BX1 2VOK 1 8V alarm SYS MB V_1 8VOK I4 switch chip boot alarm SYS MB BOOT_I4A SSD drive alarm SYS MB...

Page 25: ...identified a condition that is abnormal but does not affect any individual component minor An alarm has identified a condition that might affect an individual component major An alarm has identified a condition that affects only the individual component The condition might affect a system but not enough to compromise the operation of the gateway critical An alarm has identified a condition that af...

Page 26: ...sensor and display its value Display Oracle ILOM Sensor Status on page 18 2 Determine the sensor target and alarm type Determine Oracle ILOM Sensor Target Types on page 20 3 Evaluate the sensor type alarm Evaluating a Voltage Sensor Alarm on page 20 Evaluating a Temperature Sensor Alarm on page 23 Evaluating a Speed Sensor Alarm on page 26 Evaluating a State Sensor Alarm on page 29 Evaluating a Pr...

Page 27: ...t and value For example SYS MB V_3 3VStby and 3 490 volts 7 Determine the sensor type See Determine Oracle ILOM Sensor Target Types on page 20 Related Information Determine Oracle ILOM Sensor Target Types on page 20 Evaluating a Voltage Sensor Alarm on page 20 Evaluating a Temperature Sensor Alarm on page 23 Evaluating a Speed Sensor Alarm on page 26 Evaluating a State Sensor Alarm on page 29 Eval...

Page 28: ...sensor alarms Evaluate a Voltage Sensor on page 21 Voltage Sensor Values on page 22 Sensor Target Sensor Type Links SYS FANx string Fan state Fan speed Fan presence Evaluating a State Sensor Alarm on page 29 Evaluating a Speed Sensor Alarm on page 26 Evaluating a Presence Sensor Alarm on page 30 SYS I_string Indicator Evaluating an Indicator State on page 32 SYS MB T_string Main board temperature ...

Page 29: ... See Display Oracle ILOM Sensor Status on page 18 Determine Oracle ILOM Sensor Target Types on page 20 2 Compare the displayed value with a known good range See Voltage Sensor Values on page 22 3 Learn why a voltage sensor might alarm See Voltage Out of Range on page 22 4 Determine your next step Related Information Voltage Sensor Values on page 22 Voltage Out of Range on page 22 Voltage Sensor Ta...

Page 30: ...ltage drifts outside of the acceptable range and goes too high or too low When a voltage is too high it can be caused by The load for which the voltage is provided is missing A component has failed or has been removed from the electrical connection The regulator for that voltage has failed Voltage Sensor Target Typical Value Acceptable Range SYS MB V_3 3VMain 3 266V 3 112 to 3 403V SYS MB V_3 3VSt...

Page 31: ...ry heavy throughput loading quite possibly in conjunction with overheating Because both types of voltage extremes for the SYS MB V_I41 2V sensor target can be indicative of a thermal problem with the I4 switch chip it follows that a check of the temperature at sensor target SYS MB T_I4A is in order Note The 3 3VMain 3 3VStby and the 12V are provided by the power supplies redundantly If one of thes...

Page 32: ...or Target Types on page 20 2 Compare the displayed value with a known good range See Temperature Sensor Values on page 24 3 Learn why a temperature sensor might alarm and take action See Temperature Out of Range on page 25 Related Information Temperature Sensor Values on page 24 Temperature Out of Range on page 25 Temperature Sensor Values This table lists typical values and acceptable ranges for ...

Page 33: ...ed to overvoltage situations when a voltage regulator fails they generate more heat For example if the temperature at sensor target SYS MB T_I4A is too high then the fans speeds SYS FANx TACH are collectively too low the cooling air temperature SYS MB T_FRONT is too high the voltage powering the I4 switch chip SYS MB V_I41 2V is too high or the loading on the switch chip is too high When a tempera...

Page 34: ... and replace any that are not operating properly See Servicing Fans on page 55 If new fans do not resolve the problem then replace the gateway Related Information Evaluate a Temperature Sensor on page 24 Temperature Sensor Values on page 24 Evaluating a Speed Sensor Alarm These topics help you resolve speed sensor alarms Evaluate a Speed Sensor on page 26 Speed Sensor Values on page 27 Speed Out o...

Page 35: ...is near a boundary or outside of the acceptable range refer to Speed Out of Range on page 27 Related Information Evaluate a Speed Sensor on page 26 Speed Out of Range on page 27 Speed Out of Range The speed of the fans is varied by the management controller The management controller uses an algorithm that considers the cooling air temperature the number of fans spinning and the temperatures within...

Page 36: ...ll overheat When a fan speed is too low it also is an indication of the condition of the fan which directly affects the operation of the gateway A too low fan speed can be caused by Coil failure The fan motor uses alternating electromagnetic fields to spin the fan impeller Depending upon the fan motor design if the coil that creates a magnetic field fails the fan might spin much slower or not at a...

Page 37: ...luating a Presence Sensor Alarm on page 30 Evaluating an Indicator State on page 32 Evaluate a State Sensor 1 Display the sensor status and determine the target type See Display Oracle ILOM Sensor Status on page 18 Determine Oracle ILOM Sensor Target Types on page 20 2 Learn why a state sensor might alarm See State Sensor Alarm Conditions on page 30 3 Determine your next step State Sensor Target A...

Page 38: ...s State Asserted there is a problem with fan 1 Related Information Evaluate a State Sensor on page 29 Evaluating a Presence Sensor Alarm These topics help you resolve presence sensor alarms Evaluate a Presence Sensor on page 31 Presence Sensor Alarm Conditions on page 31 SYS MB V_3 3VMainOK SYS POWER_ATTN SYS POWER_REDUN SYS PSUx ALERT SYS PSUx AC_PRESENT SYS PSUx FAULT Replace the power supply Se...

Page 39: ...nditions on page 31 Presence Sensor Alarm Conditions The presence sensors for the power supplies and fans indicate that the component is physically installed The sensors do not provide status or health of a component During the boot process the management controller looks for presence sensors to build a list of Oracle ILOM targets If the presence sensor cannot be read yet the component is physical...

Page 40: ...age 33 Indicator State Conditions on page 33 Related Information Display Oracle ILOM Sensor Status on page 18 Determine Oracle ILOM Sensor Target Types on page 20 Evaluating a Voltage Sensor Alarm on page 20 Evaluating a Temperature Sensor Alarm on page 23 Evaluating a Speed Sensor Alarm on page 26 Evaluating a State Sensor Alarm on page 29 Evaluating a Presence Sensor Alarm on page 30 Evaluate an...

Page 41: ...page 33 Related Information Evaluate an Indicator State on page 32 Indicator State Conditions on page 33 Indicator State Conditions Three primary LED indicators provide management controller status general chassis status and identification The table correlates the indicator target with the LED that represents that target When the locator LED is on it is actually flashing If the gateway is installe...

Page 42: ...ion See Check Chassis Status LEDs on page 4 and Display Oracle ILOM Sensor Status on page 18 to help determine the alarm condition of the gateway Related Information Evaluate an Indicator State on page 32 Indicator State Values on page 33 Accessing CLI Prompts These tasks enable you to issue Oracle ILOM and restricted shell commands on the management controller Access the Oracle ILOM CLI NET MGT P...

Page 43: ...ement controller by specifying the controller s host name For example where nm2name is the host name of the management controller Initially the password is ilom admin Note You can change the password at a later time Refer to Gateway Remote Management changing a user role or password for instructions on how to change Oracle ILOM user passwords The Oracle ILOM shell prompt is displayed Related Infor...

Page 44: ... Linux Shell on page 36 Exit the Restricted Linux Shell When you want to leave the restricted shell use the exit command On the management controller type Related Information Access the Oracle ILOM CLI NET MGT Port on page 35 Enter the Restricted Linux Shell on page 35 show SYS Fabric_Mgmt NOTE show on Fabric_Mgmt will launch a restricted Linux shell User can execute switch diagnosis SM Configurat...

Page 45: ... a component Once a failed part is identified it can be replaced The topics listed here help you service gateway chassis components Replaceable Components on page 37 Suggested Tools for Service on page 39 Antistatic Precautions for Service on page 39 Related Information Detecting and Managing Faults on page 1 Servicing Power Supplies on page 41 Servicing Fans on page 55 Servicing Data Cables on pa...

Page 46: ...E Replaceable Components Related Information Servicing Power Supplies on page 41 Servicing Fans on page 55 Servicing Data Cables on page 65 Servicing the Battery on page 75 Suggested Tools for Service on page 39 Antistatic Precautions for Service on page 39 Figure Legend 1 Battery 2 Fan 3 Power supply ...

Page 47: ... Related Information Replaceable Components on page 37 Antistatic Precautions for Service on page 39 Antistatic Precautions for Service When installing the gateway chassis take care to follow antistatic precautions Use an antistatic mat as a work surface Wear an antistatic wrist strap that is attached to either the mat or a metal portion of the gateway chassis Related Information Replaceable Compo...

Page 48: ...40 Sun Network QDR InfiniBand Gateway Switch Service Manual for Firmware Version 2 1 March 2013 ...

Page 49: ...ermine which power supply is faulty before you replace it Description Links Add a power supply Inspecting a Power Supply on page 43 Install a Power Supply on page 49 Power On a Power Supply on page 51 Replace a power supply Determine If a Power Supply Is Faulty on page 41 Power Off a Power Supply on page 46 Remove a Power Supply on page 47 Inspecting a Power Supply on page 43 Install a Power Suppl...

Page 50: ...f a power supply is faulty you will see SYS PSUx listed in the output under Target where x is 0 left power supply or 1 right power supply For example If a power supply is faulty replace it See Remove a Power Supply on page 47 If a FRU value in addition to or different from SYS PSUx is displayed see Clearable Fault Targets on page 11 to identify which component is faulty In no Oracle ILOM targets a...

Page 51: ...upply 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Inspecting a Power Supply on page 43 2 Use this illustration to identify the various features of a power supply Step Description Links 1 Identify the Power Supply Identify the Power Supply on page 43 2 Inspect the hardware Inspect the Power Supply Hardware on page 45 3 Inspect the...

Page 52: ... Manual for Firmware Version 2 1 March 2013 3 Inspect the power supply hardware See Inspect the Power Supply Hardware on page 45 Related Information Identify the Fan on page 57 Identify the Data Cable on page 66 1 AC connector 2 Release tab 3 Status LEDs ...

Page 53: ...age to the power supply chassis 4 Verify that the release tab moves freely and smoothly 5 Inspect the power supply connectors See Inspect the Power Supply Connectors on page 45 Related Information Inspect the Fan Hardware on page 58 Inspect the Data Cable Hardware on page 67 Inspect the Power Supply Connectors 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction...

Page 54: ... Inspect the Data Cable Connectors or Transceivers on page 67 Power Off a Power Supply Note Powering off both power supplies powers off the gateway 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Power Supplies on page 41 2 Determine which power supply is to be removed 3 At the front of the gateway chassis remove the power ...

Page 55: ... the power supply See Remove a Power Supply on page 47 Related Information Power On a Power Supply on page 51 Remove a Power Supply 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Power Supplies on page 41 ...

Page 56: ...irmware Version 2 1 March 2013 2 Locate the power supply to be removed 3 Press and hold the release tab to the left and pull on the handle of the power supply 4 Continue to pull the handle of the power supply to remove it from the chassis 5 Set the power supply aside ...

Page 57: ...requisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Power Supplies on page 41 2 Inspect the replacement power supply See Inspecting a Power Supply on page 43 3 Verify that the slot where the power supply installs is clean and free of debris 4 Verify that the slot connector pins are straight and not missing 5 Verify that the slot connector recept...

Page 58: ...witch Service Manual for Firmware Version 2 1 March 2013 8 When the power supply seats push firmly so that the release tab clicks to secure the power supply into the chassis 9 Power on the power supply See Power On a Power Supply on page 51 ...

Page 59: ...er Supply 1 For residual power discharge the power cord must remain unattached to the power supply for at least one minute before powering on a power supply 2 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Power Supplies on page 41 3 Reconnect the power cord to the power supply ...

Page 60: ...ly is at full power 4 Access the Oracle ILOM CLI See Access the Oracle ILOM CLI NET MGT Port on page 35 5 Enter the restricted Linux shell See Enter the Restricted Linux Shell on page 35 6 Verify the power supply s operation with the checkpower and checkvoltages commands on the management controller For example to check the power supplies FabMan gateway_name checkpower PSU 0 present status OK PSU ...

Page 61: ...ame checkvoltages Voltage ECB OK Measured 3 3V Main 3 30 V Measured 3 3V Standby 3 42 V Measured 12V 12 06 V Measured 5V 5 03 V Measured VBAT 3 17 V Measured 1 0V 1 01 V Measured I4 1 2V 1 22 V Measured 2 5V 2 51 V Measured V1P2 DIG 1 18 V Measured V1P2 ANG 1 18 V Measured 1 2V BridgeX 1 22 V Measured 1 8V 1 80 V Measured 1 2V Standby 1 20 V All voltages OK FabMan gateway_name ...

Page 62: ...54 Sun Network QDR InfiniBand Gateway Switch Service Manual for Firmware Version 2 1 March 2013 ...

Page 63: ...n page 75 Determine If a Fan Is Faulty You must determine which power supply is faulty before you replace it 1 Check to see if any System Service Required LEDs are lit or flashing See Check Chassis Status LEDs on page 4 Description Links Add a fan Inspecting a Fan on page 57 Install a Fan on page 61 Replace a fan Determine If a Fan Is Faulty on page 55 Remove a Fan on page 60 Inspecting a Fan on p...

Page 64: ...f a fan is faulty replace it See Remove a Fan on page 60 If a FRU value in addition to or different from SYS FANx is displayed see Clearable Fault Targets on page 11 to identify which component is faulty If no Oracle ILOM targets are listed go to Step 5 5 Within the Oracle ILOM interface verify the fan speed where x is 0 left fan to 4 right fan For example 6 Compare the value seen with the typical...

Page 65: ...its suitability for installation Related Information Inspecting a Power Supply on page 43 Inspecting the Data Cables on page 65 Identify the Fan 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Inspecting a Fan on page 57 2 Use this illustration to identify the various features of a fan Step Description Links 1 Identify the fan Identi...

Page 66: ...are 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Inspecting a Fan on page 57 2 Unwrap the replacement fan from its antistatic packaging 3 Verify that there is no visible damage to the fan chassis 4 Verify that the thumbscrew spins freely and smoothly 5 Inspect the fan connector See Inspect the Fan Connector on page 59 Related Info...

Page 67: ...Inspecting a Fan on page 57 2 Verify that the connector is clean and without damage 3 Verify that the connector receptacles are free from obstructions 4 Verify that the connector freely floats in its mounting 5 The fan is ready for installation See Install a Fan on page 61 Related Information Inspect the Power Supply Connectors on page 45 Inspect the Data Cable Connectors or Transceivers on page 6...

Page 68: ...n two operational fans the gateway shuts down to prevent thermal overload 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Fans on page 55 2 Determine which fan is to be removed If a fan has failed its Attention LED lights 3 Loosen the captive thumbscrew at the right side of the fan 4 Grasp the handle and pull the fan straig...

Page 69: ...removing the fan as a subtractive action you are finished Related Information Remove a Power Supply on page 47 Remove a Data Cable on page 68 Remove the Gateway From the Rack on page 77 Replace the Battery on page 78 Install a Fan 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Servicing Fans on page 55 ...

Page 70: ...lot where the fan installs is clean and free of debris 4 Verify that the slot connector pins are straight and not missing 5 Orient the fan to the opening in the gateway chassis with the thumbscrew on the right 6 Firmly slide the fan into the chassis until the fan stops The fan might immediately power on 7 Tighten the captive thumbscrew to secure the fan in the gateway chassis ...

Page 71: ... management controller to verify the fan s operation Note You should see a fan speed for the fan you just installed For example to check the fans Related Information Gateway Reference getfanspeed command Install a Power Supply on page 49 Install a Data Cable on page 72 Replace the Battery on page 78 FabMan gateway_name getfanspeed Fan 0 not present Fan 1 running at rpm 11212 Fan 2 running at rpm 1...

Page 72: ...64 Sun Network QDR InfiniBand Gateway Switch Service Manual for Firmware Version 2 1 March 2013 ...

Page 73: ...ecting the Data Cables Before installing a data cable inspect its hardware and connectors to verify its suitability for installation Description Links Add a data cable Inspecting the Data Cables on page 65 Install a Data Cable on page 72 Replace a data cable Remove a Data Cable on page 68 Inspecting the Data Cables on page 65 Install a Data Cable on page 72 Subtract a data cable Remove a Data Cabl...

Page 74: ...equisite and subsequent service tasks you must perform in conjunction with this procedure See Inspecting the Data Cables on page 65 2 Use this illustration to identify the various features of the data cable 2 Inspect the hardware Inspect the Data Cable Hardware on page 67 3 Inspect the connectors Inspect the Data Cable Connectors or Transceivers on page 67 1 Retraction strap 2 L groove 3 Paddle bo...

Page 75: ...cable connectors or transceivers See Inspect the Data Cable Connectors or Transceivers on page 67 Related Information Inspect the Power Supply Hardware on page 45 Inspect the Fan Hardware on page 58 Inspect the Data Cable Connectors or Transceivers 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction with this procedure See Inspecting the Data Cables on page 65 ...

Page 76: ...s how to remove the cables from the gateway chassis so that the cable can be replaced If you are removing all cables for gateway replacement start removing the cables from the left side of the gateway working your way to the right Note These instructions are valid for both InfiniBand and Ethernet data cables 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction w...

Page 77: ...er your next steps If the cable is a one piece data cable follow these steps a Grasp the cable connector to support its weight and apply the removal force b Pull on the retractor strap while simultaneously pulling on the cable connector The cable connector comes free ...

Page 78: ...ce Manual for Firmware Version 2 1 March 2013 c Carefully move the cable out of the cable management hardware d Continue to Step 5 If the cable is an assembled data cable follow these steps a Grasp the release collar on the MTP connector and pull back ...

Page 79: ...The transceiver comes free d Set the transceiver aside e Continue to Step 5 5 Open hook and loop fasteners from bundles and securing hard points to gently lower the cable to the floor Caution Do not allow the cable or transceiver to drop or strike the floor Jerking bending pulling on or dropping the cable can damage the cable 6 Consider your next steps If you are removing a single cable for replac...

Page 80: ...quent service tasks you must perform in conjunction with this procedure See Servicing Data Cables on page 65 2 Determine your next steps If you are cabling an entire gateway after a replacement procedure locate the cable for the connector 0B and go to Step 6 If you are installing a replacement cable to the gateway start the procedure at Step 3 3 If necessary assemble the data cable Refer to Gatewa...

Page 81: ...d so on ensure that the L groove and retraction strap are up When installing QSFP cables in the bottom row receptacles 0B 1B 2B and so on ensure that the L groove and retraction strap are down See Identify the Data Cable on page 66 8 Slowly move the connector in As you slide the connector in the shell should be in the center of the QSFP receptacle If the connector stops or binds after about 1 4 in...

Page 82: ... management hardware Close hook and loop fasteners at bundles and securing hard points 11 If you are installing all cables as part of a gateway replacement procedure repeat from Step 6 for all cables including the Ethernet data cables at connectors 0A and 1A on the right side of the rear panel 12 Replace the cover for the cable management bracket and tighten the thumbscrews Related Information Ins...

Page 83: ...Data Cables on page 65 Determine If the Battery Is Faulty You must determine if the battery is faulty before you replace it 1 Check to see if any System Service Required LEDs are lit or flashing See Check Chassis Status LEDs on page 4 Step Description Links 1 Determine if the battery is faulty Determine If the Battery Is Faulty on page 75 2 Remove all data cables Remove a Data Cable on page 68 3 P...

Page 84: ...ft of SYS MB c Type where number is the number to the left of SYS MB For example show d targets SP faultmgmt show d targets SP faultmgmt SP faultmgmt Targets 0 SYS MB show d properties SP faultmgmt number faults 0 show d properties SP faultmgmt 0 faults 0 SP faultmgmt 0 faults 0 Properties class fault chassis device battery low sunw msg id DCSIB 8000 45 uuid 82e90599 8650 47dc b613 1e602607441b ti...

Page 85: ...lty replace it See Replace the Battery on page 78 6 If you are unable to determine if the battery is faulty seek further information See Detecting and Managing Faults on page 1 Related Information Determine If a Power Supply Is Faulty on page 41 Determine If a Fan Is Faulty on page 55 Remove the Gateway From the Rack Note This procedure assumes that you have removed all data cables from the gatewa...

Page 86: ...ve a Power Supply on page 47 Remove a Fan on page 60 Remove a Data Cable on page 68 Replace the Battery on page 78 Replace the Battery Note This procedure assumes that you have removed the Sun Network QDR InfiniBand Gateway Switch from Oracle from the rack If not see Remove the Gateway From the Rack on page 77 1 Identify the prerequisite and subsequent service tasks you must perform in conjunction...

Page 87: ... eight screws that secure the long front brackets at the front sides of the gateway chassis 4 Remove the 16 screws that secure the top cover to the chassis There are five screws on each side and six screws across the top front of the cover ...

Page 88: ...rk QDR InfiniBand Gateway Switch Service Manual for Firmware Version 2 1 March 2013 5 Slide the cover forward and lift it off 6 Depress the clip that retains the battery and release the battery from the main board ...

Page 89: ...Servicing the Battery 81 7 Properly dispose of the old battery 8 Unwrap the replacement battery from its antistatic packaging 9 Install the replacement battery into the main board with the side up ...

Page 90: ...rvice Manual for Firmware Version 2 1 March 2013 10 Orient the cover over the chassis and lower it in place 11 Slide the cover rearward so that it engages at the rear panel Ensure that the screw holes in the cover align with the holes in the chassis ...

Page 91: ...vicing the Battery 83 12 Use a No 1 Phillips screwdriver to install the 16 screws that secure the cover to the chassis 13 Use eight screws to attach the two front brackets to the front sides of the chassis ...

Page 92: ...Use eight screws to attach the two C shaped brackets to the rear sides of the chassis 15 Install the gateway into the rack Refer to Gateway Installation installing the gateway into the rack Related Information Install a Power Supply on page 49 Install a Fan on page 61 Install a Data Cable on page 72 ...

Page 93: ...s command 51 clearable fault targets 11 CLI displaying faulty components 8 9 command checkpower 51 checkvoltages 51 components alarm state 14 alarm targets 15 determining alarm state 13 managing faulty 7 resetting 10 D data cable features 66 inspecting 65 connectors 67 hardware 67 transceivers 67 installing 72 removing 68 servicing 65 detecting faults 1 determining component alarm state 13 faulty ...

Page 94: ...ulty 55 features 57 inspecting 57 connector 59 hardware 58 installing 61 LED 2 removing 60 servicing 55 faults clearing manually 10 detecting 1 identifying in log 12 managing 1 faulty battery 75 fan 55 power supply 41 faulty components 8 9 features data cable 66 Ethernet cable 66 fan 57 InfiniBand cable 66 power supply 43 front status LEDs 2 G gateway powering off 46 removing from rack 77 I identi...

Page 95: ...accessing NET MGT port 34 35 out of range speed sensor 27 temperature sensor 25 voltage sensor 22 P paddle boards 66 power supply checking LEDs 6 determining faulty 41 features 43 inspecting 43 connectors 45 hardware 45 installing 49 LEDs 2 powering off 46 powering on 51 removing 47 servicing 41 powering off gateway 46 power supply 46 powering on power supply 51 presence sensor alarm conditions 31...

Page 96: ...iniBand cable 65 power supply 41 speed sensor evaluating 26 out of range 27 values 27 state sensor alarm conditions 30 evaluating 29 system alarm state 14 alarm targets 15 determining alarm state 13 T targets alarm state component 15 system 15 temperature sensor evaluating 24 out of range 25 values 24 tools 39 U understanding service procedures 37 V values indicator state 33 speed sensor 27 temper...

Reviews: