BladeCenter QS21 Type 0792 Problem Determination and Service Guide

Summary of Contents for QS21 - BladeCenter - 0792

Page 1: ...BladeCenter QS21 Type 0792 Problem Determination and Service Guide ...

Page 4: ...x C, "Notices," on page 113 and the Warranty and Support Information on the Documentation CD. Fifth Edition (September 2008). © Copyright International Business Machines Corporation 2006, 2008. US Government Users Restricted Rights: Use, duplication, or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. ...

Page 5: ...server firmware levels 17 Updating the BMC firmware 18 Using the BMC update package 18 Using the Advanced Management Module 18 Installing the system firmware 20 The firmware update package 21 Using the package 21 Updating the system firmware automatically 22 Installing the firmware manually 22 Updating the system firmware images 23 Updating the optional expansion card firmware 23 Integrating the G...

Page 6: ...2 Problems indicated by the system board LEDs 54 Power problems 57 Power throttling 57 Network connection problems 57 Service processor problems 58 Software problems 58 Recovering the system firmware code 59 Checking the boot image 59 Booting from the TEMP image 59 Recovering the TEMP image from the PERM image 59 Supported boot media 59 Booting the system 60 Diagnostic programs and messages 62 Run...

Page 7: ...cycling and disposal 115 Battery return program 116 Electronic emission notices 117 Federal Communications Commission FCC statement 117 Industry Canada Class A emission compliance statement 118 Avis de conformité à la réglementation d Industrie Canada 118 Australia and New Zealand Class A statement 118 United Kingdom telecommunications safety requirement 118 Deutschsprachiger EU Hinweis Hinweis fü...

Page 9: ...leert eerst de veiligheidsvoorschriften Ennen kuin asennat tämän tuotteen lue turvaohjeet kohdasta Safety Information Avant d installer ce produit lisez les consignes de sécurité Vor der Installation dieses Produkts die Sicherheitshinweise lesen Prima di installare questo prodotto leggere le Informazioni sulla Sicurezza Les sikkerhetsinformasjonen Safety Information før du installerer dette produk...

Page 10: ...rds such as loose or missing hardware. To inspect the product for potential unsafe conditions, complete the following steps: 1. Make sure that the power is off and the power cord is disconnected. 2. Make sure that the exterior cover is not damaged, loose, or broken, and observe any sharp edges. 3. Check the power cord: • Make sure that the third-wire ground connector is in good condition. Use a meter to measur...

Page 11: ...disconnected from a circuit. Check it to make sure that it has been disconnected. • If you have to work on equipment that has exposed electrical circuits, observe the following precautions: Make sure that another person who is familiar with the power-off controls is near you and is available to turn off the power if necessary. When you are working with powered-on electrical equipment, use only one hand ...

Page 12: ...ts in this documentation before performing the instructions. Read any additional safety information that comes with the blade server or optional device before you install the device. ...

Page 13: ...equipment when there is evidence of fire, water, or structural damage. • Disconnect the attached power cords, telecommunications systems, networks, and modems before you open the device covers, unless instructed otherwise in the installation and configuration procedures. • Connect and disconnect cables as described in the following table when installing, moving, or opening covers on this product or attached...

Page 14: ...hium battery, replace it only with the same module type made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of. Do not: • Throw or immerse into water • Heat to more than 100°C (212°F) • Repair or disassemble. Dispose of the battery as required by local ordinances or regulations. ...

Page 15: ...he device. • Use of controls or adjustments or performance of procedures other than those specified herein might result in hazardous radiation exposure. DANGER: Some laser products contain an embedded Class 3A or Class 3B laser diode. Note the following: Laser radiation when open. Do not stare into the beam, do not view directly with optical instruments, and avoid direct exposure to the beam. Class 1 Laser...

Page 16: ...vice and the power switch on the power supply do not turn off the electrical current supplied to the device. The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source. ...

Page 17: ...tact a service technician. Statement 13 DANGER: Overloading a branch circuit is potentially a fire hazard and a shock hazard under certain conditions. To avoid these hazards, ensure that your system electrical requirements do not exceed branch circuit protection requirements. Refer to the information that is provided with your device for electrical specifications. Statement 21 CAUTION: Hazardous energy i...

Page 18: ...handling. WARNING: Contact with the cord of this product, or with the cords of accessories sold with this product, can expose you to lead, a chemical element that the State of California in the United States considers to cause cancer, birth defects, and other reproductive harm. Wash your hands after using the product. ...

Page 19: ...d User's Guide: This printed document contains general information about the blade server, including how to install supported options and how to configure the blade server. • Safety Information: This document is in Portable Document Format (PDF) on the Documentation CD. It contains translated caution and danger statements. Each caution and danger statement that appears in the documentation has a number th...

Page 20: ...the instruction or situation in which damage could occur. • Caution: These statements indicate situations that can be potentially hazardous to you. A caution statement is placed just before the description of a potentially hazardous procedure step or situation. • Danger: These statements indicate situations that can be potentially lethal or extremely hazardous to you. A danger statement is placed just b...

Page 21: ...switch in the rear of the chassis and various options to attach storage to that integrated SAS switch. An optional SAS expansion card is available for the BladeCenter QS21. Storage can be attached via the external SAS host controller. The BladeCenter QS21 supports the SAS drives of the IBM System Storage DS3200 and the IBM System Storage EXP3000 expansion unit. Check the IBM BladeCenter support Web si...

Page 22: ...unit or power off the BladeCenter unit. To avoid loss of data, shut down the Linux operating system before you turn off the blade server. Shut down the operating system by entering the shutdown -h now command at the command prompt, or by choosing shutdown if you are using a graphical user interface (GUI). See your operating system documentation for additional information about shutting down the operating ...

Page 23: ...configure the Advanced Management Module to turn off the blade server automatically if the system is not operating correctly. Note: After turning off the blade server, wait at least 5 seconds before turning it on again. ...

Page 24: ...dule or through IBM Director Console. Blade error LED: This amber LED lights when a system error has occurred in the blade server. Power control button: Press this button to turn the blade server on or off. The power control button takes effect only if local power control is enabled for the blade server. Local power control is enabled and disabled through the BladeCenter Advanced Management Module Web int...

Page 25: ...location LED can be turned off through the BladeCenter Management Module Web interface. System board LEDs: The BladeCenter QS21 has status LEDs on the system board to indicate the health of various components. Some are within the light box, while others are in different locations. A lit LED indicates an error condition. Complete information about the LEDs can be found in Troubleshooting charts on page 5...

Page 26: ...rror LED JDIM01; error LED JDIM00; light path diagnostics switch; light box (labels NMI, CPU, SBRD, TEMP, LP); temperature fault LED; system board LED; NMI error LED; CPU fail LED; light path diagnostics LED; JDIM00, JDIM01, JDIM10, and JDIM11 slots. Figure 3. System board LEDs. Connectors at J200, J201, J22, and JFC_18. Figure 4. Locations of the expansion option connectors on the...

Page 27: ... to the blade server. The command line interface: See "Using the command line interface" on page 10 for further information. Serial over LAN (SOL): This is similar to the serial interface but allows you to connect to the blade server over the network. See "Using Serial over LAN" on page 10 for further information. The serial interface: You can connect a PC or compatible terminal directly to the BladeCenter H o...

Page 28: ...ant for your Web session. The BladeCenter management and configuration window opens. For additional information, see the IBM BladeCenter Advanced Management Module User's Guide. Using the command line interface: The IBM BladeCenter Advanced Management Module also provides a command line interface to provide direct access to BladeCenter management functions. You can use this as an alternative to using th...

Page 29: ...t Module. This is the System Management Services (SMS) utility program. The SMS utility program allows you to view and update the VPD, change the boot list, and set network parameters. Starting SMS: Complete the following steps to start SMS: 1. Using a Telnet or SSH client, connect to the Advanced Management Module external Ethernet interface IP address. 2. When prompted, enter a valid user ID and password. The ...

Page 30: ...about the machine type or model, serial number, and the universal unique ID. Complete the following steps to see this information: 1. Start SMS by completing the above steps. The SMS menu appears: PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved. Main Menu: 1. Select Language 2. Setup Remote IPL (Initial Program Load) 3. Change SCSI Settings 4. Select Console 5. S...

Page 31: ...the process stops to allow you to enter the machine type or model and serial number. Boot does not continue until the information is provided. To enter new FRU information, complete the following steps: 1. Using a Telnet or SSH client, connect to the Advanced Management Module external Ethernet interface IP address. 2. When prompted, enter a valid user ID and password. The default management module user ID ...

Page 32: ...ctions on the screen and press Enter to continue. 5. You must confirm the model number: PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved. Number entered is: 1234567. Accept number? (Enter y or Y to accept, or n or N to decline.) Select Navigation key: Type y or Y and press Enter to confirm the number. 6. At the following screen, type the serial number. ...

Page 33: ...G Accept number? (Enter y or Y to accept, or n or N to decline.) Select Navigation key: Type y or Y and press Enter to confirm the number. This completes the process, and the blade server continues to boot as normal. Updating the system and BMC firmware: The firmware consists of two distinct packages: • A firmware package for the baseboard management controller (BMC). This is referred to as the BMC firmware. • A...

Page 34: ...bility to restart the server at a time when it is most convenient to do so. As a best practice, use the online update packages to perform all of your basic update functions. IBM periodically makes updates to both BMC and system firmware. These may be downloaded from http://www.ibm.com/support/us/en/. Note: To avoid problems and to maintain proper system performance, always make sure that both the BMC firmwa...

Page 35: ...and revision level of both the system firmware (BIOS) and the BMC firmware. In the example above, the system firmware (or BIOS) version is QB01020000 and the BMC firmware is BNBT06b. Compare this information to the firmware information provided at http://www.ibm.com/support/us/en/. If the two match, then the blade server has the latest firmware. If not, download the firmware package from the IBM Support Web sit...
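
The comparison described on page 35 can be sketched in a few lines of Python. This is only an illustration, not an IBM tool: the function name `firmware_is_current` and the "latest" BMC level are hypothetical, while the installed levels (QB01020000 and BNBT06b) are the ones quoted in the text. It assumes that levels are compared as exact strings, which is what "if the two match" suggests.

```python
def firmware_is_current(installed: dict, latest: dict) -> dict:
    """Return, per component, whether the installed level equals the latest level.

    Levels are compared as exact strings after trimming whitespace
    (an assumption; the guide only says the two levels must match).
    """
    return {
        name: installed.get(name, "").strip() == level.strip()
        for name, level in latest.items()
    }


# Installed levels as quoted in the guide's example.
installed = {"system": "QB01020000", "bmc": "BNBT06b"}
# Hypothetical levels read from the support site; the BMC value is invented.
latest = {"system": "QB01020000", "bmc": "BNBT07a"}

status = firmware_is_current(installed, latest)
for name, ok in status.items():
    print(f"{name}: {'up to date' if ok else 'update needed'}")
```

Any component reported as needing an update would then be handled with the update procedures on the following pages.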

Page 36: ...plete the following steps to update the BMC firmware from the Linux command prompt: 1. Check the README that comes with the BMC firmware, as it contains specific information about that particular firmware release. 2. Boot the blade server and the operating system. 3. Download the package from the IBM support site at http://www.ibm.com/support/us/en/. The update package has a .sh extension. 4. Change to the dire...

Page 37: ...he blade server you want to update (target) and browse to the firmware image file. 6. Click Update. 7. The validity of the image is checked, then the following screen appears. ...

Page 38: ...shooting on page 51 for further information about troubleshooting the BladeCenter QS21 blade server. You can update the system firmware: • Through IBM Director. See the IBM Director documentation on the IBM Director CD for further information. • Using the update package available from http://www.ibm.com/support/us/en/. See "Updating the system firmware automatically" on page 22 for further information on ho...

Page 39: ...This has a .chg extension. • A file containing the update package. This has a .sh extension. • A readme file for the update package. This contains specific installation and configuration information. • An XML file. This file is for use by IBM Systems Management tools, including IBM Director Update Manager, UpdateXpress CD, and UpdateXpress System Pack Installer. Using the package: The package consists of a fi...

Page 40: ...ructions. Installing the firmware manually: If you cannot update the firmware using the update_flash script, it is possible to update the firmware manually. You can use rtas_flash over /proc. Complete the following steps to install the firmware manually: 1. Download the update package from http://www.ibm.com/support/us/en/. 2. Extract the system firmware image package. At the command prompt, enter: update package...

Page 41: ...e are two commands you can use to update an old image on PERM: • From the Linux prompt, issue the following command: update_flash -c Note: The script checks whether the board has booted from the TEMP image. If not, the script does not complete. • From the Linux prompt, issue the following command: echo 0 > /proc/rtas/manage_flash For more information on booting from the TEMP or PERM images, see Recovering the s...
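
The second command on page 41 writes the value 0 to a control file under /proc. A minimal Python sketch of the same write is shown here as an illustration. The function name `commit_temp_to_perm` is hypothetical, and the path assumes that the command in the text reads `echo 0 > /proc/rtas/manage_flash` once the punctuation lost in this copy is restored. On a real blade server the write requires root privileges and should be attempted only after booting from the TEMP image, as the note on page 41 describes.

```python
from pathlib import Path

# Control file named by the echo command in the guide (reconstructed path;
# treat it as an assumption and confirm it on the target system).
MANAGE_FLASH = Path("/proc/rtas/manage_flash")


def commit_temp_to_perm(control_file: Path = MANAGE_FLASH) -> None:
    """Write "0" followed by a newline, mirroring `echo 0 > <control_file>`."""
    control_file.write_text("0\n")


# Usage on the blade server, as root (not executed here):
#     commit_temp_to_perm()
```

Taking the path as a parameter keeps the function testable against an ordinary file before it is pointed at the real control file.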

Page 42: ...ch modules that are mentioned in this section, see the documentation that comes with the Ethernet switch module that you are using. Updating the Ethernet controller firmware: To update the Ethernet controller firmware, you must download an update package from http://www.ibm.com/support/us/en/. This section describes how to use the update package to install the firmware update. The update package consists o...

Page 43: ...irectory where you have downloaded the package. 4. Run the package with the -u option. Using the example from above, at the command prompt enter: brcm_fw_nic_2 0 3 e 1_rhel5_cell sh -u During the update process, messages similar to the following appear on the console: root c4b14 brcm 2 0 3 ppc brcm_fw_nic_2 0 3 e 1_rhel5_cell sh -u IBM Ethernet Firmware Update Tool Version 1 0 2 Warning No Broadcom NetXtrem...

Page 44: ...agement in the BladeCenter Management Module Web interface. 3. Enable only one of the Ethernet controller ports on the blade server. Note the designation that the blade server operating system has for the controller port. 4. Ping an external computer on the network connected to the Ethernet switch module. If you can ping the external computer, the Ethernet controller port that you enabled is associated w...

Page 45: ... service technicians. For information about the terms of the warranty and getting service and assistance, see Warranty and Support Information. The following table lists which replaceable components are available for the BladeCenter QS21. Description, FRU No., Tier 1 CRU No., Tier 2 CRU No.: DIMM VLP 512 MB DDR2 I/O Buffer, 39M5860; Cisco 4X Infiniband Expansion Card for IBM BladeCenter, 32R1763; InfiniBand 4X D...

Page 47: ... You do not have to turn off the blade server or disconnect the BladeCenter unit from power to install or replace any of the hot-swappable modules on the rear of the BladeCenter unit. • Before you remove a hot-swappable blade server from the BladeCenter unit, you must shut down the operating system on it by typing the shutdown -h now command or choosing the shut down option from your GUI. See Turning ...

Page 48: ...t. Movement can cause static electricity to build up around you. • Handle the device carefully, holding it by its edges or its frame. • Do not touch solder joints, pins, or exposed printed circuitry. • Do not leave the device where others can handle and damage it. • While the device is still in its static-protective package, touch it to an unpainted metal part of the BladeCenter chassis for at least 2 seco...

Page 49: ...swappable bays. Therefore, you can install or remove the blade server without removing power from the BladeCenter unit. However, you must turn off the blade server before removing it from the BladeCenter unit. Complete the following steps to remove the blade server: 1. Read the safety information beginning on page vii and "Installation guidelines" on page 29. 2. If the blade server is operating, the power on ...

Page 50: ...er and lift the outer cover open (see Figure 6). 4. If you want to remove the cover, carefully lift it from the cover pins and set it aside (see Figure 6). Statement 21 CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server. Removing the BladeCenter PCI Express I/O Expansion Unit: You must remove BladeCente...

Page 51: ...us energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server. Installing the optional InfiniBand card: The InfiniBand card connects to the high-speed connector on the system board, using the two expansion card locator pins to assist with fitting and locking in place. Use the blue handling areas to handle the card and when...

Page 52: ... the connector cover. 6. Locate the expansion card locator pins at the back of the system board. 7. Locate the connector and ball socket on the InfiniBand card. Figure 8. InfiniBand card handling areas (labels: locator pin holes, locking clip, handling areas). Figure 9. Expansion card connector, locator pins, and ball stud (labels: expansion card standoffs with locator pins, high-speed connector, ball stud). ...

Page 53: ... If you do not want to install any other options, replace the cover and insert the BladeCenter QS21 into the BladeCenter unit. Attention: The connectors on the system board and the InfiniBand card are not designed for repeated removal or replacement of components. Avoid removing the InfiniBand card once it is in position. Figure labels: connector, locator pin holes, ball socket, locking clip. Figure 10. InfiniBand card re...

Page 54: ... a single pair of DIMMs, you must use slots JDIM00 and JDIM11. The BladeCenter QS21 supports VLP DDR2 512 MB DIMMs only. Note: The DIMMs are used as memory for the I/O buffers only. You cannot increase the size of system memory, which is fixed at 1 GB for each Cell/B.E. processor. To install extra I/O buffer memory, complete the following steps: 1. Shut down the BladeCenter QS21. 2. Remove the BladeCenter QS21 ...

Page 55: ...ntil the retaining clips snap into position. Make sure that the clips are locked properly. 8. Repeat steps 6 and 7 until you have installed all the optional DIMMs. 9. Ensure that all unused DIMM slots are fitted with DIMM fillers. 10. If you do not want to install any other options, replace the cover and insert the BladeCenter QS21 into the BladeCenter unit. Replacing DIMM fillers: For the BladeCenter QS21 ...

Page 56: ...o position. 7. Repeat step 6 until all unused slots are fitted with DIMM fillers. 8. Replace the cover and insert the BladeCenter QS21 into the BladeCenter unit. Installing the SAS expansion card: The BladeCenter QS21 does not have any built-in disk storage. The SAS expansion card allows you to connect storage to the BladeCenter QS21. Use the blue handling areas to handle the card. Complete the following s...

Page 57: ...em board engages with the ball socket on the SAS expansion card. 8. If you do not want to install any other options, replace the cover and insert the BladeCenter QS21 into the BladeCenter unit. Figure 16. SAS expansion card connector and ball stud location (labels: connectors for SAS expansion card, ball stud). Figure 17. SAS expansion card reverse side (labels: connectors, ball socket). Figure label: expansion card. Figure 18. SAS expansio...

Page 58: ...delines on page 29. 2. Remove the blade server cover and set it aside. See "Opening and removing the blade server cover" on page 32 for further information. 3. Remove the connector cover or any optional card from the high-speed connector. Figure 9 on page 34 shows the location of the high-speed connector. 4. Lower the expansion unit so that the slots at the rear slide down onto the cover pins at the rear of...

Page 59: ...s on page 29. 2. Open the blade server cover. 3. Carefully disconnect the control panel cable from the control panel connector. 4. Press the front bezel release on both sides of the system board and pull the front bezel assembly away from the blade server. 5. Store the front bezel assembly in a safe place. Replacing the system board base and planar. Figure labels: bezel assembly release (two), bezel, blade ...

Page 60: ...e defective system board. See "Adding FRU information" on page 13 for details. 13. Configure the replacement blade server to boot from the same device as the original defective unit. See the QS21 Installation and User's Guide for details. Note: Provided that the options on the new blade server are the same as on the old one, you do not have to reinstall or reconfigure the operating system but simply configure the ...

Page 61: ...Installation guidelines on page 29. 2. Follow any special handling and installation instructions that come with the battery. 3. If the blade server is operating, shut down the operating system by typing the shutdown -h now command or by choosing shut down from the GUI. If the blade server was not powered off, press the power control button behind the blade server control panel door to turn off the blade s...

Page 62: ...ttery clip holds the battery securely. 10. Close the blade server cover (see "Closing the blade server cover" on page 49). Statement 21 CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server. 11. Reinstall the blade server into the BladeCenter unit. 12. Turn on the blade server; see Turning on the blade serve...

Page 63: ...bly Release; Bezel; Blade Cover; Control Panel Connector; Control Panel Cable; I/O Expansion Option. NOTES: DIMM: DIMM Slot 00, DIMM Slot 01, DIMM Slot 11, DIMM Slot 10. Light Path Diagnostics Button: Press button to find faults on the system board. If a memory LED is on, reseat the component. If it is still on, replace the component. If any of the other LEDs are on, check the to identify and solve the problem Probl...

Page 64: ...tion the replacement ball stud over the hole and screw it into position, taking care not to over-tighten, as this might damage the system board. ...

Page 65: ...ore installing the blade server. 4. Reinstall the blade server into the BladeCenter unit. 5. Turn on the blade server. See "Turning on the blade server" on page 3 for further information. 6. If you have replaced the battery or the system board assembly, reset the system date and time through the operating system that you installed. For additional information, see your operating system documentation. Note: If yo...

Page 66: ...arefully slide the front bezel assembly onto the blade server, as shown in Figure 23, until it clicks into place. Note: Make sure that you do not pinch any cables when you reinstall the front bezel assembly. Figure 23. Reinstalling the front bezel assembly (labels: bezel assembly release, bezel, blade cover, control panel connector, control panel cable, blade cover release). ...

Page 67: ...slots at the rear slide down onto the pins at the rear of the blade server, as shown in Figure 24. Before closing the cover, make sure that all components are installed and seated correctly and that you have not left loose tools or parts inside the blade server. 4. Carefully close the cover, as shown in Figure 24, until it clicks into place. Input/output connectors and devices: The BladeCenter unit contains t...

Page 69: ...o the BladeCenter unit. • All components are connected correctly. • The BladeCenter QS21 has the latest firmware updates. These include: BMC; system; Gigabit Ethernet controller; SAS expansion card, if installed; InfiniBand high-speed expansion card, if installed. Basic checks: If you install the blade server in the BladeCenter unit and the blade server does not start, always perform the following basic checks...

Page 70: ...on. See "Related documentation" on page 1 for additional information. For the latest editions of the IBM BladeCenter documentation, go to http://www.ibm.com/support/us/en/ on the World Wide Web. Troubleshooting charts: The following tables list problem symptoms and suggested solutions. If you cannot find the problem in the troubleshooting charts, or if carrying out the suggested steps does not solve the problem...

Page 71: ...ion LED remains on until turned off by Advanced Management Module or through IBM Director Console. Check Advanced Management Module to see what the problem is. See the BladeCenter Management Module User's Guide for further information about the error. Activity LED (green): There is network activity. No action required. For further information about troubleshooting networks, see Network connection problems ...

Page 72: ...m or use slots 1 5. 3. Go to "Power problems" on page 57. Problems indicated by the system board LEDs: The blade server must be removed from the BladeCenter unit and the cover removed before you can use the light path LEDs for diagnostics. To activate the light box and the other light path LEDs, press the light path diagnostics switch. The location of each LED on the system board is shown in the table belo...

Page 73: ...cates Ethernet 0 is active and sending or receiving packets. BE0_PLL_LOCK (green, D8): Indicates the phase-locked loop of Cell/B.E. 0 is working. BE1_PLL_LOCK (green, D13): Indicates the phase-locked loop of Cell/B.E. 1 is working. MM_SELECT_A (green, D19): Indicates Advanced Management Module A is active. MM_SELECT_B (green, D18): Indicates Advanced Management Module B is active. Light path LEDs: DIMM at JDIM11 error Yell...

Page 74: ...NMI pinhole reset on the front panel has been pressed. Pressing the reset causes the operating system to call the system debugger. CPU fail (yellow): One of the Cell/B.E. processors has failed. Contact your IBM service representative, as the system board needs replacement. System board (yellow): A critical error has occurred in a component on the system board. Contact your IBM service representative, as the syst...

Page 75: ...ks, you may need to have a trained service technician replace the system blade assembly. Power throttling: Be aware that the BladeCenter unit automatically reduces the BladeCenter QS21 processor speed if certain conditions are met. One such condition is temperature thresholds being exceeded, for example when the blade server is running in acoustic mode. This throttling occurs independent of your power c...

Page 76: ...ndetermined problems on page 95. Software problems. Symptom: You suspect a software problem. Suggested action: 1. To determine whether the problem is caused by the software, make sure that: • The blade server has the minimum memory that is needed to use the software. For memory requirements, see the software documentation. • The software is designed to operate on the blade server. • Other software works on th...

Page 77: ...blade system management processor from the Advanced Management Module. 3. Turn on the blade server. Note: If the TEMP side is corrupted, the boot times out and an automatic reboot occurs after switching to the PERM side. If the blade server does not restart, you must replace the system board assembly. Contact a service support representative for assistance. Recovering the TEMP image from the PERM image: To ...

Page 78: ...e detailed below. 1. The first part of the boot process shows the system name and build date. You see an error at this point if the firmware image is corrupted: QS21 Firmware Starting Check ROM OK Build Date Apr 24 2007 13 43 46 FW Version QB 1 6 0 0 Press F1 to enter Boot Configuration SMS 2. Memory initialization follows. Note: It can take several seconds to initialize the RAMBUS memory. 3. The memo...

Page 79: ... to continue booting the system. Type reset-all and press Enter to reboot the system. disable nvram logging done 4. The next screen displays system information. It shows revision information about the chip set, SMP size, boot date and time, and the available memory: SYSTEM INFORMATION Processor Cell B E TM DD3 2 3200 MHz I O Bridge Cell BE companion chip DD2 x Timebase 26666 kHz internal SMP Size 2 4 threads ...

Page 80: ... can also provide diagnostics for the following system components: • Baseboard Management Controller • Memory stress • CPU stress. Additionally, DSA creates a merged log that includes events from all collected logs. All collected information can be output as a compressed XML file that can be sent to IBM Service. Additionally, you can view the information locally through a generated text report file. Opti...

Page 81: ... will not occur the next time you run the diagnostic programs. If there are multiple error codes or light path diagnostics LEDs that indicate a microprocessor error, the error might be in a microprocessor or in a microprocessor socket. See Table 20 on page 90 for further information about diagnosing microprocessor problems. If the server stops during testing and you cannot continue, restart the server ...

Page 82: ...ctive action. 089-802-xxx (Abort): System resource availability error. 089-801-xxx (Abort): Internal program error. BMC test results. Table 6. BMC test results (columns: Test, Number, Status, Extended results, Actions). I2C test, 166-901-xxx, Fail: The BMC indicates a failure in the IPMB bus. 1. Turn off the system and disconnect it from power. The system must be removed from AC power in order to reset the BMC. 2. After 45 seconds ...

Page 83: ...e reported memory size is the same as the installed memory size, complete the following steps. Otherwise, go to step 8. a. Turn off the system and disconnect it from power. b. Reseat all the system DIMMs within the system. c. Reconnect the system to power and turn on the system. d. Run the test again. 8. Turn off the system and disconnect it from power. 9. Remove all the system memory. 10. Install the minimum memo...

Page 84: ...level and upgrade if necessary. The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware/VPD section for this component. The latest level firmware for this component can be found on the IBM Support Web site at http://www.ibm.com/support/us/en/. 6. Check Ethernet device firmware level and upgrade if necessary. The installed firmware level can be found in the DSA Diagnost...

Page 85: ...necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component The latest level firmware for this component can be found on the IBM Support Web site at http www ibm com support us en 6 Run the test again 7 If the test continues to fail refer to the other sections of this chapter for diagnosis and corrective action 166 905 xxx ...

Page 86: ...upport us en 6 Run the test again 7 If the test continues to fail refer to the other sections of this chapter for diagnosis and corrective action 166 802 xxx BMC Abort BMC I2C test canceled the test cannot be completed for an unknown reason 166 803 xxx BMC Abort BMC I2C test canceled the node is busy try later 166 804 xxx BMC Abort BMC I2C test canceled invalid command 166 805 xxx BMC Abort BMC I2...

Page 87: ...code can be found on the IBM Support Web site at http www ibm com support docview wss uid psg1SERV DSA 5 Check BMC firmware level and upgrade if necessary The installed firmware level can be found in the DSA Diagnostic Event Log within the Firmware VPD section for this component The latest level firmware for this component can be found on the IBM Support Web site at http www ibm com support us en ...

Page 88: ...MC I2C test canceled a command response could not be provided BMC initialization is in progress 166 822 xxx BMC Abort BMC I2C test canceled the destination is unavailable 166 823 xxx BMC Abort BMC I2C test canceled cannot execute the command insufficient privilege level 166 824 xxx BMC Abort BMC I2C test canceled cannot execute the command 166 000 xxx Pass Memory tests Table 8 Memory test results ...
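
The abort results listed in the BMC test excerpts above lend themselves to a small lookup table. The following Python sketch is illustrative only: it covers just the result numbers visible in these excerpts, and the function name, the dictionary layout, and the assumption that a result is written as three groups ("166", a three-digit result number, and a trailing "xxx" field) are not taken from the guide.

```python
# Partial table built only from the result numbers quoted in the excerpts.
# The key is the middle group of a "166 nnn xxx" result (assumed layout).
BMC_I2C_RESULTS = {
    "000": ("Pass", ""),
    "802": ("Abort", "the test cannot be completed for an unknown reason"),
    "803": ("Abort", "the node is busy, try later"),
    "804": ("Abort", "invalid command"),
    "822": ("Abort", "the destination is unavailable"),
    "823": ("Abort", "cannot execute the command, insufficient privilege level"),
    "824": ("Abort", "cannot execute the command"),
}


def describe_bmc_result(code: str) -> str:
    """Return a short description for a BMC I2C test result string."""
    # Accept either spaces or hyphens between the three groups.
    parts = code.replace("-", " ").split()
    if len(parts) != 3 or parts[0] != "166":
        raise ValueError(f"not a BMC I2C test result: {code!r}")
    status, detail = BMC_I2C_RESULTS.get(
        parts[1], ("Unknown", "result number is not in this partial table")
    )
    return f"{status}: {detail}" if detail else status
```

A call such as `describe_bmc_result("166 803 xxx")` would then report the "node is busy" abort. Any result number that is not listed falls through to the "Unknown" entry, so the full table in the guide remains the authority.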

Page 89: ...he system and disconnect it from power 4 Reseat the DIMMs 5 Reconnect the system to power and turn on the system 6 Run the test again 7 Execute the standard DSA memory diagnostic to validate all memory 8 If you cannot reproduce the problem contact your IBM technical support representative 202 801 xxx Abort Internal program error 1 Turn off and restart the system 2 Make sure that the system firmwar...

Page 90: ...rs and handling The following sections describe boot errors and actions you can take to resolve these errors Boot list The following table describes boot list errors Table 9 System firmware boot list errors Code Message Description Action E3400 It was not possible to boot from any device specified in the VPD The firmware found a valid VPD but was not able to find bootable code on any of the device...

Page 91: ... compatibility issues If the problem persists contact your IBM service representative E3407 Load failed Load or boot failed to load requested file from the device This is an informational message and may be preceded by one or more other error messages Based on the preceding error messages you may have to take an action on faulty hardware or use the Advanced Management Module to correct the system con...

Page 92: ...sks Configuration Boot Sequence If the problem persists contact your IBM service representative System firmware update errors The following table describes system firmware errors that can occur if there have been problems after an update Table 10 System firmware boot errors Code Message Description Action E4000 RTAS Flash unknown flash chip version The flash update code does not support the onboar...

Page 93: ...tive as the system board may need replacing E1200 System memory init failure during second pass calibration CPU halted Since the first pass calibration succeeded either the CPU or the system XDR memory could have a defective contact Power down then reboot the blade If this does not resolve the problem contact your IBM service representative as the system board may need replacing E1210 Memory contr...

Page 94: ...d E5020 USB Unknown media format The media is not recognized by the firmware Insert a suitable bootable CD E5030 USB Device communication error Firmware cannot communicate with the BladeCenter USB devices This could be a firmware or physical hardware problem Check v The Advanced Management Module for messages v The system firmware image is not corrupt See System firmware update errors on page 74 f...

Page 95: ...work boot errors The following table describes the network boot errors Table 13 Network boot errors Code Message Description Action E3000 net Could not read MAC address The firmware could not establish a communication socket for booting over the network due to an error while retrieving the MAC address of the network device Power down then reboot the blade server If this does not resolve the proble...

Page 96: ...server reported a file access violation Check the file name and the permissions of the file that should be downloaded E3011 net illegal TFTP operation The TFTP server is not able to handle the request There may be too many UDP ports open on the TFTP server Reboot the TFTP server and retry the transfer E3012 net unknown TFTP transfer ID The TFTP server could not assign the data to a UDP packet base...

Page 97: ...boot errors These error messages only appear if you have installed the optional SAS daughter card Table 14 SAS boot errors Code Message Description Action E4303 LSISAS1064 controller initialization failed The blade server firmware was not able to initialize the controller This could indicate a hardware blade server firmware or SAS expansion card firmware problem Try the following steps in order to fix...

Page 98: ... Ensure the SAS Expansion Card firmware and blade firmware version are at the correct level 5 If the error started after a SAS Expansion Card firmware upgrade or a blade server firmware upgrade consider a rollback to the previous firmware versions Check with the documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 6 Plug the SAS expansion card into a...

Page 99: ... SAS Expansion Card firmware and blade firmware version are at the correct level 5 If the error started after a SAS Expansion Card firmware upgrade or a blade server firmware upgrade consider a rollback to the previous firmware versions Check with the documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 6 Plug the SAS expansion card into another blad...

Page 100: ...re the SAS Expansion Card firmware and blade firmware version are at the correct level 5 If the error started after a SAS Expansion Card firmware upgrade or a blade server firmware upgrade consider a rollback to the previous firmware versions Check with the documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 6 Plug the SAS expansion card into anothe...

Page 101: ...with the documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 4 If the error started after an Open Firmware script change for example in a custom startup script verify the script 5 If the error started after activating an Open Firmware script for example by setting the use nvramrc configuration variable verify the script 6 Reproduce the...

Page 102: ... then remove and reinstall the blade server in the BladeCenter unit 3 If the error started after a blade server firmware update consider rolling back to the previous firmware version Check with the documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 4 Update the blade firmware 5 If the error started after an Open Firmware script change for example...

Page 103: ... documentation at http www ibm com systems bladecenter support to verify whether rollback is possible 4 Update the blade firmware 5 If the error started after an Open Firmware script change for example a custom startup script verify the script 6 If the error started after activating an Open Firmware script for example by setting the use nvramrc configuration variable verify the script 7 Reproduce the ...

Page 104: ...o read the disk from another blade server in the same BladeCenter unit This verifies SAS topology and remote storage operation as well Note Concurrent access to a disk from a different blade server might corrupt the disk's file system Check with a system administrator before attempting this step 2 Ensure the remote storage is configured correctly If configured as a RAID array check the RAID config...

Page 105: ...ace the DIMM with a DIMM with an SDRAM data width of 8 lanes E2031 Data error The DIMM is defective Replace the DIMM with a new one E2032 Address line error The DIMM is defective Replace the DIMM with a new one The following table shows the warning messages that may appear Table 16 I O DIMM warning messages Code Message Description Action W2081 Unsupported DIMM type not 512 MB The DIMM has a size ...

Page 106: ...ages Code Message Description Action E1001 Boot ROM CRC failure The firmware image was found to be inconsistent during bootup The inconsistency might be due to image corruption during flash update or might indicate a hardware problem The boot watchdog triggers Reject the malfunctioning flash image as described in Recovering the TEMP image from the PERM image on page 59 Power down then reboot the b...

Page 107: ... the blade If the problem persists contact your IBM service representative as the system board assembly may need replacement BMC firmware messages The following is a description of the BMC firmware messages that are sent to the Advanced Management Module Use the Advanced Management Module Web interface to view them No codes are associated with these messages However the status column indicates the...

Page 108: ... Warning Temperature 73 C Standard mode Cell B E processor 1 Temp above Warning Temperature 82 C Information Blade throttled Throttle Reduce Frequency This is a BladeCenter unit operation which reduces processor speed on the blade server concerned until the temperature has dropped to normal levels Performance mode Cell B E processor 0 Temp above Warning Temperature 80 C Warning Processor 1 BE0 Temp...

Page 109: ...bove Shut Off Temperature 100 C Error Processor 3 SB1 Temp critical fault Power Off The temperature of the system board has reached a critical level Cell B E companion chip 2 Temp above Shut Off Temperature 100 C Error Processor 4 SB2 Temp critical fault Power Off The temperature of the system board has reached a critical level Cell B E Events Processor Failure Cell B E processor 0 Checkstop Error...

Page 110: ...ing the state of all the voltage signals Power Off Reboot If the error condition continues report the exact error message to IBM service Voltage 10 not BE voltage Error Blade voltage fault None Other Events NMI reset button on Front panel pressed Error Front panel critical Interrupt Soft Reset None This is a user initiated action Boot WDT Timeout Firmware Corrupted Warning Firmware BIOS ROM Corrup...

Page 111: ...on PCIe Channel 0 NMI Error DIAGS SB1 NMI PCI E P00 Error Cell B E companion chip 2 indicates an error on PCIe Channel 0 NMI Error DIAGS SB0 NMI PCI E P01 Error Cell B E companion chip 1 indicates an error on PCIe Channel 1 NMI Error DIAGS SB1 NMI PCI E P01 Error Cell B E companion chip 2 indicates an error on PCIe Channel 1 NMI Error DIAGS SB0 NMI DIMM 00 Error Cell B E companion chip 1 indicates...

Page 112: ...he name QS21 event log txt 6 Create a tgz file using the following commands tar cvfz QS21 error log customer date tgz tmp PROBLEM txt tmp QS21 fw nvram img where customer Contains a short name of the customer date Contains the creation date 7 Provide IBM support with the tgz file Problem description The problem description must be added to tmp PROBLEM txt together with the following information v Cus...
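
The tgz step above can also be scripted. The sketch below uses Python's standard `tarfile` module to write a gzip-compressed archive from a list of files. The function name and its parameters are hypothetical, and because the exact file names and separators in the excerpt were lost in extraction, the caller supplies every path.

```python
import tarfile
from pathlib import Path


def bundle_support_files(archive_path, files):
    """Write the given files into a gzip-compressed tar archive (.tgz).

    archive_path and files are supplied by the caller; nothing here
    assumes the exact file names used in the service guide.
    """
    archive_path = Path(archive_path)
    sources = [Path(f) for f in files]

    # Check every input first so a missing file does not leave a
    # half-written archive behind.
    for source in sources:
        if not source.is_file():
            raise FileNotFoundError(source)

    with tarfile.open(archive_path, "w:gz") as archive:
        for source in sources:
            # Store only the file name, not the full directory path.
            archive.add(source, arcname=source.name)
    return archive_path
```

The excerpt names the archive after a short customer name and the creation date, so a caller would typically build `archive_path` from those two values before invoking the function.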

Page 113: ...nector is correctly seated on the system board see Removing the blade server front bezel assembly on page 41 for the location of the connector 4 If no LEDs on the control panel of the blade server are working replace the bezel assembly Try to turn on the blade server from the Advanced Management Module see the documentation for the BladeCenter unit and Advanced Management Module for more informati...

Page 114: ...ailure been reported before v Diagnostic program type and version level v Hardware configuration print screen of the system summary v BIOS code level v Operating system type and version level You can solve some problems by comparing the configuration and software setups between working and nonworking servers When you compare servers to each other for diagnostic purposes consider them identical onl...

Page 115: ...ules Elpida 512MB 3200 MHz XDRlibrary v0 32 Bin A C RevB DualDD Calibrate Done Test Done SYSTEM INFORMATION Processor Cell B E TM DD3 2 3200 MHz I O Bridge Cell BE companion chip DD2 x Timebase 26666 kHz internal SMP Size 2 4 threads Boot Date 2007 06 08 11 20 Memory 2048MB CPU0 1024MB CPU1 1024MB Press F1 to enter the SMS menu The SMS utility menu Select SMS tasks from the SMS utility main menu C...

Page 116: ...language that is used to display the SMS menus A screen similar to the following appears PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Select Language 1 ISO8859 1 English United States Navigation Keys M return to Main Menu N Next page of list ESC key return to previous screen X eXit System Management Services Type menu item number and press Ente...

Page 117: ...een similar to the following appears PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Network Parameters NET axon 10000000000 plb5 plb4 pcix 4000004600000000 ethernet 1 1 IP Parameters 2 Adapter Configuration 3 Ping Test 4 Advanced Setup DHCP Navigation Keys M return to Main Menu ESC key return to previous screen X eXit System Management Services T...

Page 118: ...cancel and return to the main menu press Esc Adapter Configuration This allows you to set network parameters for the adapter PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Adapter Configuration NET axon 10000000000 plb5 plb4 pcix 4000004600000000 ethernet 1 1 Speed Duplex 2 Spanning Tree Enabled 3 Protocol Navigation Keys M return to Main Menu ES...

Page 119: ...P address in turn Advanced Setup DHCP You do not need to use this option unless your network requires a specific block size or filename PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Advanced Setup DHCP NET axon 10000000000 plb5 plb4 pcix 4000004600000000 ethernet 1 1 DHCP Retries 255 2 TFTP Blocksize 512 3 TFTP Retries 5 4 TFTP Filename Navigati...

Page 120: ...to install or boot from the BladeCenter unit media tray you must first allocate it to the blade server using the Advanced Management Module PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Multiboot 1 Select Install Boot Device 2 Configure Boot Device Order Navigation Keys M return to Main Menu ESC key return to previous screen X eXit System Manage...

Page 121: ...boot devices in list order until it finds a boot device If it does not an error is generated and placed in the Advanced Management Module You may only list boot devices if they are allocated or available to the blade server For example to include the CD DVD drive in the BladeCenter media tray in the list first allocate it to the blade server using the Advanced Management Module To select boot dev...
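
The boot-order behavior described above (try each device in list order, stop at the first one that boots, and report an error if none does) can be sketched in a few lines of Python. This is a model of the described logic only, not the firmware's implementation. The device names, the `is_bootable` callback, and the exception class are hypothetical.

```python
from typing import Callable, Iterable


class NoBootDeviceError(Exception):
    """Raised when no device in the boot list can be booted."""


def pick_boot_device(
    boot_order: Iterable[str], is_bootable: Callable[[str], bool]
) -> str:
    """Return the first device in boot_order that is_bootable accepts."""
    for device in boot_order:
        # Devices are tried strictly in the order they were listed.
        if is_bootable(device):
            return device
    # Stand-in for the error that the excerpt says is generated and
    # placed in the Advanced Management Module.
    raise NoBootDeviceError("no bootable device found in the boot list")
```

For example, with a hypothetical list `["cdrom", "network"]` and a callback that accepts only `"network"`, the function skips the first entry and returns `"network"`.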

Page 122: ... type the number and press Enter To save your selection press M to return to the menu Firmware Boot Side Options Normally the BladeCenter QS21 boots from the Temporary side and you should not change this However there may be occasions for example boot failure where you must change the setting PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Firmwar...

Page 123: ...tory Creating common NVRAM partition C0880 Could not find SAS partition in NVRAM created Adapters on 000001460ec00000 00 0800 D 14e4 16a8 network ethernet 00 0900 D 14e4 16a8 network ethernet Adapters on 000001a040000000 00 0000 B 1014 032c pci Adapters on 000001a240000000 00 0000 B 1014 032c pci Navigation Keys M return to Main Menu N Next page of list ESC key return to previous screen X eXit Sys...

Page 124: ...hine type or model and serial number Boot does not continue until the information is provided To enter new FRU information complete the following steps 1 Using a Telnet or SSH client connect to the Advanced Management Module external Ethernet interface IP address 2 When prompted enter a valid user ID and password The default management module user ID is USERID and the default password is PASSW0RD ...

Page 125: ...ng to the instructions on the screen and press Enter to continue 5 You must confirm the model number PowerPC Firmware Version HEAD SLOF SMS 1 6 c Copyright IBM Corp 2000 2005 2007 All rights reserved Number entered is 1234567 Accept number Enter y or Y to accept or n or N to decline Select Navigation key Type y or Y and press Enter to confirm the number 6 At the following screen type the serial nu...

Page 126: ...ht IBM Corp 2000 2005 2007 All rights reserved Number entered is ABCDEFG Accept number Enter y or Y to accept or n or N to decline Select Navigation key Type y or Y and press Enter to confirm the number SAS Settings Use this option to configure or change the SAS settings if you have installed the IBM BladeCenter Boot Disk System Note You must use this option when configuring an IBM BladeCenter Boo...

Page 127: ...2007 All rights reserved Change SAS Boot Device Address Current SAS Disk Address Default 0 0 Navigation Keys M return to Main Menu ESC key return to previous screen X eXit System Management Services Type SAS Address in hexadecimal and press Enter or select navigation key The SAS address can be obtained from the Storage System Profile utility See the documentation that comes with your IBM BladeCent...
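
Because the SMS screen expects the SAS address in hexadecimal, it can help to validate a value before typing it in. The following Python sketch checks that a string contains only hexadecimal digits and converts it to an integer. The function name is hypothetical, and the excerpt does not state how many digits a SAS address has, so no length check is made.

```python
import string


def parse_hex_field(text: str) -> int:
    """Validate a hexadecimal string and return its integer value."""
    cleaned = text.strip()
    # Reject empty input and anything that is not a plain hex digit;
    # int(..., 16) alone would also accept signs and underscores.
    if not cleaned or any(ch not in string.hexdigits for ch in cleaned):
        raise ValueError(f"not a hexadecimal value: {text!r}")
    return int(cleaned, 16)
```

The same check would apply to any other field that the SMS menus ask for in hexadecimal.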

Page 128: ...return to previous screen X eXit System Management Services Type new LUN Id in hexadecimal and press Enter or select navigation key The LUN Id can be obtained from the Storage System Profile utility See the documentation that comes with your IBM BladeCenter Boot Disk System for more information about the Storage System Profile utility ...

Page 129: ...BM product The documentation that comes with BladeCenter systems also describes the diagnostic tests that you can perform Most BladeCenter systems operating systems and programs come with documentation that contains troubleshooting procedures and explanations of error messages and error codes If you suspect a software problem see the documentation for the software Using the documentation Informati...

Page 130: ... ibm com services us or see http www ibm com planetwide for support telephone numbers In the U S and Canada call 1 800 IBM SERV 1 800 426 7378 Hardware service and support You can receive hardware service through IBM Services or through your IBM reseller if your reseller is authorized by IBM to provide warranty service See http www ibm com planetwide for support telephone numbers or in the U S and...

Page 131: ...10504 1785 U S A INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION AS IS WITHOUT WARRANTY OF ANY KIND EITHER EXPRESS OR IMPLIED INCLUDING BUT NOT LIMITED TO THE IMPLIED WARRANTIES OF NON INFRINGEMENT MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE Some states do not allow disclaimer of express or implied warranties in certain transactions therefore this statement may not ap...

Page 132: ...he internal clock speed of the microprocessor other factors also affect application performance CD or DVD drive speed is the variable read rate Actual speeds vary and are often less than the possible maximum When referring to processor storage real and virtual storage or channel volume KB stands for 1024 bytes MB stands for 1 048 576 bytes and GB stands for 1 073 741 824 bytes When referring to ha...
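
The binary unit definitions quoted above for processor storage can be written out as a short Python sketch. The constant and function names are illustrative. Only the KB, MB, and GB values stated in the excerpt are used, and the definition that applies to the truncated case at the end of the excerpt is not covered.

```python
# Binary units as defined in the notice for processor storage,
# real and virtual storage, or channel volume.
KB = 1024
MB = 1024 * KB
GB = 1024 * MB

UNITS = {"KB": KB, "MB": MB, "GB": GB}


def to_bytes(value: int, unit: str) -> int:
    """Convert a size in KB, MB, or GB (binary units) to bytes."""
    try:
        return value * UNITS[unit.upper()]
    except KeyError:
        raise ValueError(f"unsupported unit: {unit!r}") from None
```

Under these definitions, the 2048 MB of memory shown in the sample boot screen corresponds to `to_bytes(2048, "MB")` bytes, which is the same value as `to_bytes(2, "GB")`.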

Page 133: ... a los propietarios de equipos a reciclar sus productos de TI Se puede encontrar información sobre las ofertas de reciclado de productos de IBM en el sitio web de IBM http www ibm com ibm environment products index shtml Notice This mark applies only to countries within the European Union EU and Norway This appliance is labeled in accordance with European Directive 2002 96 EC concerning waste elec...

Page 134: ...mation on disposal of batteries outside the United States go to http www ibm com ibm environment products index shtml or contact your local waste disposal facility In the United States IBM has established a return process for reuse recycling or proper disposal of used IBM sealed lead acid nickel cadmium nickel metal hydride and battery packs from IBM equipment For information on proper disposal of...

Page 135: ...entative For California Perchlorate material special handling may apply See http www dtsc ca gov hazardouswaste perchlorate The foregoing notice is provided in accordance with California Code of Regulations Title 22 Division 4 5 Chapter 33 Best Management Practices for Perchlorate Materials This product part may include a lithium manganese dioxide battery which contains a perchlorate substance Ele...

Page 136: ...chkeit in den EU Mitgliedsstaaten und hält die Grenzwerte der EN 55022 Klasse A ein Um dieses sicherzustellen sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen wenn das Produkt ohne Zustimmung der IBM verändert...

Page 137: ...roduct including the fitting of non IBM option cards This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to CISPR 22 European Standard EN 55022 The limits for Class A equipment were derived for commercial and industrial environments to provide reasonable protection against interference with licensed communication equipment Attenti...

Page 138: ...Japanese Voluntary Control Council for Interference VCCI statement Korean Class A warning statement ...

Page 139: ...9 memory 8 microprocessor 8 system board 8 controller enumeration 26 Ethernet 23 cover closing 40 49 cover continued opening 32 removing 32 D danger statements 2 device driver Ethernet controller 24 DIMM See I O DDR2 memory modules DIMM fillers installing 37 drive connectors 8 specifications 2 E electrical input 3 electronic emission Class A notice 117 environment 3 error messages BMC firmware 89 ...

Page 140: ...identifying problems 54 light box LEDs 54 Linux operating system 59 M media tray select button 6 memory specifications 2 3 memory module specifications 3 microprocessor specifications 3 miscellaneous parts kit 45 N network boot error messages 77 network connection identifying problems 57 NMI reset button 7 notes 2 notes important 114 notices 113 electronic emission 117 FCC Class A 117 notices and ...

Page 141: ... 4 storage support for local 3 supported boot media 59 system board connectors 8 LEDs 7 replacing 41 system boot 60 boot error list 72 boot errors 72 I O DIMM boot errors 86 network boot error messages 77 other error messages 88 SAS error messages 79 supported boot media 59 system firmware 15 PERM image 23 recovering 59 startup process errors 71 system firmware continued system firmware boot error...

Page 144: ... Part Number 42C4969 Printed in USA 1P P N 42C4969 ...
