Bull Escala BL460 Скачать руководство пользователя страница 168

 

148 

Escala BL460 - 

Problem Determination and Service Guide

 

2.6

 

Checkout procedure 

The checkout procedure is the sequence of tasks that you should follow to diagnose a 

problem in the blade server. 

2.6.1

 

About the checkout procedure 

Review this information before performing the checkout procedure. 

 

Read “Safety” on page v and the “

Installation guidelines

” on page 

201

 

The firmware diagnostic program provides the primary methods of testing the major 

components of the blade server. If you are not sure whether a problem is caused by 

the hardware or by the software, you can use the firmware diagnostic program to 

confirm that the hardware is working correctly. The firmware diagnostic program runs 

automatically when the blade server is turned on. 

 

A single problem might cause more than one error message. When this happens, 

correct the cause of the first error message. The other error messages usually will not 

occur the next time you run the diagnostic programs. 
Exception:  If there are multiple error codes or light path diagnostic LEDs that indicate 

a microprocessor error, the error might be in a microprocessor or in a microprocessor 

socket. See “

Microprocessor problems

” on page 

161

 for information about 

diagnosing microprocessor problems. 

 

If the blade server hangs on a POST checkpoint, see 

POST progress codes (checkpoints)

” 

on page 

71

.

 If the blade server is halted and no error message is displayed, see 

Troubleshooting tables

” on page 

158

 and “

Solving undetermined problems

” on page 

194

. For intermittent problems, check the management-module event log and 

POST 

progress codes (checkpoints)

” on page 

71

.

 

 

If the blade server front panel shows no LEDs, verify the blade server status and errors 

in the management module Web interface; also see “

Solving undetermined problems

” 

on page 

194

 

If device errors occur, see “

Troubleshooting tables

” on page 

158

Содержание Escala BL460

Страница 1: ...Escala BL460 Problem Determination and Service Guide ESCALA Blade REFERENCE 86 A7 81FB 00...

Страница 2: ......

Страница 3: ...ESCALA Blade Escala BL460 Problem Determination and Service Guide Hardware October 2009 BULL CEDOC 357 AVENUE PATTON B P 20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A7 81FB 00...

Страница 4: ...ical Publications you are invited to use the Ordering Form also provided at the end of this book Trademarks and Acknowledgements We acknowledge the rights of the proprietors of the trademarks mentione...

Страница 5: ...DIMMs 5 1 5 Blade server control panel buttons and LEDs 7 1 6 Turning on the blade server 10 1 7 Turning off the blade server 11 1 8 System board layouts 12 1 9 System board connectors 12 1 10 System...

Страница 6: ...65 2 10 12 Power problems 165 2 10 13 POWER Hypervisor PHYP problems 167 2 10 14 Service processor problems 169 2 10 15 Software problems 181 2 10 16 Universal Serial Bus USB port problems 181 2 11 Li...

Страница 7: ...lling a memory module 214 4 4 9 Removing the management card 216 4 4 10 Installing the management card 217 4 4 11 Removing and installing an I O expansion card 219 4 4 12 Removing the battery 224 4 4...

Страница 8: ...7 Figure 4 1 Removing the blade server from the Bull Blade Chassis Enterprise 203 Figure 4 2 Installing the blade server in a Bull Blade Chassis Enterprise 204 Figure 4 3 Removing the cover 206 Figure...

Страница 9: ...o C1645300 checkpoints 72 Table 2 16 C2001000 to C20082FF checkpoints 79 Table 2 17 C700xxxx Server firmware IPL status checkpoints 85 Table 2 18 CA000000 to CA2799FF checkpoints 85 Table 2 19 D1001xx...

Страница 10: ......

Страница 11: ...Preface vii Safety...

Страница 12: ...versions of the caution or danger statement in the Bull Safety Attention document For example if a caution statement begins with a number 1 translations for that caution statement appear in the Bull S...

Страница 13: ...Preface ix...

Страница 14: ...x Escala BL460 Problem Determination and Service Guide...

Страница 15: ...Preface xi...

Страница 16: ...xii Escala BL460 Problem Determination and Service Guide...

Страница 17: ...Preface xiii...

Страница 18: ...xiv Escala BL460 Problem Determination and Service Guide Guidelines for trained service technicians Inspecting for unsafe conditions...

Страница 19: ...Preface xv Guidelines for servicing electrical equipment...

Страница 20: ......

Страница 21: ...for your blade server Field replaceable unit FRU FRUs must be installed only by trained service technicians For information about the terms of the warranty and getting service and assistance see the...

Страница 22: ...ument are also in the Safety Attention document Each statement is numbered for reference to the corresponding statement in the Safety Attention document The following notices and statements are used i...

Страница 23: ...e Drive SSD P5IOC2 I O hub on board integrated features The baseboard management controller BMC is a flexible service processor FSP1 with Intelligent Platform Management Interface IPMI Serial over LAN...

Страница 24: ...rgy Scale thermal management for power management oversubscription throttling andenvironmental sensing Cluster support for eCluster 1350 Cluster Systems Management High performance computing HPC Open...

Страница 25: ...BL460 Both DIMMs in a pair must be the same size speed type and technology You can mix compatible DIMMs from different manufacturers Each DIMM within a processor support group 1 4 5 8 must be the sam...

Страница 26: ...6 Escala BL460 Problem Determination and Service Guide Figure 1 1 DIMM connectors...

Страница 27: ...lade Chassis Enterprise keyboard and video ports with the blade server Notes The operating system in the blade server must provide USB support for the blade server to recognize and use the keyboard ev...

Страница 28: ...rocessed then is lit when the ownership of the media tray has been transferred to the blade server It can take approximately 20 seconds for the operating system in the blade server to recognize the me...

Страница 29: ...the power status of the blade server in the following manner Flashing rapidly The service processor BMC is initializing the blade server Flashing slowly The blade server has completed initialization a...

Страница 30: ...he power on LED is flashing rapidly the service processor in the management module is initializing The power control button does not respond during initialization Note The enhanced service processor c...

Страница 31: ...ttons and LEDs on page 7 for the location Note The power control LED can remain on solidly for up to 1 minute after you push the power control button After you turn off the blade server wait until the...

Страница 32: ...em board in the blade server Figure 1 3 System board connectors Callout Escala BL460 server connectors 1 Operator panel connector 2 Expansion unit SMP connector 3 DIMM 1 4 connectors see Figure 1 5 fo...

Страница 33: ...Chapter 1 Introduction 13 Figure 1 4 shows individual DIMM connectors Figure 1 4 DIMM connectors...

Страница 34: ...ver to see any error LEDs that were turned on during error processing and use the following figure to identify the failing component Figure 1 5 System board LEDs Callout System board LEDs 1 Light path...

Страница 35: ...ses Hardware error checkers have these distinct attributes Continuous monitoring of system operations to detect potential calculation errors Attempted isolation of physical faults based on runtime det...

Страница 36: ...diagnostic LEDs on the system board to identify failing hardware If the system error LED on the system LED panel on the front or rear of the Bull Blade Chassis Enterprise is lit one or more error LEDs...

Страница 37: ...d alone Diagnostics CD to perform diagnostics on the Escala BL460 blade server no matter which operating system is loaded on the blade server However other supported operating systems might have diagn...

Страница 38: ...or checkpoints with location codes use the following table to identify the failing component when there is a hang condition For 8 digit codes not listed in Table 1 see Checkout procedure on page 148 T...

Страница 39: ...he error log the system reference code SRC and turn on the system attention LED The service processor logs the nine word eight digit per word error code in the management module event log Error codes...

Страница 40: ...how relative word positions The seventh word is the direct select address which is 77777777 in the example Table 2 2 Nine word system reference code in the management module event log Index Sev Source...

Страница 41: ...the second four characters designate the unit reference code URC The first character indicates the type of error In a few cases the first two characters indicate the type of error 1xxxxxxx System pow...

Страница 42: ...e 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 2610 Power good pGood fault 1 Go to Checkout procedure on page...

Страница 43: ...169 2629 1 5V reg_pgood fault 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229...

Страница 44: ...sis assembly on page 229 2649 Blade power fault 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis ass...

Страница 45: ...ocessor 2 VPD 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 8423 No processo...

Страница 46: ...of volumes and that the proper authority is granted 632BCFC5 A non recoverable error was detected while reading a virtual optical volume Resolve any errors on the Network File System server 632BCFC6...

Страница 47: ...on is required 632CCFF7 Informational system log entry only No corrective action is required 632CCFFE Informational system log entry only No corrective action is required 632CFF3D Informational system...

Страница 48: ...ing actions See Chapter 3 Parts listing on page 197 to determine which components are CRUs and which components are FRUs Attention code Description Action AA00E1A8 The system is booting to the open fi...

Страница 49: ...against the failing adapter For a Linux operating system boot the blade server using the stand alone Diagnostics CD or a NIM server then run diagnostics against the failing adapter AA060011 The firmw...

Страница 50: ...lated to an event or exception that occurred in the service processor firmware Table 2 10 describes error codes that might occur if POST detects a problem The description also includes suggested actio...

Страница 51: ...31 A problem occurred during the migration of a partition The migration of a partition did not complete Check for server firmware updates then install the updates if available 1132 A problem occurred...

Страница 52: ...ue to a validation error Go to Verifying the partition configuration on page 152 1225 A problem occurred during the startup of a partition The partition attempted to start up prior to the platform ful...

Страница 53: ...did not complete due to a copy error Go to Firmware problem isolation on page 185 2210 Informational system log entry only No corrective action is required 2220 Informational system log entry only No...

Страница 54: ...for server firmware updates then install the updates if available 3128 A problem occurred during the startup of a partition A return code for an unexpected failure was returned when attempting to que...

Страница 55: ...ing the startup of a partition There was an error writing the partition main storage dump to the partition load source The main store dump startup will continue Look for other errors and resolve them...

Страница 56: ...lve them 690A During the startup of a partition an error occurred while copying open firmware into the partition load area Go to Firmware problem isolation on page 185 7200 Informational system log en...

Страница 57: ...n on page 185 8140 Informational system log entry only No corrective action is required 8141 Informational system log entry only No corrective action is required 8142 Informational system log entry on...

Страница 58: ...the partition dump information then go to Firmware problem isolation on page 185 F004 Informational system log entry only No corrective action is required F005 Informational system log entry only No...

Страница 59: ...is necessary Continue running the system normally At the earliest convenient time or service window work with Bull Support to collect a platform dump and restart the system then go to Firmware problem...

Страница 60: ...ed 1160 Service processor failure 1 Go to Firmware problem isolation on page 185 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly...

Страница 61: ...tion is required 4788 Informational system log entry only No corrective action is required 5120 System firmware detected an error If the system is not exhibiting problematic behavior you can ignore th...

Страница 62: ...failure 1 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 2 If the problem persists use the PCI expansion card PIOCARD...

Страница 63: ...Informational system log entry only No corrective action is required 697C Connection from service processor to system processor failed Replace the system board and chassis assembly as described in Rep...

Страница 64: ...ion Check the management module event log for partition firmware error codes especially BA00F104 then take the appropriate actions for those error codes F105 System firmware detected an internal error...

Страница 65: ...he Tier 2 system board and chassis assembly on page 229 BA000032 The firmware failed to register the lpevent queues 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on pa...

Страница 66: ...ge 229 BA000081 Failed to get the firmware license policy 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as...

Страница 67: ...ufficient information to boot the systems 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly...

Страница 68: ...006 The boot image is too large Start up from another device with a bootable image BA010007 The device does not have the required device_type property 1 Reboot the blade server 2 If the problem persis...

Страница 69: ...that all of the iSCSI configuration arguments on the operating system comply with the configuration for the iSCSI Host Bus Adapter HBA which is the iSCSI initiator BA01000F The chapid parameter string...

Страница 70: ...and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA012013 Closing TCP failed 1 Reboot the blade server 2 If the problem persists a Go to Checkout...

Страница 71: ...4 Closing the BOOTP node failed 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as described in Replacing the...

Страница 72: ...r no good offer DHCP discovery did not receive any DHCP offers from the servers that meet the client requirements Verify that the DHCP server configuration file is not overly constrained An over const...

Страница 73: ...ace the device specified by the location code 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as described in Replacing the Tier 2 syst...

Страница 74: ...the Tier 2 system board and chassis assembly on page 229 BA060008 No configurable adapters found by the Remote IPL menu in the SMS utilities This error occurs when the firmware cannot locate any LAN a...

Страница 75: ...is intended for this partition The configuration of the partition supports an alpha mode operating system 2 If the problem remains a Go to Checkout procedure on page 148 b Replace the system board an...

Страница 76: ...ed sense data available 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA0900...

Страница 77: ...stem board and chassis assembly on page 229 BA090010 The request sense command failed 1 Troubleshoot the SCSD devices 2 Verify that the SCSD cables and devices are properly plugged Correct any problem...

Страница 78: ...available 1 Troubleshoot the SCSD devices 2 Verify that the SCSD cables and devices are properly plugged Correct any problems that are found 3 Replace the SCSD cables and devices 4 If the problem per...

Страница 79: ...013 USB CD ROM in the media tray bootable media is missing from the drive 1 Insert a bootable CD in the drive and retry the operation 2 If the problem persists a Retry the operation b Reboot the blade...

Страница 80: ...assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA140003 The SCSD read write optical send diagnostic failed sense data available 1 Troubleshoot the SCSD dev...

Страница 81: ...boot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis ass...

Страница 82: ...ge 229 BA170210 Setenv Setenv parameter error name contains a null character 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2...

Страница 83: ...eckout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 258 BA180014 MSI software error 1 Reboot the...

Страница 84: ...00 Partition firmware reports a default catch 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as described in...

Страница 85: ...system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA210020 I O configuration exceeded the maximum size allowed by partition firmware...

Страница 86: ...RQ registration error partner vslot may not be valid Verify that this client virtual slot device has a valid server virtual slot device in a hosting partition BA278001 Failed to flash firmware invalid...

Страница 87: ...b Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA310010 Unable to obtain the SRC history 1 Reboot the blade server 2...

Страница 88: ...problem persists a Go to Checkout procedure on page 148 b Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 BA340002 The...

Страница 89: ...bric manager system initiator capability processing encountered an unexpected error 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system boar...

Страница 90: ...Description Action BA400001 Informational message DMA trace buffer full 1 Reboot the blade server 2 If the problem persists a Go to Checkout procedure on page 148 b Replace the system board and chass...

Страница 91: ...n that identifies the failing component when there is a hang condition Notes For checkpoints with no associated location code see Light path diagnostics on page 182 to identify the failing component w...

Страница 92: ...r 2 system board and chassis assembly on page 258 C1001F0D Pre standby discovery completed in initial transition file While the blade server displays this checkpoint the service processor reads the sy...

Страница 93: ...9x18 Hardware object manager HOM GARD in progress 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis a...

Страница 94: ...on step in progress 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 C1009x46 P...

Страница 95: ...bly on page 229 C1009x6C Processor PSI initialization step in progress 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 syste...

Страница 96: ...9x98 ASIC wrap test in progress 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 22...

Страница 97: ...ress 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 C1009xC4 Dump initializat...

Страница 98: ...on page 229 C103A401 Instructions have been started on the system processors 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2...

Страница 99: ...p 1 Go to Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 C2001010 Startup...

Страница 100: ...229 C2002110 Issuing a power on command 1 Go to Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis...

Страница 101: ...Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 C2003112 Waiting for bus...

Страница 102: ...ation on the load source 1 Go to Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on pa...

Страница 103: ...artition 1 Go to Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 C20080A0...

Страница 104: ...eived from system firmware 1 Go to Recovering the system firmware on page 186 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on...

Страница 105: ...can be any number or letter Table 2 18 CA000000 to CA2799FF checkpoints If the system hangs on a progress code follow the suggested actions in the order in which they are listed in the Action column u...

Страница 106: ...ils 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA000070 Attempting to loa...

Страница 107: ...NVRAM script 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00D010 First pa...

Страница 108: ...the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E110 Create KDUMP properties 1 Reboot the blade server 2 If the problem pe...

Страница 109: ...sembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E13A Create packages node 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis as...

Страница 110: ...Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E14D Load boot image Go to...

Страница 111: ...of PCI bus probe 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E172 Firs...

Страница 112: ...t The bootp server is correctly configured then retry the operation The network connections are correct then retry the operation 2 If the problem persists a Go to Checkout procedure on page 148 b Repl...

Страница 113: ...and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E19B NVRAM menu variable not found assume FALSE 1 Go to Checkout procedure on page 148 2 Re...

Страница 114: ...s described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E1AB System booting using default service mode boot list 1 Go to Checkout procedure on page 148 2 Replace the syst...

Страница 115: ...stem board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E1D4 Create SCSD byte device node ST 1 Go to Checkout procedure on page 148 2 Rep...

Страница 116: ...Build boot device list for fibre channel adapters The location code of the SAN adapter being scanned is also displayed 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis as...

Страница 117: ...eplacing the Tier 2 system board and chassis assembly on page 229 CA00E701 Create memory VPD 1 Go to Checkout procedure on page 148 2 Replace the system board and chassis assembly as described in Repl...

Страница 118: ...bly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA00E876 Initializing rtas_error_inject 1 Go to Checkout procedure on page 148 2 Replace the system board and cha...

Страница 119: ...ystem board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 CA26ttss Waiting for lpevent of type tt and subtype ss 1 Reboot the blade server 2 I...

Страница 120: ...the control panel for at least 30 minutes with no other indication of activity If the system is hung on this checkpoint then CA2799FD and CA2799FF are not alternating and you must perform the followin...

Страница 121: ...ved If an action solves the problem you can stop performing theremaining actions See Chapter 3 Parts listing on page 197 to determine which components are CRUs and which components are FRUs Progress c...

Страница 122: ...e 148 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 D1111xxx Dump opt p0 1 Go to Checkout procedure on page 148 2 R...

Страница 123: ...e system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 D11D1xxx Dump environment 1 Go to Checkout procedure on page 148 2 Replace the sy...

Страница 124: ...48 2 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 D12E1xxx Remove core core 1 Go to Checkout procedure on page 148 2...

Страница 125: ...or dump codes These D1xx3yxx service processor dump codes use the format D1xx3yzz where xx indicates the cage or node ID that the dump component is processing y increments from 0 to F to indicate that...

Страница 126: ...and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 D1xx3y08 Send command 1 Go to Checkout procedure on page 148 2 Replace the system board and chas...

Страница 127: ...ould you take any of the actions described for a progress code Table 2 21 D1xx900C to D1xxC003 checkpoints If the system hangs on a progress code follow the suggested actions in the order in which the...

Страница 128: ...onents are CRUs and which components are FRUs Progress code Description Command Being Processed Action D1xxC002 Waiting for the hypervisor to send the power off message 1 Go to Checkout procedure on p...

Страница 129: ...Look up a service request number when you see an error code with a hyphen The SRN is in the first column of the SRN table in numerical order The SRN might have an associated FFC number Possible FFC v...

Страница 130: ...of day battery failed 1 Go to Removing the battery on page 224 to start the battery replacement procedure 2 Go to Installing the battery on page 225 to complete the procedure 109 200 The system crash...

Страница 131: ...operating environment 2 There is unrestricted air flow around the system 3 All system covers are closed 4 Verify that all fans in the Bull Blade Chassis Enterprise are operating correctly 651 159 210...

Страница 132: ...erforming the checkout procedure on page 149 651 625 214 Memory address error invalid address or access attempt Go to Performing the checkout procedure on page 149 651 626 214 Memory data error bad da...

Страница 133: ...em bus parity error Go to Performing the checkout procedure on page 149 651 712 214 System bus parity error Go to Performing the checkout procedure on page 149 651 713 214 System bus protocol transfer...

Страница 134: ...ce processor detects loss of voltage from the time of day clock backup battery Go to Performing the checkout procedure on page 149 651 770 292 Intermediate or system bus address parity error Go to Per...

Страница 135: ...t 2 There is unrestricted air flow around the system 3 There are no fan failures 651 841 152 2E2 Sensor detected a voltage outside of the normal range Go to Performing the checkout procedure on page 1...

Страница 136: ...3 2C8 292 A non critical error has been detected intermediate or system bus address parity error Schedule deferred maintenance Go to Performing the checkout procedure on page 149 652 734 2C8 292 A non...

Страница 137: ...heckout procedure on page 149 887 102 887I O register test failed Go to Performing the checkout procedure on page 149 887 103 887 Local RAM test failed Go to Performing the checkout procedure on page...

Страница 138: ...ransceiver test failed Go to Performing the checkout procedure on page 149 887 403 887 Ethernet 10 Base T transceiver test failed Go to Performing the checkout procedure on page 149 887 405 887 Ethern...

Страница 139: ...ming the checkout procedure on page 149 2506 9000 Controller detected device error during configuration discovery Go to Performing the checkout procedure on page 149 2506 9001 Controller detected devi...

Страница 140: ...ssing from a RAID 0 Disk Array Go to Performing the checkout procedure on page 149 2506 9062 One or more disks are missing from a RAID 0 Disk Array Go to Performing the checkout procedure on page 149...

Страница 141: ...any parts reported by the diagnostic program 3 Replace the system board and chassis assembly 252B 714 252B Temporary adapter failure 1 Check the management module event log If an error was recorded b...

Страница 142: ...agnostic program 3 Replace the system board and chassis assembly 254E 201 254E 221 Adapter configuration error Go to Performing the checkout procedure on page 149 254E 601 254 Error log analysis indic...

Страница 143: ...the diagnostic program 3 Replace the system board and chassis assembly 256D 606 256D Error Log Analysis indicates adapter failure 1 Check the management module event log If an error was recorded by t...

Страница 144: ...cates that an adapter error has occurred for the Fibre Channel adapter card Go to Performing the checkout procedure on page 149 2604 705 2604 Error Log Analysis indicates that a parity error has been...

Страница 145: ...apter system board and chassis assembly Go to Performing the checkout procedure on page 149 2624 101 2624 Configuration failure system board and chassis assembly Go to Performing the checkout procedur...

Страница 146: ...9 2640 134 2640 Hardware command or DMA failure Go to Performing the checkout procedure on page 149 2640 135 2640 IDE DMA error with no error status Go to Performing the checkout procedure on page 149...

Страница 147: ...lure could not be isolated 1 Check the management module event log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board...

Страница 148: ...module event log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A02 05x Memory Address Error...

Страница 149: ...t log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A03 11x System bus time out error 1 Che...

Страница 150: ...ternal temperature 1 Make sure that a The room ambient temperature is within the system operating environment b There is unrestricted air flow around the system c All system covers are closed d There...

Страница 151: ...If no entry is found replace the system board and chassis assembly A0D 00x Error log analysis indicates an error detected by the Service Processor but the failure could not be isolated 1 Check the ma...

Страница 152: ...odule event log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A0D 36x Other IPL Diagnostic...

Страница 153: ...eplace the system board and chassis assembly A11 50x Recoverable errors on resource indicate a trend toward an unrecoverable error However the resource could not be deconfigured and is still in use Th...

Страница 154: ...ule event log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A12 07x A non critical error ha...

Страница 155: ...y A13 01x A non critical error has been detected an I O bus address parity error 1 Check the management module event log if an error was recorded by the system see POST progress codes checkpoints on p...

Страница 156: ...es checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A13 16x A non critical error has been detected an I O expansion unit not in an operating state 1 Check th...

Страница 157: ...log if an error was recorded by the system see POST progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A15 19x Fan failure 1 Check the manageme...

Страница 158: ...progress codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A1D 05x A non critical error has been detected a service processor error accessing special r...

Страница 159: ...nd chassis assembly A1D 23x A non critical error has been detected Loss of heart beat from Service Processor 1 Check the management module event log if an error was recorded by the system see POST pro...

Страница 160: ...s codes checkpoints on page 71 2 If no entry is found replace the system board and chassis assembly A1D 50x Recoverable errors on resource indicate a trend toward an unrecoverable error However the re...

Страница 161: ...checkpoints on page 71 2 Replace any parts reported by the diagnostic program 3 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on...

Страница 162: ...e system see POST progress codes checkpoints on page 71 2 Replace any parts reported by the diagnostic program 3 Replace the system board and chassis assembly as described in Replacing the Tier 2 syst...

Страница 163: ...e 71 2 Replace any parts reported by the diagnostic program 3 Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 ssss 132...

Страница 164: ...mbly as described in Replacing the Tier 2 system board and chassis assembly on page 229 ssss 640 ssss Error log analysis indicates a path error 1 Check the management module event log If an error was...

Страница 165: ...IMM 2 GB DIMM 4 GB DIMM 8 GB DIMM 2C7 System board and chassis assembly Memory controller 2C8 System board and chassis assembly 2C9 System board and chassis assembly 2D2 System board and chassis assem...

Страница 166: ...stem board and chassis assembly cache problem E19 System board and chassis assembly power supply sensor failed 252B System board and chassis assembly SAS controller 2553 SAS 73 4 GB or SAS 146 GB hard...

Страница 167: ...rst word of the SRC in this example is the message identifier B7001111 This example numbers each word after the first word to show relative word positions The seventh word is the direct select address...

Страница 168: ...correct the cause of the first error message The other error messages usually will not occur the next time you run the diagnostic programs Exception If there are multiple error codes or light path dia...

Страница 169: ...eckpoint and attempted the corrective action before going to Step 003 1 If the firmware hangs on an eight digit progress code see POST progress codes checkpoints on page 71 2 If the firmware records a...

Страница 170: ...e component See Using the diagnostics program on page 155 2 If you cannot perform AIX concurrent online diagnostics continue to Step 006 Step 006 Perform the following steps 1 Use the management modul...

Страница 171: ...ollowing responses a Progress codes are recorded in the management module event log b Record any messages or diagnostic information that might be in the log Continue with step 008 Step 008 Load the st...

Страница 172: ...AIX concurrent diagnostics from the AIX operating system 1 Log in to the AIX operating system as root user or use the CE login See Creating a CE login on page 235 for more information If you need hel...

Страница 173: ...Enter to continue The Function Selection screen will display See Using the diagnostics program on page 155 for more information about running the diagnostics program Note If the Define Terminal screen...

Страница 174: ...y f If the NIM server is setup to allow pinging the client system use the Ping Test option on the Network Parameters menu to verify that the client system can ping the NIM server Note If the ping fail...

Страница 175: ...eturn to the Function Selection menu System Verification i From the Function Selection menu select Diagnostic Routines and press Enter ii From the Diagnostic Mode Selection menu select System Verifica...

Страница 176: ...ps 1 Make sure that your boot list is correct a From the management module Web interface display the boot sequences for the blade servers in your Bull Blade Chassis Enterprise Blade Tasks Configuratio...

Страница 177: ...ying to boot If the CD fails on the second server replace the CD or DVD drive in the media tray e If replacing the CD or DVD drive does not resolve the problem replace the media tray f If booting on a...

Страница 178: ...service technician only that step must be performed only by a trained service technician Symptom Action A cover lock is broken an LED is not working or a similar problem has occurred If the part is a...

Страница 179: ...toms and what corrective actions to take Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved See Chapter 3 Parts listing on page 197 to...

Страница 180: ...h components are FRUs If an action step is preceded by Trained service technician only that step must be performed only by a trained service technician Symptom Action Service processor in the manageme...

Страница 181: ...heck the management module event log for error message checkpoint or firmware error codes If the DIMM was disabled by a system management interrupt SMI replace the DIMM If the DIMM was disabled by POS...

Страница 182: ...the keyboard video ownership on the Bull Blade Chassis Enterprise has not been switched to another blade server If the problem remains see Solving undetermined problems on page 194 The monitor goes b...

Страница 183: ...AIX console to a SOL connection This does not affect the console that is used by partition firmware 1 chcons dev vty0 2 shutdown Fr 2 10 9 Network connection problems Identify network connection probl...

Страница 184: ...55555555 66666666 77777777 88888888 99999999 Depending on your operating system and the utilities you have installed error messages might also be stored in an operating system log See the documentati...

Страница 185: ...ve not loosened any other installed devices or cables 2 If the option comes with its own test instructions use those instructions to test the option 3 Reseat the device that you just installed 4 Repla...

Страница 186: ...ving power the blade server is defective or the LED information panel is loose or defective e Local power control for the blade server is enabled use the management module Web interface to verify or t...

Страница 187: ...components are CRUs and which components are FRUs If an action step is preceded by Trained service technician only that step must be performed only by a trained service technician Isolation Procedure...

Страница 188: ...emory module 5 DIMM 6 Px C6 Memory module 6 DIMM 7 Px C7 Memory module 7 DIMM 8 Px C8 Memory module 8 2 See Removing a memory module on page 213 for location information and the removal procedure 3 In...

Страница 189: ...ervice action The isolation procedure code is recorded in the management module event log A message with three procedures might be similar to the following example except that the entry would be on on...

Страница 190: ...d 6 If the Chassis is functioning normally but the 1xxx2670 problem persists Replace the system board and chassis assembly as described in Replacing the Tier 2 system board and chassis assembly on pag...

Страница 191: ...ssembly as described in Replacing the Tier 2 system board and chassis assembly on page 229 If the SRC is B1xxB107 or B1xxB108 The system has detected a problem with a clock card 1 Replace the system b...

Страница 192: ...as described in Replacing the Tier 2 system board and chassis assembly on page 229 FSPSP05 The service processor has detected a problem in the platform firmware 1 Verify that the operating system is...

Страница 193: ...ule on page 213 for location information and the removal procedure 3 Install new memory DIMMs as described in Installing a memory module on page 214 See Supported DIMMs on page 5 for more information...

Страница 194: ...been displayed has the A1xx SRC remained for more than 40 minutes If so the server firmware could not begin terminating the partitions Contact your next level of support to assist in attempting to te...

Страница 195: ...nd the model of the system 2 Call Bull Support to find out what CRU the resource ID represents 3 Replace the CRU that the resource ID represents FSPSP29 The system has detected that all I O bridges ar...

Страница 196: ...is not programmed Record the reason code which is the last four digits of the first word from the SRC Perform one of the following procedures based upon the value of the reason code Reason code A46F 1...

Страница 197: ...se of the correct type SRC B1xx C02B A group of memory cards are missing and are required so that other memory cards on the board can be configured The additional parts in the CRU callout list include...

Страница 198: ...Support FSPSP50 A diagnostic function detects a connection problem between a processor chip and a GX chip If the CRUs called out before this procedure do not fix the problem Contact Bull Support FSPSP...

Страница 199: ...ng or that are turning slowly If you replace fans wait for the unit to cool and retry the operation 4 If the fans are functioning correctly there are environmental issues with the cooling of the proce...

Страница 200: ...dule on page 214 See Supported DIMMs on page 5 for more information NO12VDC Symbolic CRU Error code 1xxx2647 indicates that the blade server is reporting that 12V dc is not present on the Bull Blade C...

Страница 201: ...server might have a memory address conflict The software is designed to operate on the blade server Other software works on the blade server The software works on another server 2 If you received any...

Страница 202: ...path diagnostic LEDs read Safety on page vii and Handling static sensitive devices on page 202 If an error occurs view the light path diagnostic LEDs in the following order 1 Look at the control pane...

Страница 203: ...Figure 2 1 Light path diagnostic LEDs Callout System board LEDs 1 Light path power LED 2 System board LED Px 3 SAS hard disk drive LED or SAS solid state drive LED 4 DIMM 1 4 LEDs 5 1Xe connector LED...

Страница 204: ...error occurred 1 Reseat the battery 2 Replace the battery DIMM x error P1 C1 DIMM 1 P1 C2 DIMM 2 P1 C3 DIMM 3 P1 C4 DIMM 4 P1 C5 DIMM 5 P1 C6 DIMM 6 P1 C7 DIMM 7 P1 C8 DIMM 8 A memory error occurred 1...

Страница 205: ...ssis assembly error 1 Replace the blade server cover reinsert the blade server in the Bull Blade Chassis Enterprise and then restart the blade server 2 Check the management module event log for inform...

Страница 206: ...arting the PERM image You can force the blade server to start the PERM permanent image To force the blade server to start the PERM permanent image complete the following procedure 1 Access the Chassis...

Страница 207: ...e firmware code to the latest version See Updating the firmware on page 231 for more information about how to update the firmware code 2 13 4 Verifying the system firmware levels The diagnostics progr...

Страница 208: ...ctions screen is displayed then press F3 again to exit the diagnostic program 2 14 Solving shared Bull Blade Chassis Enterprise resource problems Problems with Bull Blade Chassis Enterprise shared res...

Страница 209: ...might actually be a problem in a Bull Blade Chassis Enterprise keyboard component To check the general function of shared keyboard resources perform the following procedure 1 Verify that the keyboard...

Страница 210: ...ports are the only failing component a Make sure that the USB device is operational b If using a USB hub make sure that the hub is operating correctly and that any software the hub requires is install...

Страница 211: ...plicable Media tray 8 Replace the following components one at a time in the order shown restarting the blade server each time a Removable media drive cable if applicable b Media tray cable if applicab...

Страница 212: ...etwork interface are configured correctly 7 Verify that the settings in the I O module are correct for the blade server Some settings in the I O module are specifically for each blade server 8 Verify...

Страница 213: ...rrectly See the Management Module User s Guide or the Management Module Command Line Interface Reference Guide for more information 7 Verify that the Bull Blade Chassis Enterprise blowers are correctl...

Страница 214: ...are Maintenance Manual and Troubleshooting Guide for your Bull Blade Chassis Enterprise If these steps do not resolve the problem it is likely a problem with the blade server See Monitor or video prob...

Страница 215: ...IMMs The following minimum configuration is required for the blade server to start System board and chassis assembly with two microprocessors Two 2 GB DIMMs A functioning Bull Blade Chassis Enterprise...

Страница 216: ...his the original reported failure or has this failure been reported before Diagnostic program type and version level Hardware configuration print screen of the system summary Firmware level Operating...

Страница 217: ...nents are of three types Tier 1 customer replaceable unit CRU Replacement of Tier 1 CRUs is your responsibility If Bull installs a Tier 1 CRU at your request you will be charged for the installation T...

Страница 218: ...annel Expansion Card CIOv option 46M6138 2607 3 4X InfiniBand DDR Expansion Card CFFh for BladeCenter option 7778 8258 3 Voltaire 4x InfiniBand DDR Expansion Card CFFh for BladeCenter option 7778 8298...

Страница 219: ...and screws 4 option 42D0628 2553 9 Solid State Drive SSD 69 GB and screws 4 option 44V6825 2553 9 Disk drive filler 40K5928 Label FRU list 44V7312 Label OEM FRU list 44V7313 Label System service 44V67...

Страница 220: ...200 Escala BL460 Problem Determination and Service Guide...

Страница 221: ...e see the Warranty and Support Information document 4 1 Installation guidelines Follow these guidelines to remove and replace blade server components Read the Safety Attention in Safety on page vii an...

Страница 222: ...ur Bull Blade Chassis Enterprise for additional information Verify that you have followed the reliability guidelines for the Bull Blade Chassis Enterprise Verify that the blade server battery is opera...

Страница 223: ...move the blade server from the Bull Blade Chassis Enterprise to access options connectors and system board indicators Figure 4 1 Removing the blade server from the Bull Blade Chassis Enterprise Attent...

Страница 224: ...lade server on a flat static protective surface with the cover side up 8 Place either a blade filler or another blade server in the bay within 1 minute The recessed spring loaded doors move out of the...

Страница 225: ...rther back in the bay that cover the bay opening move out of the way as you insert the blade server 8 Push the release handles on the front of the blade server to close and lock them The discovery and...

Страница 226: ...1 CRUs Replacement of Tier 1 customer replaceable units CRUs is your responsibility If Bull installs a Tier 1 CRU at your request you will be charged for the installation The illustrations in this doc...

Страница 227: ...lay the blade server on a flat static protective surface with the cover side up 4 Press the blade cover release as shown by 1 on each side of the blade server rotate the cover on the cover pins 3 and...

Страница 228: ...to the power source Always replace the blade server cover before installing the blade server Perform the following procedure to replace and close the blade server cover 1 Read Safety on page vii and...

Страница 229: ...moving the blade server from a Bull Blade Chassis Enterprise on page 203 3 Carefully lay the blade server on a flat static protective surface with the cover side up 4 Open and remove the blade server...

Страница 230: ...er until the two bezel assembly releases 3 click into place in the bezel assembly 3 Install and close the blade server cover See Installing and closing the blade server cover on page 208 Statement 21...

Страница 231: ...down the operating system turn off the blade server and remove the lade server from the Bull Blade Chassis Enterprise See Removing the blade server from a Bull Blade Chassis Enterprise on page 203 4 C...

Страница 232: ...e the lade server from the Bull Blade Chassis Enterprise See Removing the blade server from a Bull Blade Chassis Enterprise on page 203 3 Carefully lay the blade server on a flat static protective sur...

Страница 233: ...ise on page 204 4 4 7 Removing a memory module You can remove a very low profile VLP dual inline memory module DIMM 1 Read Safety on page vii and the Installation guidelines on page 201 2 Shut down th...

Страница 234: ...Installing a memory module Install dual inline memory modules DIMMs in the blade server The following table shows allowable placement of DIMM modules BL460 Blade planar P1 DIMM slots DIMM count 1 2 3...

Страница 235: ...from its package 8 Verify that both of the connector retaining clips are in the fully open position 9 Turn the DIMM so that the DIMM keys align correctly with the connector on the system board Attent...

Страница 236: ...te the management card connector See System board connectors on page 12 for the management card slot location Attention To avoid breaking the card retaining clips 2 or damaging the management card con...

Страница 237: ...ent card to any unpainted metal surface on the Bull Blade Chassis Enterprise or any unpainted metal surface on any other grounded rack component then remove the management card as shown by 1 in the fi...

Страница 238: ...the management module to discover the blade server Attention If the management card was not properly installed the power on LED blinks rapidly and a communication error is reported to the management...

Страница 239: ...that you are using is supported by the Escala BL460 blade server For example the following expansion cards are not supported by the Escala BL460 blade server Blade SFF Gb Ethernet Cisco 1X InfiniBand...

Страница 240: ...e surface with the cover side up 4 Open and remove the blade server cover See Removing the blade server cover on page 206 5 Lift the expansion card 1 up away from the 1Xe connector and out of the blad...

Страница 241: ...he Bull Blade Chassis Enterprise or any unpainted metal surface on any other grounded rack component then remove the part from its package 6 Orient the expansion card 1 over the system board 7 Lower t...

Страница 242: ...a Bull Blade Chassis Enterprise on page 203 3 Open and remove the blade server cover See Removing the blade server cover on page 206 4 Remove the horizontal CFFh CFFe expansion card 2 b Pull up on the...

Страница 243: ...is Enterprise See Removing the blade server from a Bull Blade Chassis Enterprise on page 203 3 Carefully lay the blade server on a flat static protective surface with the cover side up 4 Open and remo...

Страница 244: ...orm any configuration that the expansion card requires 4 4 12 Removing the battery You can remove and replace the battery Figure 4 17 Removing the battery Perform the following procedure to remove the...

Страница 245: ...ry clip Note After you remove the battery press gently on the clip to make sure that the battery clip is touching the base of the battery socket 4 4 13 Installing the battery You can install the batte...

Страница 246: ...ing and installation instructions that come with the battery 2 Tilt the battery so that you can insert it into the socket under the battery clip Make sure that the side with the positive symbol is fac...

Страница 247: ...and the Installation guidelines on page 201 2 Shut down the operating system turn off the blade server and remove the blade server from the Bull Blade Chassis Enterprise See Removing the blade server...

Страница 248: ...e it 2 Install the hard disk drive that was removed from the drive tray See Installing a drive on page 212 for instructions 3 Install and close the blade server cover See Installing and closing the bl...

Страница 249: ...lade server from a Bull Blade Chassis Enterprise on page 203 3 Carefully lay the blade server on a flat static protective surface with the cover side up 4 Open and remove the blade server cover See Re...

Страница 250: ...pe model number and serial number of the blade server on the repair identification RID tag that comes with the replacement system board and chassis assembly This information is on the identification l...

Страница 251: ...RS 485 bus of the management module Therefore a firmware update for the blade server is not supported from the management module You can still use the other methods of performing firmware updates for...

Страница 252: ...mand on AIX cd tmp fwupdate usr lpp diagnostics bin update_flash f 01EA3xx_yyy_zzz Install the firmware with the update_flash command on Linux cd tmp fwupdate usr sbin update_flash f 01EA3xx_yyy_zzz R...

Страница 253: ...n be used for AIX or Linux partitions See Using the SMS utility for more information Default boot list Use this utility to initiate a system boot in service mode through the default service mode boot...

Страница 254: ...hoices on the SMS utility main menu depend on the version of the firmware in the blade server Some menu choices might differ slightly from these descriptions Select Language Select this choice to chan...

Страница 255: ...x interface for connecting to one of the Ethernet compatible I O modules in I O module bays 1 and 2 which enables simultaneous transmission and reception of data on the Ethernet local area network LAN...

Страница 256: ...de server uses through the operating system settings The routing of an Ethernet controller to a particular I O module bay depends on the type of blade server You can verify which Ethernet controller i...

Страница 257: ...eth1 and the two associated physical HEA ports on the blade server The MAC addresses of the two physical HEAs are displayed in the Chassis management module The MAC address of the first integrated Et...

Страница 258: ...eck for the latest applicable IBM System Director updates and interim fixes To install the IBM System Director updates and any other applicable updates and interim fixes complete the following steps 1...

Страница 259: ...ng the troubleshooting procedures that are provided in your system and software documentation Most systems operating systems and programs come with information that contains troubleshooting procedures...

Страница 260: ...240 Escala BL460 Problem Determination and Service Guide...

Страница 261: ...ronments Maximum internal hard disk drive capacities assume the replacement of any standard hard disk drives and population of all hard disk drive bays with the largest currently supported drives avai...

Страница 262: ...et la Norv ge L tiquette du syst me respecte la Directive europ enne 2002 96 EC en mati re de D chets des Equipements Electriques et Electroniques DEEE qui d termine les dispositions de retour et de...

Страница 263: ...han recommended cables and connectors or by unauthorized changes or modifications to this equipment Unauthorized changes or modifications could void the user s authority to operate the equipment This...

Страница 264: ...g of non Bull option cards This product has been tested and found to comply with the limits for Class A Information Technology Equipment according to CISPR 22 European Standard EN 55022 The limits for...

Страница 265: ......

Страница 266: ...BULL CEDOC 357 AVENUE PATTON B P 20845 49008 ANGERS CEDEX 01 FRANCE REFERENCE 86 A7 81FB 00...

Отзывы: