background image

UEFI Shell command

Definition

exit

Exits the UEFI Shell environment

for

Executes commands for each item in a set of items

ftp

Performs FTP operation

getmtc

Gets the MTC from BootServices and displays it

goto

Forces batch file execution to jump to specified location

help

Displays the UEFI Shell command list or verbose command help

hexedit

Displays full screen hex editor

if

Executes commands in specified conditions

ifconfig

Modifies the default IP address of UEFI IPv4 network stack

ifconfig6

Displays or modifies IPv6 configuration for network interface

info

Displays hardware information

input

Take user input and place in UEFI variable

ioconfig

Deconfigures or reconfigures I/O components or settings

lanaddress

Displays LAN devices

lanboot

Performs LAN boot

load

Loads and optionally connects one or more UEFI drivers

loadpcirom

Loads a PCI Option ROM

ls

Displays a list of files and subdirectories in a directory

map

Displays or defines mappings

memmap

Displays the memory map

mkdir

Creates one or more directories

mm

Displays or modifies MEM/MMIO/IO/PCI/PCIE address space

mode

Displays or changes the console output device mode

mv

Moves one or more files or directories to another location

openinfo

Displays the protocols and agents associated with a handle

Table Continued

104

Utilities

Summary of Contents for Integrity Superdome X

Page 1: ...PE Integrity Superdome X Service Guide for Users Part Number 794235 007 Published November 2016 Edition 7 Abstract This guide describes the HPE Integrity Superdome X and provides user service information ...

Page 2: ...fications 28 Cooling requirements 29 Air quality specifications 29 Acoustic noise specifications 29 Sample site inspection checklist for site preparation 30 Updating firmware 34 Prerequisites 34 Installing the latest complex firmware using HP SUM 34 Manually updating the complex firmware 34 Download firmware bundle 35 Update the complex firmware 35 I O firmware and drivers 36 SMH and WBEM provider...

Page 3: ... components 65 Viewing indictment acquittals 66 Viewing recent service history 66 Physical Location installation and health history 66 Subcomponent isolation and deconfiguration displays 68 Using event logs 72 Live viewer 73 SEL and FPL viewers 75 Core Analysis Engine 79 OA 82 Troubleshooting processors 83 Troubleshooting memory 84 Troubleshooting cards and drivers 86 Troubleshooting compute enclo...

Page 4: ...baud rate 111 Insight Display 112 Insight Display overview 112 Navigating the Insight Display 112 Health Summary screen 113 Enclosure Settings screen 114 Enclosure Info screen 115 Blade and Port Info screen 116 Turn Enclosure UID On Off screen 117 View User Note screen 118 Chat Mode screen 118 Insight Display errors 119 Power errors 119 Cooling errors 119 Location errors 120 Configuration errors 1...

Page 5: ...commercial license Links to third party websites take you outside the Hewlett Packard Enterprise website Hewlett Packard Enterprise has no control over and is not responsible for information outside the Hewlett Packard Enterprise website Acknowledgments Intel Itanium Pentium Intel Inside and the Intel Inside logo are trademarks of Intel Corporation in the United States and other countries Microsof...

Page 6: ...cumentation and Technical Data for Commercial Items are licensed to the U S Government under vendor s standard commercial license Links to third party websites take you outside the Hewlett Packard Enterprise website Hewlett Packard Enterprise has no control over and is not responsible for information outside the Hewlett Packard Enterprise website Acknowledgments Intel Itanium Pentium Intel Inside ...

Page 7: ...Added instructions to save EFI variables to disk Added sections on troubleshooting the OA battery Updated illustrations for new HPE standards Updated Insight Display screens Added troubleshooting scenario where PXE fails to find the boot file Updated references to the new XFM2 crossbar modules 794235 003 edition Added BL920s Gen9 blade support Added SLES 11 SP4 and SLES 12 OS support Added RHEL 6 ...

Page 8: ...e enclosure supports four XFMs that provide the crossbar fabric which carries data between blades NOTE HPE Integrity Superdome X systems may contain XFM or XFM2 crossbar modules Unless specifically stated otherwise this document refers to all crossbar modules as XFMs but the information will generally apply to either XFM or XMF2 modules More information Integrity Superdome X QuickSpecs Power subsy...

Page 9: ...d and controlled by the OA through the GPSMs More information Integrity Superdome X QuickSpecs Server blades Each BL920s server blade contains two x86 processors and up to 48 DIMMs Server blades and partitions Integrity Superdome X supports multiple nPartitions of 2 4 6 8 12 or 16 sockets 1 2 3 4 6 or 8 blades Each nPartition must include blades of the same type but the system can include nPartiti...

Page 10: ...uperdome X servers at http www hpe com info superdomeX firmware matrix Fibre channel and LAN connectivity are supported by the interconnect modules in the rear of the compute enclosure For more information see More information Interconnect bay numbering Integrity Superdome X QuickSpecs Firmware Matrix for HPE Integrity Superdome X servers Connecting a PC to the OA service port Compute enclosure ov...

Page 11: ...Item Description 1 Power supply bay 7 2 Power supply bay 8 3 Power supply bay 9 4 Power supply bay 10 5 Power supply bay 11 6 Power supply bay 12 Table Continued HPE Integrity Superdome X overview 11 ...

Page 12: ... Do not block 9 Power supply bay 6 10 Power supply bay 5 11 Insight Display 12 Power supply bay 4 13 Power supply bay 3 14 Power supply bay 2 15 Power supply bay 1 16 Blade slots 17 Air intake slot Do not block 12 HPE Integrity Superdome X overview ...

Page 13: ...Power supply bay numbering HPE Integrity Superdome X overview 13 ...

Page 14: ...Server blade slot numbering 14 HPE Integrity Superdome X overview ...

Page 15: ... navigation bar selection left one position 3 Right arrow button Moves the menu or navigation bar selection right one position 4 OK button Accepts the highlighted selection and navigates to the selected menu 5 Down arrow button Moves the menu selection down one position 6 Up arrow button Moves up the menu selection one position HPE Integrity Superdome X overview 15 ...

Page 16: ...Compute enclosure rear components Item Description 1 AC power connectors upper 2 Fan bay 1 3 Fan bay 6 4 Fan bay 2 5 Fan bay 7 6 Fan bay 3 Table Continued 16 HPE Integrity Superdome X overview ...

Page 17: ...y 3 16 XFM bay 4 17 GPSM bay 2 18 Interconnect bay 2 19 Interconnect bay 4 20 Interconnect bay 6 21 Interconnect bay 8 22 OA bay 2 23 Power supply exhaust vent Do not block 24 AC power connectors lower 25 Fan bay 15 26 Fan bay 14 27 Fan bay 13 28 Fan bay 12 29 Fan bay 11 30 OA bay 1 31 Interconnect bay 7 Table Continued HPE Integrity Superdome X overview 17 ...

Page 18: ...o provide network access for data transfer Interconnect modules reside in bays located in the rear of the enclosure Review blade slot numbering to determine which external network connections on the interconnect modules are active To support server blade LAN and Fibre Channel I O connections an appropriate type of interconnect module is installed according to bay location 18 HPE Integrity Superdom...

Page 19: ...re interconnect bay Interconnect bay label FlexLOM 1 port 1 1 FlexLOM 1 port 2 2 FlexLOM 2 port 1 1 FlexLOM 2 port 2 2 Mezzanine 1 port 1 3 Mezzanine 1 port 2 4 Mezzanine 1 port 3 3 Table Continued HPE Integrity Superdome X overview 19 ...

Page 20: ... 3 port 4 6 NOTE For information on the location of LEDs and ports on individual interconnect modules see the documentation that ships with the interconnect module More information Integrity Superdome X QuickSpecs Server blade overview Product Processors DIMM slots Supported DIMM size PCIe I O Mezzanine card capacity PCI I O FlexLOM card capacity BL920s Gen8 BL920s Gen9 2 48 16 GB and 32 GB Gen8 1...

Page 21: ...ository and in event logs 2 CPU 1 3 Mezzanine bracket 4 Mezzanine connector 1 Type A 5 Mezzanine connector 2 Type A B 6 FlexLOM slot 2 7 CPU 0 8 Mezzanine connector 3 Type A B 9 FlexLOM slot 1 10 DDR3 DIMM slots 48 BL920s Gen8 DDR4 DIMM slots 48 BL920s Gen9 11 SUV board HPE Integrity Superdome X overview 21 ...

Page 22: ...ION The SUV cable is not designed to be used as a permanent connection therefore be careful when walking near the server blade Hitting or bumping the cable might cause the port on the server blade to break and damage the blade IMPORTANT The SUV port does not provide console access and the serial port is unused Item Description 1 Server blade connector 2 Serial 3 USB ports 2 4 Video More informatio...

Page 23: ...20 60 in 62 18 cm 24 48 in Component weights Table 2 Compute enclosure weights Component Weight Max quantity per enclosure Compute enclosure chassis1 64 9 kg 143 0 lb 1 I O chassis2 22 1 kg 48 7 lb 1 Midplane Brick 18 8 kg 41 5 lb 1 OA tray 3 6 kg 8 0 lb 1 Active Cool Fan 0 9 kg 2 7 lb 15 Power supply module 2 3 kg 5 0 lb 12 Enclosure DVD module 2 1 kg 4 7 lb 1 OA module 0 8 kg 1 8 lb 2 Table Cont...

Page 24: ...tion Guide Rack specifications Table 3 Rack specifications Rack Total cabinet area with packing materials H x D x W U height Width Depth Dynamic load gross Static load HPE 642 1075 mm Intelligent Rack 246 80 x 129 20 x 90 cm 85 35 x 50 87 x 35 43 in 42U 597 8 mm 23 54 in 1 085 63 mm 42 74 in 1 134 kg 2 500 lb 1 360 8 kg 3 000 lb HPE 642 1200 mm Shock Intelligent Rack 218 00 x 147 00 x 90 cm 85 82 ...

Page 25: ... Plug or connector type Circuit type Power receptacle required Number of power cords required per enclosure 3 phase 200 VAC to 240 VAC line to line phase to phase 3 phase 50 60 Hz NEMA L15 30p 3 Pole 4 wire 3 m 10 ft power cord 30 A 3 phase L15 30R 3 pole 4 wire 4 3 phase 220 VAC to 240 VAC line to neutral 3 phase 50 60 Hz IEC 309 4 pole 5 wire Red 3 m 10 ft power cord 16 A IEC 309 4 pole 5 wire r...

Page 26: ...urrent 100 A for 10 ms Ground leakage current 3 5 mA Power factor correction 0 98 Table 7 Enclosure 3 phase 2400 W power supply specifications North America Japan Specification Value Power cords 4 NEMA L15 30p 3 0 m 10 ft Max input current per line cord 24 0 A at 200 VAC 23 1 A at 208 VAC Output 2450 W per power supply Input requirements Rated input voltage 200 240 VAC line to line 3 phase Rated i...

Page 27: ... AC input expressed in watts and volt amps This figure was developed with the absolute maximum configuration running applications designed to draw the maximum power possible It is highly unlikely that any real world application will result in this amount of power use for any significant time period Table 10 Enclosure PDU power options Source Circuit type Source voltage nominal Plug or connector ty...

Page 28: ...ange2 5 C to 40 C 41 F to 104 F Recommended Operating Range 18 C to 27 C 64 F to 81 F Nonoperating powered off 5 C to 45 C 41 F to 113 F Nonoperating storage 40 C to 80 C 40 F to 176 F Humidity Range noncondensing Allowable Operating Range 12 C DP and 8 RH to 24 C DP and 85 RH Recommended Operating Range 5 5 C DP to 15 C DP and 65 RH Nonoperating powered off 8 RH to 90 RH and 29 C DP Nonoperating ...

Page 29: ...oads on the room and can lead to unexpected equipment problems More information Generic Site Preparation Guide Air quality specifications Chemical contaminant levels in customer environments for Hewlett Packard Enterprise hardware products must not exceed G1 mild levels of Group A chemicals at any time These contaminant levels are described in the current version of ISA 71 04 Environmental Conditi...

Page 30: ...number Hewlett Packard Enterprise information Sales representative Order number Representative making survey Date Scheduled delivery date Table 12 Site inspection checklist Check either Yes or No If No include comment or date Computer Room Number Area or condition Yes No Comment or Date 1 Do you have a completed floor plan 2 Is adequate space available for maintenance needs Front 91 4 cm 36 inches...

Page 31: ...ion and properly braced 12 Is floor tile underside shiny or painted If painted judge the need for particulate test Power and Lighting 13 Are lighting levels adequate for maintenance 14 Are AC outlets available for servicing needs for example laptop usage 15 Does the input voltage correspond to equipment specifications 15a Is dual source power used If so identify types and evaluate grounding 16 Doe...

Page 32: ... available for emergency purposes 24 Does the computer room have a fire protection system 25 Does the computer room have anti static flooring installed 26 Do any equipment servicing hazards exist loose ground wires poor lighting and so on Cooling 27 Can cooling be maintained between 5 C 41 F and 40 C 104 F up to 1 525 m 5 000 ft Derate 1 C 305 m 1 8 F 1 000 ft above 1 525 m 5 000 ft and up to 3 04...

Page 33: ... 18 F This temperature change Is within tolerance as a 20 C 36 F change per hour Repetitive changes Every 15 minutes there is a repetitive consistent 5 C 9 F up and down change This repetitive temperature change is a 40 C 72 F change per hour and not within tolerance Also note that rapid changes to temperature over a short period are more damaging than gradual changes over time 29 Can humidity lev...

Page 34: ...he correct order and ensure all dependencies are met before deployment of a firmware update It also contains logic to prevent version based dependencies from destroying an installation and ensures updates are handled in a manner that reduces any downtime required for the update process HP SUM does not require an agent for remote installations HP SUM is included in the downloadable firmware bundles...

Page 35: ...elease Notes 3 Copy the bundle to a media accessible from the OA 4 Connect a PC to OA over Telnet or SSH and login to the CLI For more information see Connecting a PC to the OA service port 5 At the CLI prompt use the connect blade blade command to connect to each blade and then use the exit command to return to the OA prompt For example OA connect blade 1 hpiLO exit IMPORTANT This will ensure tha...

Page 36: ...are bundle and drivers IMPORTANT Installing incorrect or unsupported firmware can cause unpredictable behavior The latest IO device firmware versions might not be supported for your system Be sure to use only the firmware versions that are qualified and recommended for your system Do not use the SPP as a source of device firmware for Superdome X systems SMH and WBEM providers Hewlett Packard Enter...

Page 37: ...p to 8 sockets Red Hat Linux RHEL 6 5 BL920s Gen8 RHEL 6 6 BL920s Gen8 and Gen9 v3 RHEL 6 7 BL920s all versions RHEL 6 8 BL920s all versions RHEL 7 0 BL920s Gen8 RHEL 7 1 BL920s Gen8 and Gen9 v3 RHEL 7 2 BL920s all versions RHEL 7 3 BL920s all versions SuSE Linux SLES 11 SP3 BL920s Gen8 and Gen9 v3 SLES 11 SP3 for SAP BL920s Gen8 and Gen9 v3 SLES 11 SP4 BL920s all versions SLES 12 BL920s Gen8 and ...

Page 38: ... Hat Linux For detailed information about using RHEL on Integrity Superdome X systems see the Running Linux on HPE Integrity Superdome X white paper at http www hpe com support superdomeXlinux whitepaper Using SuSE Linux For detailed information about using SLES on Integrity Superdome X systems see the Running Linux on HPE Integrity Superdome X white paper at http www hpe com support superdomeXlin...

Page 39: ...is the nature of an alias A partition name should at least have one of the following non numeric characters a z A Z dash _ underscore period Any other non numeric character is not allowed in a partition name nPartition names are unique within a complex Partition Power Operations To activate an inactive nPartition use the poweron partition command on the OA CLI To make an active partition inactive ...

Page 40: ...e nPartition An nPartition is considered inactive when it is not powered on An nPartition is in inactive state after it has been created or shut down Unknown nPartition An nPartition might report a partition state of Unknown and a runstate of DETACHED after an OA restart This state is possible when the firmware is not able to identify the correct nPartition state due to internal firmware errors at...

Page 41: ...G A boot operation has been initiated for this partition FWBOOT The boot process is in the firmware boot phase for this partition and the partition has transitioned into the active status EFI The partition is at the EFI shell OSBOOT The boot process has started booting the OS in this partition UP The OS in this partition is booted and running 1 SHUT A shutdown reboot reset operation has been initi...

Page 42: ...econfigured A parent resource has been deconfigured An example is the status of a memory DIMM which is healthy when the blade in which it is located is deconfigured The DIMM status is then PD PI Parent Indicted Similar to PD except the parent resource has been indicted I D Indicted and Deconfigured A resource has been indicted and deconfigured PI PD Parent Indicted and Parent Deconfigured A parent...

Page 43: ...ystem LED status information The LEDs provide initial status and health information LED information should be verified by the other sources of status information See LEDs and components on page 55 for more information TIP The OA CLI is the most efficient way to verify the information provided from LEDs OA access You can access the OA by entering the 169 254 1 x address using either a Telnet sessio...

Page 44: ...all of the system components Compute enclosure Use the show enclosure status and show enclosure powersupply all commands sd oa1 show enclosure status Enclosure 1 Status OK Enclosure ID OK Unit Identification LED Off Diagnostic Status Internal Data OK Thermal Danger OK Cooling OK Device Failure OK Device Degraded OK Redundancy OK Indicted OK Onboard Administrator Status OK Standby Onboard Administr...

Page 45: ... Failure OK Power Cord OK Indicted OK Similar information will be displayed for all other power supplies Collecting power status information for components at the compute enclosure Use the show xfm status all show blade status all and show interconnect status all commands to gather information on compute enclosure component power if in use NOTE OA displays XFM2 information as SXFM NOTE Similar inf...

Page 46: ...lt OK Health LED OK UID Off Powered On Diagnostic Status Internal Data OK Management Processor OK Thermal Warning OK Thermal Danger OK I O Configuration OK Power OK Device Failure OK Device Degraded OK Gathering cooling related information Use the following commands to gather all complex cooling information show enclosure fan all sd oa1 show enclosure fan all Fan 1 Information Status OK Speed 60 p...

Page 47: ... OK Device Info OK Firmware Mismatch OK Mezzanine Card OK Deconfigured OK PDHC OK Indicted OK show xfm status all sd oa1 show xfm status all Bay 4 SXFM Status Health OK Power On Unit Identification LED Off Diagnostic Status Internal Data OK Management Processor OK Thermal Warning OK Thermal Danger OK Power OK Firmware Mismatch OK Indicted OK Link 1 Dormant Link 2 Dormant Link 3 Dormant Link 4 Dorm...

Page 48: ...cation Event ID 3040 Server blade appears non functional Provider Name CPTIndicationProvider Event Time Fri May 18 04 56 22 2014 Indication Identifier 8304020120518045622 Managed Entity OA Name sd oa1 System Type 59 System Serial No USExxxxxS OA IP Address aa bb cc dd Affected Domain Enclosure Name lc sd2 RackName sd2 RackUID 02SGHxxxxAVY Impacted Domain Complex Complex Name SD2 Partition ID Not A...

Page 49: ...EM Serial Number NA Version Info Complex FW Version 7 4 2 Provider Version 8 34 Error Log Data Error Log Bundle 4000000000000e41 Recommended troubleshooting methodology The recommended methodology for troubleshooting a complex error or fault is as follows Procedure 1 Consult the system console for any messages emails or other items pertaining to a server blade error or fault 2 Use the SHOW PARTITI...

Page 50: ...ight Display on page 112 Log viewers See Using event logs on page 72 Offline and Online Diagnostics See Troubleshooting tools on page 55 Analyze events For information about using HPE Insight Remote Support to analyze system events see http www hpe com info insightremotesupport Developer log collection The OA will automatically save a set of debug logs when it notices daemon failures on the PDHC o...

Page 51: ...COPY archive archive name FTP ftp path 5 CLEAR ARCHIVE FTP example zomok oa UPLOAD DEBUG ARCHIVE dec zomok oa SHOW ARCHIVE Debug Logs Time ________________________________________________ ____________________ archive dec zomok oa 20140529_1513 logs tar gz May 29 2014 15 13 archive CH zomok oa 20140527_1605 logs tar gz May 27 2014 16 05 archive CH zomok oa 20140525_0534 logs tar gz May 25 2014 05 3...

Page 52: ...d configuration NOTE You cannot access the OA at this time 1 Verify that at least one upper and one lower power supply has the following normal LED status The power supply power LED is on The power supply fault LED is off 2 If the OA tray has a single OA installed reseat the OA and the OA tray 3 If two OAs are installed locate the OA with the Active LED illuminated and either reset the active not ...

Page 53: ...ries related to processors processor power modules shared memory and core I O devices See Using event logs on page 72 4 Review the OA SHOW ALL section for the SHOW SERVER PORT MAP bay to verify that the SAN port is connected Then check the SAN switch for failures and verify the correct configuration 3c PXE fails to find the boot file on the network UEFI is running Nothing can be logged for this co...

Page 54: ... core I O devices Make sure there are no indictments or any hardware issue or known firmware issue See Using event logs on page 72 2 Use the OA CLI TC command to initiate a TOC to reset the partition 3 Reboot the OS and escalate 4 Obtain the system software status dump for root cause analysis The issue is fixed when the OS becomes responsive and the root cause is determined and corrected 7a MCA oc...

Page 55: ... processors processor power modules shared memory and core I O devices See Using event logs on page 72 for more details The issue is fixed when the root cause is determined and corrected 8 The OA CLI and GUI display this message Data stored in the OA and DVD module do not match that in the enclosure The complex is unusable To recover fix this problem and reboot the OA Consult the Hewlett Packard E...

Page 56: ...vity 4 NIC icon 2 Indicates the status of the NIC Solid green Network linked no activity Flashing green Network linked activity 5 NIC icon 3 Indicates the status of the NIC Solid green Network linked no activity Flashing green Network linked activity 6 NIC icon 4 Indicates the status of the NIC Solid green Network linked no activity Flashing green Network linked activity 7 Health icon Off Server b...

Page 57: ...OTE The power supplies at the top of the enclosure are upside down Power LED 1 green Fault LED 2 amber Condition Off Off No AC power to the power supply On Off Normal Off On Power supply failure Fan LED Troubleshooting 57 ...

Page 58: ...us LED 1 N A for Integrity Superdome X 5 XFM crossbar fabric port 2 6 Link Cable Status LED 2 N A for Integrity Superdome X 7 XFM crossbar fabric port 3 8 Link Cable Status LED 3 N A for Integrity Superdome X 9 XFM crossbar fabric port 4 10 Link Cable Status LED 4 N A for Integrity Superdome X 11 XFM crossbar fabric port 5 12 Link Cable Status LED 5 N A for Integrity Superdome X 13 XFM crossbar fa...

Page 59: ...ed Deconfigured XFM2 LEDs and components Item Name Description 1 UID LED Blue UID on 2 Power LED Indicates if the module is powered on Green On 3 XFM crossbar fabric port 1 4 Link Cable Status LED 1 N A for Integrity Superdome X 5 XFM crossbar fabric port 2 6 Link Cable Status LED 2 N A for Integrity Superdome X 7 XFM crossbar fabric port 3 8 Link Cable Status LED 3 N A for Integrity Superdome X 9...

Page 60: ...e UID on 3 Health LED Flashing yellow Degraded indicted Off OK Flashing red Deconfigured 4 CAMNet port 1 N A for Integrity Superdome X 5 CAMNet port 2 N A for Integrity Superdome X 6 CAMNet port 3 N A for Integrity Superdome X 7 CAMNet port 4 N A for Integrity Superdome X 8 CAMNet port 5 N A for Integrity Superdome X 9 CAMNet port 6 N A for Integrity Superdome X 10 CAMNet port 7 N A for Integrity ...

Page 61: ... the status of the global clock signal distributed to connected enclosures Flashing green No clock signal expected Unused for this release of the system 14 Global clock connector 3 15 Global clock connector 2 16 Global clock connector 1 17 Enclosure DVD module USB port NOTE To ensure proper system functionality you must connect the USB cable between the OA module and the GPSM OA module LEDs and co...

Page 62: ...es which OA is active 5 Health LED Green OK Red Critical error 6 USB USB 2 0 Type A connector used for connecting the enclosure DVD module Connects to the USB mini A port on the GPSM NOTE You must connect the USB cable between the OA module and the GPSM to ensure proper system functionality 7 Serial debug port Serial RS232 DB 9 connector with PC standard pinout IMPORTANT This port is for OA debug ...

Page 63: ...ption of each failure event on the system that results in a service request even after a component is removed or replaced History of component identities Information in the HR database is stored as installation and action records These records are organized with component physical location as the key Indictment Records Indictment refers to a record specifying that a component requires service The ...

Page 64: ...he records are linked together If one is acquitted the acquittal will be passed to the cohort FRUs as well HR test commands The test camnet and test clocks commands will acquit all indictments specific to the test to be executed Resources that fail the test will be re indicted as the test completes The test fabric command acquits each type fabric CAMNet Global Clock of indictment before initiating...

Page 65: ...s Logged FRU Type Blade DIMM Location 0x0100FF0600060A74 enclosure1 blade6 cpusocket0 dimm6 Timestamp Fri Jun 26 16 35 24 2015 Indictment State Indicted Requested Deconfig State Deconfigured Current Deconfig State Deconfigured dimm 1 6 0 1 Location 6A Status OK No Errors Logged end report 2 records shown To see details about a specific FRU use show loc path To see additional deconfiguration detail...

Page 66: ...rrent deconfiguration states shown in the examples above are not the same This can happen when requested deconfiguration changes are not be acted on until the n Par containing the component in question is rebooted Viewing recent service history You can view the recent service history using the show acquit command To view the installation history for the acquitted locations enter show physical loca...

Page 67: ...cription Memory Uncorrectable Error An uncorrectable memory error has occurred most likely in the server s memory DIMMs or the blade Bundle ID 0x011000000000AF3D Alert ID 2700420140317074056 Serial Num 1X123456 Product Name DDR3 DIMM Indicted Acquitted Type Timestamp Entity Reason Ind Mon Mar 17 07 40 52 2014 CAE See reason above SubFRUs requiring service are shown here If none the section is omit...

Page 68: ...If a subcomponent deconfiguration event occurs the corresponding subcomponent Isolation will also be set which triggers an indictment of the parent component The sections below show examples of how the subcomponent isolation sections look NOTE The format of the deconfiguration sections look identical to those for Isolation so are not shown in the following sections Blade subcomponent displays Ther...

Page 69: ...Superdome X there are FlexLOMs instead of LOMs Each FlexLOM has its own physical location Therefore indictments against FlexLOMs are issued against the FlexLOM physical location rather than indicting the blade and setting one of the LOM bits The blade SubFru isolation display will continue to show LOM bits but these should always have a value of 0 Components supported by this display are as follow...

Page 70: ...r QPI links can range from 0 to 2 The SubFRU deconfiguration display section has the same layout as the SubFru Isolation display Memory subsystem SubFru Isolation Blade Memory Subsystem Socket 0 Memory Controller 0 Memory SMI DDR Buffer Channel Channel 0 1 0 1 0 0 0 0 0 0 1 0 0 0 0 0 The SubFRU deconfiguration display section has the same layout as the SubFru Isolation display Connections for I O ...

Page 71: ...he L3 cache I indicates Instruction For example the FLI cache is the First Level Instruction cache D indicates Data For example the MLD cache is the Mid Level Data cache VRMs supported by this display are as follows FP_regs GP_regs other an unspecified fault has been identified within the processor core CPU memory SubFru Isolation Processor Module Intel Xeon R E7 8800 processor Memory 0 1 Mbox 0 0...

Page 72: ... 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 0 0 0 0 VRM Fault VRMs reported by this display are as follows V3P3_STBY V2P5_STBY V1P2_STBY V1P8_0 V1P8_1 V1P2 CAMNET_A CAMNET_B Using event logs Event logs are generated by software or firmware when an event is detected Some events that cause event records to be generated are as follows Hardware related Example DIMM CPU VRM XNC or PCI BUS failures Software related ...

Page 73: ... any other event viewer meaning that each live event viewer can select its own filter and format options without affecting other live event viewers The log can be filtered using the following items blade number partition number alert level The following format options are also available Keyword This is the default format for all viewers The keyword format supplies the following information about a...

Page 74: ...rd Progress 2 Informational 3 Warning 5 Critical 7 Fatal The following event filter options are available B Blade P Partition V Virtual Partition U Unfiltered Current alert threshold Alert threshold 0 Current filter option Unfiltered Current format option Extended Keyword Select new filter format option or ctrl b to exit or cr to resume display of live events or H for help or C to display column h...

Page 75: ...llowing format options are also available Keyword This is the default format for all viewers The keyword format supplies the following information about an event log number reporting entity type reporting entity ID alert level hexadecimal dump of event records event ID keyword Raw hex The raw hex format supplies the following information about an event hexadecimal dump of event records Text The te...

Page 76: ...lert thresholds will cause events at the selected threshold and below to be shown 0 Minor Forward Progress 1 Major Forward Progress 2 Informational 3 Warning 5 Critical 7 Fatal The following event filter options are available B Blade P Partition V Virtual Partition U Unfiltered Current alert threshold Alert threshold 0 Current filter option Unfiltered Current format option Extended Keyword MP VWR ...

Page 77: ...ON 5512553 03 17 2014 14 28 02 5512552 OA 1 1 1 1 34801f4400e10000 0610000000000000 PARCON_NPAR_STATE_CHANGE 5512552 03 17 2014 14 27 57 5512551 OA 1 1 1 0 1680264000e10000 213a000200170000 PARCON_VPAR_OPERATION 5512551 03 17 2014 14 27 57 To connect to the SEL viewer enter the SHOW SEL command Welcome to the System Event Log SEL Viewer The following SEL navigation commands are available D Dump lo...

Page 78: ...TH 62384 03 17 2014 13 41 20 62383 SFW 1 3 0 0 0 3 2 43882adc01e17831 0000000000000002 MEM_RAS_MODE_ENABLED 62383 03 17 2014 13 41 19 62382 SFW 1 3 0 0 0 3 2 5188297a01e1782f 0000000000000709 CPU_MICROCODE_REVISION 62382 03 17 2014 13 41 18 62381 SFW 1 3 0 0 0 3 2 5188252501e1782d 0000001202450231 BOOT_ROM_REVISION 62381 03 17 2014 13 41 18 62380 SFW 1 3 0 0 0 3 2 43882ae601e1782b 0000000000000044...

Page 79: ...2 Degraded Warning 3 Minor 4 Major 5 Critical 6 Fatal NonRecoverable 7 L i Event ID Event ID Search based on Event Id L v EventCategory Name EventCategory Name all Search based on event category name or view all category names L p npar vpar complex Search based on partition id or complex L t eq le ge mm dd yyyy hh mi ss L t bw mm dd yyyy hh mi ss mm dd yyyy hh mi ss Search based on time of event g...

Page 80: ...6 2014 Indication Identifier 11227020140328155356 Managed Entity OA Name hawk039oa1 System Type 59 System Serial No SFP1236002 OA IP Address 15 242 4 234 Affected Domain Enclosure Name hawk039 RackName hawk039 RackUID 02SGH5141AE2 Impacted Domain Partition Complex Name hawk039 Partition ID 3 SystemGUID 00000000 0000 0000 0000 000000000000 Summary SFW test of SMIF over CHIF interface to Gromit iLO ...

Page 81: ...port Firmware Event Subcategory Other Probable Cause Communications Protocol Error Other Event Subcategory Gromit iLO Configuration Error Event Threshold 1 Event Time Window 0 minutes Actual Event Threshold 1 Actual Event Time Window 0 minutes Record ID 0x0 Record Type E1 Reporting Entity 0x0100ff03ff000017 enclosure1 blade3 cpusocket0 cpucore0 Alert Level 0x3 Data Type 0x16 Data Payload 0x1 Exten...

Page 82: ...ccess Telnet session at the CLI command prompt enter Exit Logout or Quit SSH session 1 Start an SSH session to the OA 2 Enter ssh l username IP address Example ssh l Administrator 16 113 xx yy The authenticity of host 16 113 xx yy 16 113 xx yy can t be established DSA key fingerprint is ab 5e 55 60 2b 71 8f 0c 55 3e 79 3e a2 93 ea 13 Are you sure you want to continue connecting yes no yes Warning ...

Page 83: ...cessor environment EFI typically occur during boot or runtime Boot errors typically related to a core failing self test a QPI link not initializing to full speed or a core or socket not coming out of reset Runtime errors can be due to a hardware or software defect that appears in either a core or uncore I O and XNC errors consult the CAE error logs Most common I O errors are surprise down and comp...

Page 84: ...result in the same symptoms CAE will analyze the failure to determine whether SMI2 is at fault For errors related to SMI2 suspect the CPU the memory buffer or the traces between them The memory buffer is permanently attached to the blade so it cannot be indicted independently Therefore the CPU and or blade are indicted for an SMI2 error If an error occurs on SMI2 replacing DIMMs is unlikely to cor...

Page 85: ... sibling disabled DIMMs These DIMMs are healthy and should not be replaced To identify a possible faulty DIMM use the HR SHOW INDICT command Replace DIMMs that are indicted Do not replace DIMMs that are deconfigured unless there are other indications of a faulty DIMM such as being specifically identified with DIMMERR Solution 3 Cause Using DIMMERR If there are memory errors that do not clearly ind...

Page 86: ...as the following indicates UEFI Driver Loading Bypass Configuration Press 1 Bypass loading UEFI drivers from I O slots 2 Bypass loading UEFI drivers from I O slots and blade LOMs N n Normal loading of UEFI drivers Q q Quit Waiting for user input The Bypass loading UEFI drivers from I O slots and blade LOMs option might be useful when a bad FlexLOM and or mezzanine card UEFI driver is preventing pa...

Page 87: ...SUM or manually using OA CLI There are different bundles for each method For instructions to update firmware and drivers see Manually updating the complex firmware on page 34 and Installing the latest complex firmware using HP SUM on page 34 For more information about installing firmware updates see the detailed instructions provided in the firmware download bundle Always follow the update instruc...

Page 88: ...es These firmware bundles can be installed without requiring any nPartition downtime See the detailed instructions provided in the firmware download bundle for more information System firmware System firmware bundle includes firmware for complex components including the following Server blade firmware not including LOMs Partition firmware for each server blade and OA OA firmware Manageability modu...

Page 89: ...l only update FRUs that do not match the complex firmware version 4 Check for indicts 5 Power on the partition XFM Requires a Complex outage 1 Power OFF all partitions 2 Remove and replace the suspect XFM following the instructions in the service guide IMPORTANT Do not mix XFM and XFM2 crossbar modules in the same system 3 Use the update firmware uri all command pointing it to the uri of a bundle ...

Page 90: ...on of all installed FRUs and will only update FRUs that do not match the complex firmware version 5 Check for indicts NOTE You will see indictments related to the loss of redundancy of the CAMNet 6 Acquit the indictments related to the loss of redundancy of the CAMNet NOTE For blade replacement If the FRU failed in a way that made it unable to join the partition after the failure you might not nee...

Page 91: ... 7 110 34 iSCSI Boot EFI 10 7 110 15 UEFI 10 7 110 34 iSCSI BIOS 107 00a9 HPE FlexFabric 10 Gb 2 port 534FLB 534M Adapter Boot 7 10 37 UEFI 7 10 54 Boot 7 12 83 7 12 31 Interconnect module firmware The system supports the LAN Pass Thru Module the HPE ProCurve 6120XG and 6125XLG blade switches and the HPE 4X FDR Infiniband Switch Symptoms of possible firmware issues include erratic server blade com...

Page 92: ...ar 28 17 21 44 mgmt Blade 7 Ambient thermal state is OK Mar 28 17 26 31 parcon Note Partition Controller has initialized all partition permissions to the default behavior Mar 28 17 28 53 parcon Note nPartition 2 Power On of nPartition completed Mar 28 17 29 37 mgmt Blade 2 Ambient thermal state is OK Mar 28 17 29 37 mgmt Blade 4 Ambient thermal state is OK Mar 28 17 29 37 mgmt Blade 6 Ambient ther...

Page 93: ...twork The non restricted ranges may be used for iLOs and OAs as long they are not duplicated generate IP address conflicts In addition all the IP addresses must be within the same subnet defined by netmask and IP address so that all OAs as well as all iLOs fit into that subnet Use the show ebipa and show OA network all commands to check the network settings for iLO and OA SHOW EBIPA EBIPA Device S...

Page 94: ...time use the procedure described in Show complex status below IMPORTANT The HR test fabric requires a complex outage Before running HR test fabric all indicted and deconfigured parts must be cleared and the partition must be powered off NOTE Test fabric includes both test camnet and test clocks OA1 HR test fabric Begin test 1 System Fabric Components Acquitting any current fabric and CAMNet indict...

Page 95: ...oubleshooting clock related issues Cause Clocks are provided by the GPSM module and are redundant within a complex Use the command HR test clocks to check for clock related issues as follows NOTE This command can be run while the partitions are active HR test clocks Clocks test started Blade Sys Clk 0 Sys Clk 1 Blade 1 1 OK OK Blade 1 2 OK OK Blade 1 3 OK OK Blade 1 4 OK OK Blade 1 5 OK OK Blade 1...

Page 96: ...0142 System Int 1 Wed Aug 13 06 35 0 2014 PCIe Link show cae E n 72287 Alert Number 72287 Event Identification Event ID 100142 Provider Name PCIeIndicationProvider Event Time Wed Aug 13 06 35 06 2014 Indication Identifier 310014220140813063506 Managed Entity OA Name System Type System Serial No OA IP Address Affected Domain Enclosure Name RackName RackUID Impacted Domain Complex Name Partition ID ...

Page 97: ...00aae3 1 Thu Jan 16 21 43 45 CET 2014 0x011000000000aae2 1 Mon Jan 13 11 44 30 CET 2014 0x011000000000aae1 1 Mon Jan 13 11 43 27 CET 2014 0x011000000000aadf 1 Tue Dec 10 01 07 39 CET 2013 0x013000000000aac0 1 Sun Dec 8 01 12 08 CET 2013 0x011000000000aadd 1 Sat Dec 7 01 58 05 CET 2013 0x011000000000aadc 1 Sat Dec 7 01 57 02 CET 2013 If an MCA of interest is found it can be captured by running the ...

Page 98: ...ibrary www hpe com info EIL Single Point of Connectivity Knowledge SPOCK Storage compatibility matrix www hpe com storage spock Storage white papers and analyst reports www hpe com storage whitepapers For additional websites see Support and other resources 98 Websites ...

Page 99: ... method To download product updates Hewlett Packard Enterprise Support Center www hpe com support hpesc Hewlett Packard Enterprise Support Center Software downloads www hpe com support downloads Software Depot www hpe com support softwaredepot To subscribe to eNewsletters and alerts www hpe com support e updates To view and update your entitlements and to link your contracts and warranties with yo...

Page 100: ...com services proactivecaresupportedproducts HPE Proactive Care advanced service Supported products list www hpe com services proactivecareadvancedsupportedproducts Proactive Care customer information Proactive Care central www hpe com services proactivecarecentral Proactive Care service activation www hpe com services proactivecarecentralgetstarted Warranty information To view the warranty for you...

Page 101: ... com info ecodata For Hewlett Packard Enterprise environmental information including company programs product recycling and energy efficiency see www hpe com info environment Documentation feedback Hewlett Packard Enterprise is committed to providing documentation that meets your needs To help us improve the documentation send any errors suggestions or comments to Documentation Feedback docsfeedba...

Page 102: ... to the file to be loaded These variables contain application specific data that is passed directly to the UEFI application UEFI variables provide system firmware a boot menu that points to all the OSs even multiple versions of the same OSs The UEFI System Utilities allows you to control the server s booting environment Depending on how boot options are configured after the server is powered up th...

Page 103: ...evtree Displays the UEFI Driver Model compliant device tree dh Displays UEFI handle information disconnect Disconnects one or more UEFI drivers from a device dmem Displays the contents of memory dmpstore Displays all UEFI NVRAM variables drivers Displays the UEFI driver list drvcfg Initiates the Driver Configuration Protocol drvdiag Initiates the Driver Diagnostics Protocol echo Controls batch fil...

Page 104: ...interface info Displays hardware information input Take user input and place in UEFI variable ioconfig Deconfigures or reconfigures I O components or settings lanaddress Displays LAN devices lanboot Performs LAN boot load Loads and optionally connects one or more UEFI drivers loadpcirom Loads a PCI Option ROM ls Displays a list of files and subdirectories in a directory map Displays or defines map...

Page 105: ...le devices sermode Sets serial port attributes set Displays or modifies UEFI Shell environment variables setsize Sets the size of a file setvar Changes the value of a UEFI variable shift Shifts batch file input parameter positions smbiosview Displays SMBIOS information stall Stalls the processor for the specified number of microseconds svrconfig Controls server settings tftp Performs TFTP operatio...

Page 106: ...ff extended character features Boot Maintenance Manager This menu allows you to change various boot options The Boot Maintenance Manager contains the following submenus Boot Options Menu Driver Options Menu Boot From File Set Boot Next Value Menu Set Time Out Value Menu 106 Boot Maintenance Manager ...

Page 107: ...Boot Options The Boot Options menu contains the following options Add Boot Option Delete Boot Option Change Boot Order Driver Options The Driver Options menu contains the following options Utilities 107 ...

Page 108: ...o be stored in a disk file as a backup in case they are lost parremove parcreate corrupted NVRAM To save the NVRAM variables onto the redundant disks use the UEFI command dmpstore all s filename To restore the EFI variables use dmpstore all l filename NOTE Redundant paths to disks might not be seen by default at EFI without boot entries You might need to use reconnect r and map r to locate all of ...

Page 109: ...f the laptop or PC is running Linux you must probably manually set the network port to 169 254 2 1 with a netmask of 255 255 0 0 Procedure 1 Connect a laptop or PC 100 1000Mb Ethernet port to the enclosure service link up port on the OA interposer using a standard CAT5e patch cable 2 Access an active OA as follows To access an active OA GUI Use the active OA service IP address from the Insight Dis...

Page 110: ...rations Procedure 1 Connect a serial cable between the serial port on the computer and the serial port on the OA module The following table is for the DB9 serial RS232 port and shows the pinout and signals for the RS232 connector The signal direction is DTE computer relative to the DCE OA NOTE A laptop or PC connected to the OA serial port requires a null modem cable The minimum connection to an e...

Page 111: ...he serial baud rate must be adjusted from the OA to match the serial baud rate coming from the OS modify the OS serial console from the default 9600 baud using HPONCFG command from the OA CLI Set the baud rate serial speed by entering the value shown in the table below SET SCRIPT MODE ON HPONCFG bay EOF RIBCL VERSION 2 0 LOGIN USER_LOGIN adminname PASSWORD password RIB_INFO MODE write MOD_GLOBAL_S...

Page 112: ... light turns off Pressing any button on the Insight Display reactivates the screen Amber The Insight Display illuminates amber when the OA detects an error or alert condition The screen displays the details of the condition After two minutes of inactivity the Insight Display flashes amber indicating that an error or alert condition exists If the enclosure UID is on and an error or alert condition ...

Page 113: ...resent in the Main Menu TIP Within any menu option navigate the cursor to What is This and press the OK button to view additional information about each setting option or alert The navigation bar contains options to do the following Navigate forward and backward through alert screens Return to the main menu Accept changes to current settings Cancel changes to current settings Access the Health Sum...

Page 114: ...dicates no DVD is connected to the OA while a dark gray rectangle indicates the DVD drive is present but that no media is present A dark green rectangle indicates that media is present but not actively connected to any server or that all connected servers have issued a disk eject command so the disk can be removed from the drive A bright green rectangle indicates that the media is present in the d...

Page 115: ...om changes Navigate the cursor to a setting or to and press OK to change the setting or get help on that setting Enclosure Info screen The Enclosure Info screen displays information about the enclosure including the following Active OA IP address Active OA Service IP address Current health status of the enclosure Current enclosure ambient temperature Current AC input power to the enclosure Enclosu...

Page 116: ...creen select the server blade number and then press the OK button Select Blade Info or Port Info and press the OK button To view information about the server blade select Blade Info and press the OK button NOTE The screen below does not depict the fully loaded blade supported for this release 116 Blade and Port Info screen ...

Page 117: ...onnected to particular port numbers on the interconnect modules Turn Enclosure UID On Off screen The Main Menu displays Turn Enclosure UID Off when the enclosure UID is active and displays Turn Enclosure UID on when the enclosure UID is off Selecting Turn Enclosure UID On from the main menu turns on the rear enclosure UID LED and changes the color of the Insight Display screen to blue Turn Enclosu...

Page 118: ...ses the web interface to send a message to an enclosure Insight Display The technician uses the Insight Display buttons to select from a set of prepared responses or dials in a custom response message on the line To send a response back to the Administrator navigate the cursor to Send then press the OK button The Chat Mode screen has top priority in the Insight Display and remains on the screen un...

Page 119: ...uggests corrective action to clear the current error Next Alert Displays the next alert or if none exist displays the Health Summary screen Previous Alert Displays the previous alert Power errors Power errors can occur because of insufficient power to bring up an enclosure Power errors can occur on server blades or interconnect modules To correct a power error do the following Procedure 1 Use the ...

Page 120: ... are installed in the wrong bays or if mezzanine cards are installed in the wrong connectors in the server blade Configuration errors can occur on server blades and interconnect modules Integrity Superdome X systems are configured such that these errors should not occur unless the components have been moved To correct a configuration error do the following Procedure 1 Use the arrow buttons to navi...

Page 121: ...ction suggested by the Insight Display In most cases you must remove the failed component to clear the error 3 Replace the failed component with a spare if applicable NOTE If the device failure error is an ac power input failure error you must have the failed ac input repaired to clear the error Insight Display 121 ...

Page 122: ...ties HPE Enterprise Servers www hpe com support EnterpriseServers Warranties HPE Storage Products www hpe com support Storage Warranties HPE Networking Products www hpe com support Networking Warranties Regulatory information Belarus Kazakhstan Russia marking Manufacturer and Local Representative Information Manufacturer information Hewlett Packard Enterprise Company 3000 Hanover Street Palo Alto ...

Page 123: ...w decade with 2000 as the starting point for example 238 2 for 2002 and 38 for the week of September 9 In addition 2010 is indicated by 0 2011 by 1 2012 by 2 2013 by 3 and so forth YYWW where YY indicates the year using a base year of 2000 for example 0238 02 for 2002 and 38 for the week of September 9 Turkey RoHS material content declaration Ukraine RoHS material content declaration Turkey RoHS m...

Page 124: ...E Customer engineer CEC Core electronics complex CMA Cable management arm CMC Corrected machine check CNA Converged Network Adapter CPE Correctable platform error CRAC Computer room air conditioner CRAH Compute room air handler CRU Customer replaceable unit CSR Control status registers DDNS Dynamic domain name system DHCP Dynamic host configuration protocol DLL Dynamic link library DMA Direct memo...

Page 125: ...ntegrity Data Collector iLO 4 Integrated Lights Out 4 IRC Integrated Remote Console IRS Insight Remote Support KVM Keyboard Video and Mouse LAN Local Area Network LDAP Lightweight directory access protocol LOM LAN on motherboard LVM Logical volume manager MCA Machine check abort MPS Maximum payload size NVRAM Nonvolatile RAM OA Onboard Administrator PA RISC Precision Architecture Reduced Instructi...

Page 126: ...are SIM System insight manager SMBIOS System management BIOS SMH System management home page SGPIO Serial general purpose input output SSH Secure Shell STM Support tool manager SUV Serial USB Video A single board containing these three functions A single connector attaches to the SUV board and has three ends one for Serial DB9 one for USB and one for video DB15 SXFM x86 enhanced performance crossb...

Page 127: ...terruptible power supply USB Universal serial bus VRM Voltage regulator module WBEM Web based enterprise management XBar Crossbar XFM Crossbar Fabric Module XFM2 Crossbar Fabric 2 Module Displayed as SXFM by the OA XPF x86 x64 Processor Family Standard terms abbreviations and acronyms 127 ...

Reviews: