background image

Diagnosing SAS Data Path Failures on Servers Using MegaRAID Disk Controllers

For more details, refer to information about viewing system information and inventory

in the 

Oracle ILOM Administrators Guide for Configuration and Maintenance

, which is

available at 

http://www.oracle.com/goto/ilom/docs

.

2.

Ensure that the server firmware version is at the minimum required version,

shown above, or a subsequent release, if available.

3.

If the required firmware (or newer) is not installed:

a.   

Download the firmware from My Oracle Support at: 

https://support.oracle.

com

b.   

Install the downloaded firmware.

Refer to the information about performing firmware updates in the 

Oracle ILOM

Administrators Guide for Configuration and Maintenance

, which is available at 

http:

//www.oracle.com/goto/ilom/docs

. Ensure that you perform the preparatory steps

described in that document before updating the firmware.

Note - 

Occasionally after installing the firmware, the Oracle ILOM web interface cannot

display the power state correctly on the power control page. To correct this problem, clear your

browser cache before logging in to the Oracle ILOM web interface.

Diagnosing SAS Data Path Failures on Servers Using

MegaRAID Disk Controllers

On Oracle x86 servers using MegaRAID disk controllers, Serial Attached SCSi (SAS) data path

errors can occur.

To triage and isolate a data path problem on the SAS disk controller, disk backplane (DBP),

SAS cable, SAS expander, or hard disk drive (HDD), gather and review the events in the disk

controller event log. Classify and analyze all failure events reported by the disk controller based

on the server SAS topology.

To classify a MegaRAID disk controller event, gather and parse the MegaRAID disk controller

event logs using the 

MegaCLI

 command:

For example, at the root prompt, type:

root# 

./MegaCli64 adpeventlog getevents –f event.log aall

Success in AdpEventLog

Important Operating Notes

15

Summary of Contents for Sun Fire X4800

Page 1: ...Sun Fire X4800 Server Product Notes Part No E69641 07 July 2017 ...

Page 2: ......

Page 3: ...ormation management applications It is not developed or intended for use in any inherently dangerous applications including applications that may create a risk of personal injury If you use this software or hardware in dangerous applications then you shall be responsible to take all appropriate fail safe backup redundancy and other measures to ensure its safe use Oracle Corporation and its affilia...

Page 4: ...n des informations Ce logiciel ou matériel n est pas conçu ni n est destiné à être utilisé dans des applications à risque notamment dans des applications pouvant causer un risque de dommages corporels Si vous utilisez ce logiciel ou ce matériel dans le cadre d applications dangereuses il est de votre responsabilité de prendre toutes les mesures de secours de sauvegarde de redondance et autres mesu...

Page 5: ...tice for Default Self Signed Certificate 17 Broken Links in Sun Server X4800 Documentation Library 17 Supported Software and Firmware 19 Supported Operating Systems 19 Supported Firmware 20 Additional Software 20 Sun Java Enterprise System 20 Oracle Enterprise Manager Ops Center 21 MegaRAID Storage Manager MSM 21 SunVTS Bootable Diagnostics CD ROM 21 Oracle Integrated Lights Out Manager Oracle ILO...

Page 6: ...Module 15465497 33 MegaCLI CfgEachDskRaid0 Command Does Not Work Correctly CR 7121867 33 DIMM Mismatch on Odd Numbered Pairs Results in DDR Training Failed on Entire Branch CR7101624 and CR7111545 34 If DIMM Failure Causes Branch Failure Oracle ILOM Might Not Identify Them as Faulty CR7099038 and CR7111543 35 When Inserting a CMOD Simultaneously Rotate Handles Until They Touch the Chassis 35 Video...

Page 7: ...tion Fails With AER Error on Systems With Four or More Sun InfiniBand Dual Port 4x QDR PCIe Low Profile Host Channel Adapter M2 Cards 22686146 52 False MCE Errors Appear in var log mcelog CR 7104293 52 I O Does Not Work On SLES11 SP1 with XEN CR 6965290 and CR7110443 53 RHEL6 Kdump Runs Out of Memory CR 7000993 and CR 7000942 53 SLES11 SP1 Does Not Produce A Kdump File After A Crash CR7001706 54 S...

Page 8: ...582 64 Messages Complain of Failure to Allocate I O Resources CR 6984329 65 Windows 2008 Cannot Be Installed to Combo GbE 8Gb FC ExpressModule HBA Disks CR 6984746 66 Hot Plugging of PCIe ExpressModules Is Not Supported by Windows 2008 66 ESX Issues 67 Messages Complain of Failure to Allocate I O Resources CR 6984329 67 Option Cards Can Fail to Load Device Drivers in Some Configurations CR 6933436...

Page 9: ...re Open CR 6917474 82 Allocated Power Figures Are Incomplete CR 6931837 82 Fixed in SW 1 1 LDAP Account Cannot Be Used to Start Console CR 6969473 83 Fixed in SW 1 1 Power Cycling the Host Using the Web Interface Generates an Error CR 6909374 83 Fixed in SW 1 1 DIMM Mismatch Fault Not Cleared After DIMM Replacement and Host Power Cycle CR 6972285 83 Fixed in SW 1 0 1 Console Redirect from CLI Fail...

Page 10: ...her Devices Firmware CR 6537282 89 Hotplugging PCIe EM Card Fails When Replacing the Original Card With One of a Different Type CR 7003634 90 Messages Complain of Failure to Allocate I O Resources CR 6984329 90 DIMM Failure Causes Other DIMMs To Be Disabled CR 6929978 91 10 Sun Fire X4800 Server Product Notes July 2017 ...

Page 11: ...ork administrators and service technicians Required knowledge Advanced understanding of server systems Product Documentation Library Documentation and resources for this product and related products are available at http docs oracle com cd E19140 01 index html Feedback Provide feedback about this documentation at http www oracle com goto docfeedback Using This Documentation 11 ...

Page 12: ...12 Sun Fire X4800 Server Product Notes July 2017 ...

Page 13: ...es on page 31 Oracle Solaris 10 Operating System Issues on page 43 Linux Issues on page 51 Oracle VM Issues on page 61 Windows Operating System Issues on page 63 ESX Issues on page 67 Oracle ILOM Enhancements and Issues on page 71 Oracle Hardware Installation Assistant Issues on page 87 BIOS Issues on page 89 Server Security Software Releases and Critical Patch Updates To ensure continued security...

Page 14: ...e ensures that you have the most up to date software and security patches To confirm that you have the latest OS release refer to the Oracle Hardware Compatibility Lists See Supported Operating Systems on page 19 For details about the current system software update see IMPORTANT Install Latest OS Updates Patches and Firmware on page 14 IMPORTANT Install Latest OS Updates Patches and Firmware Some ...

Page 15: ...document before updating the firmware Note Occasionally after installing the firmware the Oracle ILOM web interface cannot display the power state correctly on the power control page To correct this problem clear your browser cache before logging in to the Oracle ILOM web interface Diagnosing SAS Data Path Failures on Servers Using MegaRAID Disk Controllers On Oracle x86 servers using MegaRAID dis...

Page 16: ...formation about the diagnosis and triage of hard disk and SAS data path failures on x86 servers at the My Oracle Support web site https support oracle com Refer to the Knowledge Article Doc ID 2161195 1 If there are multiple simultaneous disk problems on an Exadata server Oracle Service personnel can refer to Knowledge Article Doc ID 1370640 1 Oracle ILOM Deprecation Notice for IPMI 2 0 Management...

Page 17: ... certificate that is provided by Oracle ILOM Customer provided SSL certificates will not be impacted by this change For future updates about the default SSL self signed certificate that is provided by Oracle ILOM refer to the latest firmware release information in the Oracle ILOM Feature Updates and Release Notes for Firmware 3 2 x Broken Links in Sun Server X4800 Documentation Library The followi...

Page 18: ...ndex html Sun Fire X4800 Server Rack Mounting and Shipping Bracket User s Guide Sun Fire X4800 Server product page http www oracle com goto x4800 http www oracle com us products servers storage servers x86 sun fire x4800 ds 079895 pdf Sun Fire X4800 Server Installation Guide for Linux Operating Systems Source for syslinux http www kernel org pub linux utils boot syslinux http www syslinux org wiki...

Page 19: ...perating Systems To find the latest operating system version supported for the Sun Fire X4800 go to the following sites and search using your server model number Oracle Solaris http www oracle com webfolder technetwork hcl index html Oracle Linux http linux oracle com pls apex f p 117 1 3991604960223967 Oracle VM http linux oracle com pls apex f p 117 1 3991604960223967 Windows http www windowsser...

Page 20: ...Oracle ILOM 3 2 9 25 r116305 BIOS 11080300 Additional Software Note To obtain optimal performance security and stability install system software release 1 11 0 or newer Oracle recommends that you always install the latest available firmware The following additional software is available for download Oracle Hardware Installation Assistant OHIA 2 5 SunVTS 7 0 Patch Set 12 or later Sun Java Enterpris...

Page 21: ...aCLI command line configuration utility to manage your RAID controllers These applications are available on the Tools and Drivers image on the product download site MSM enables you to easily configure the controllers disk drives and virtual disks on your system The Configuration wizard greatly simplifies the process of creating disk groups and virtual disks The Configuration wizard guides you thro...

Page 22: ...llows a remote user to perform most maintenance operations including installing an operating system For more information on Oracle ILOM refer to the Oracle ILOM 3 2 collection at http www oracle com goto ilom docs Oracle Hardware Management Pack Oracle Hardware Management Pack allows you to monitor hardware through the host operating system either remotely using SNMP or locally using command line ...

Page 23: ...Replaceable Units FRUs This is not always true Some CRUs and FRUs are shipped without antistatic wrist straps Previous Versions of the Service Manual Incorrectly State That a FEM Is Only Supported on CMOD 0 CR 7107085 In previous versions of the service manual in the section Removing and Installing a Fabric Expansion Module CRU the notes incorrectly state that the Fabric Expansion module FEM is su...

Page 24: ...nting Bracket Illustration from Installation Guide is Incorrect A figure in How to Install the Rack Mounting Hardware in a Round Hole Rack in Sun Fire X4800 Server Installation Guide is incorrect It shows the screws for installing the front mounting bracket in a threaded rack incorrectly being inserted from the front of the rack Instead the screws should be inserted from the back of the rack The f...

Page 25: ...umentation Errata This figure is incorrect in 821 0285 10 the printed version of this document They are shown correctly in 821 0285 11 the online version of this document Supported Software and Firmware 25 ...

Page 26: ...26 Sun Fire X4800 Server Product Notes July 2017 ...

Page 27: ...ntext and descriptive text available to assistive technologies to aid in interpreting status and understanding the system System level descriptions and status indicator interpretation can be found in the product Service Manual The documentation also provides diagrams and screenshots that do not rely on color Within the diagrams callouts indicate the referenced component information The callout des...

Page 28: ... read the content of the screen you can use the CLI as an equivalent means to access the color based mouse based and other visual based utilities that are part of the BUI For example you can use a keyboard to enter CLI commands to identify faulted hardware components check system status and monitor system health You can use the Oracle ILOM Remote Console Plus to access both a text based serial con...

Page 29: ...zers or magnifiers can be used to read the content of the screen Refer to the assistive technology product documentation for information about operating system and command line interface support The CLI tools for using the software are described in the accessible HTML documentation for Hardware Management Pack at http www oracle com goto ohmp docs BIOS Accessibility When viewing BIOS output from a...

Page 30: ...The callouts are mapped within a table to provide text descriptions of the referenced parts of the figures In addition alternative text is provided for all tables and images that provides the context of the information and images Note that screen readers might not always correctly read the code examples in the documentation The conventions for writing code require that closing braces should appear...

Page 31: ...ge 34 Yes If DIMM Failure Causes Branch Failure Oracle ILOM Might Not Identify Them as Faulty CR7099038 and CR7111543 on page 35 Yes When Inserting a CMOD Simultaneously Rotate Handles Until They Touch the Chassis on page 35 Yes Video Output of CMM Is Distorted or Missing on page 35 Yes SATA Drives Not Accessible Through NEM1 CR 7003993 on page 36 Yes Supported Rack Mounting Configuration on page ...

Page 32: ...Bits Support to Enabled the default is Disabled 4 Save your changes and exit the BIOS Setup Utility 5 Reboot the server Reset Takes a Long Time and Causes the Server to Power Cycle If you have a pending BIOS upgrade a routine reset might take longer than expected and might cause your server to power cycle and reboot several times This is expected behavior as it is necessary to power cycle the serv...

Page 33: ...r Replacing a CMOD or SP Module 15465497 When the replacement of either a CMOD or the service processor results in an incompatibilty between the hardware revision of the component and the firmware version of either the SP or the BIOS the recommendation is to maintain compatibility with the SP firmware Therefore update or downgrade the system firmware package to the version compatible with the SP M...

Page 34: ...mber is not 252 substitute your enclosure number in these commands DIMM Mismatch on Odd Numbered Pairs Results in DDR Training Failed on Entire Branch CR7101624 and CR7111545 If a DIMM mismatch error size rank or speed occurs on odd numbered DIMM pairs it might cause a ddr training failed error on the entire branch thus disabling all DIMMs on that branch Furthermore Oracle ILOM might not mark them...

Page 35: ... the Oracle ILOM Fault Manager to view information about active faults fmadm faulty to identify the mismatched DIMM pair Then replace the pair with a matched pair If the fault isn t cleared automatically you must manually clear it using the appropriate Oracle ILOM fmadm repair command as described in the Oracle ILOM documentation When Inserting a CMOD Simultaneously Rotate Handles Until They Touch...

Page 36: ... Fire X4800 servers in a Sun Rack II 1042 1214 as long as your datacenter meets the cooling requirements to support these systems DIMM Population Rules This section provides the rules for adding and replacing DIMMs on a CMOD It updates and enhances the information in DIMM Population Rules in Sun Fire X4800 Server Service Manual For additional information including how to prepare the system for ser...

Page 37: ... Match Within CMOD Match Within Pairs Size 2 GB 4 GB or 8 GB x x Speed 1066 MHz JEDEC or DDR3 ECC RDIMMs x x Density Single or dual rank x x Manufacturer s model number x DIMM Population Order You can add DIMMs according to the following rules You cannot add DIMMs to a 32 DIMM system because it is already fully populated All DIMMs must have their parameters matched as described above Only four or ...

Page 38: ...figuration 2 D8 D12 Blue 3 D2 D6 White 4 D10 D14 White 5 D1 D5 Black 6 D9 D13 Black Fill these first 7 D3 D7 Green 8 D11 D15 Green Fill these last The following figure shows the location of the DIMMs on the CMOD 38 Sun Fire X4800 Server Product Notes July 2017 ...

Page 39: ...king Screws Before You Begin Install the server in the rack as described in Installing the Server in a Rack Using the Universal Rack Mounting Kit in Sun Fire X4800 Server Installation Guide 1 From the rear of the server insert the screw 1 through the rack 2 and the rear mounting bracket 3 so that it protrudes from the mounting bracket just above the flange on the shelf rail 5 For a round hole tapp...

Page 40: ...et before performing this procedure They are shown separated in this picture to highlight the alignment Figure Legend 1 Screw 2 Cage nut used for square hole racks only 3 Rack 4 Rear mounting bracket 5 Shelf rail 2 Repeat Step 1 for the other side 40 Sun Fire X4800 Server Product Notes July 2017 ...

Page 41: ... still does not boot and the same message appears contact Oracle customer service System Does Not Power Up After Power Cycling CR 6950414 Under rare conditions when the system is power cycled it might not power up Workaround Remove and reapply AC power to and from the system Either Switch the AC power Off then switch it back On Remove all the AC power cords from the power supplies and then plug th...

Page 42: ...er show SP faultmgt Oracle ILOM displays the faulted DIMMs with the fault class It might show one of the following fault memory intel nex dimm_ce fault memory intel nex dimm_ue fault memory intel dimm mismatched fault memory intel dimm something else 3 If the fault class is anything else besides dimm_ce or dimm_ue contact your Oracle service representative 42 Sun Fire X4800 Server Product Notes Ju...

Page 43: ... Are Depleted CR 6669984 on page 48 Yes System Might Panic With unowned mutex Message CR 6893274 on page 49 No Hotplugging PCIe Express Modules in Slots 2 0 or 2 1 Might Not Work CR 6954869 on page 49 Yes A System With a Combo GbE 8Gb FC Express Module HBA Might Get a BAD TRAP Panic CR 6942158 on page 49 Yes Oracle Solaris Installation Takes a Long Time On Systems With Four or More Sun InfiniBand ...

Page 44: ...ady includes required patches If you use your own custom OS install environment you must update your install environment with the following patches which can be downloaded from https support oracle com Solaris 10 9 10 requires patch 146025 01 or later It is recommended that you apply the latest patch cluster Solaris 10 10 09 is supported for existing installations but deprecated for new installati...

Page 45: ...09 or higher Note If you are doing a PXE install you might choose to add patch 142091 09 or higher to your install image This allows you to complete the installation without reducing the memory size However you must still reduce the memory size step 2 and add the patch to the installed system step 4 before you can boot it 2 Reduce the system memory to 512 GB We suggest you remove CMODs 3 and 4 You...

Page 46: ...ghts reserved Workaround You can fix this by changing the PXE boot menu to include amd64 on the kernel and module lines The following displays show examples of the lines before and after this change The lines have been wrapped to make the display fit the page Before incorrect kernel I86PC Solaris_10 16 multiboot kernel unix install B install media 10 6 78 11 images s10u8_08a console ttya install_ ...

Page 47: ... because it uses the same method as BIOS However most drivers can manage this condition 1 Usually you can ignore these messages 2 If you continue to experience I O resource issues see I O and Interrupt Resource Allocation in the Sun Fire X4800 Server Installation Guide Patch Required for Solaris FMA For Solaris 10 10 09 in order to use Solaris Fault Management Architecture FMA with your server you...

Page 48: ...nstance 1 and SCI Feb 25 15 45 06 mpk12 3214 189 156 pcplusmp WARNING No interrupt vector pciex8086 10f7 instance 5 Feb 25 15 45 06 mpk12 3214 189 156 pcplusmp WARNING Sharing vectors pciex8086 10f7 instance 1 and pciex8086 10f7 instance 5 In var adm messages Feb 25 15 44 53 mpk12 3214 189 156 ixgbe ID 611667 kern info NOTICE ixgbe7 Insufficient interrupt handles available 1 Feb 25 15 44 53 mpk12 ...

Page 49: ... Message CR 6893274 Under rare conditions the system might panic and display the message turnstile_block unowned mutex This is a known Solaris OS issue Hotplugging PCIe Express Modules in Slots 2 0 or 2 1 Might Not Work CR 6954869 On an 8 socket system hotplugging PCIe express modules in slots 2 0 or 2 1 might not work This is due to a possible shortage of hotplug interrupts on the system Workarou...

Page 50: ...odule HBA Might Get a BAD TRAP Panic CR 6942158 This issue applies to Solaris 10 10 09 It is fixed in Solaris 10 9 10 Workaround Install patch number 143858 03 or newer or install Solaris 10 9 10 50 Sun Fire X4800 Server Product Notes July 2017 ...

Page 51: ...es Not Produce A Kdump File After A Crash CR7001706 on page 54 Yes SLES11 Fails to Install On Systems Equipped with 6 Core CMODs CR 7024769 on page 54 Yes Messages Complain of Failure to Allocate I O Resources CR 6984329 on page 55 Yes Xen Profiler Is Not Supported in Oracle Linux 5 5 CR 6839366 on page 55 No Oracle Linux 5 5 Does Not Support CPU Throttling CR 6847286 on page 55 No InfiniBand PCIe...

Page 52: ...vigate to RC Settings QPI and change MMIOH Size per IOH from 2Gb to 4Gb the default is 2Gb 3 Navigate to RC Settings Chipset NorthBridge Configuration and change PCIE MMIO 64 Bits Support to Enabled the default is Disabled 4 Save your changes and exit the BIOS Setup Utility 5 Finish the operating system installation False MCE Errors Appear in var log mcelog CR 7104293 Occasionally many errors appe...

Page 53: ...isplays a message Your running kernel is using more than 70 of the amount of space you reserved for kdump you should consider increasing your crashkernel reservation One of two conditions might be present It might have enough memory but post the messages anyway It might really not have enough kdump memory space In this case if the system crashes it might not produce a kdump file and it might post ...

Page 54: ...memory SLES11 SP1 Does Not Produce A Kdump File After A Crash CR7001706 After a crash SUSE Linux Enterprise Server 11 SLES11 SP1 might not produce a kdump file In this case it might also display a message similar to the following on the console INFO Cannot find debug information Unable to find debuginfo file 41 368017 Restarting system Workaround Perform a regular update from Novell after installa...

Page 55: ... unable to allocate I O resources successfully it displays error messages The OS might try as well However if it tries it fails and generates additional failure messages because it uses the same method as BIOS However most drivers can manage this condition 1 Usually you can ignore these messages 2 If you continue to experience I O resource issues see I O and Interrupt Resource Allocation in the Su...

Page 56: ...used Workaround Perform one or the other Update to the errata kernel from Novell Put nox2apic in the boot command line in boot grub menu 1st append line LEDs on PCIe ExpressModule Work Incorrectly with Oracle Linux 5 5 CR 6894954 The lights on the PCIe ExpressModule do not work normally on systems with Oracle Linux 5 5 When you insert the PCIe ExpressModule and press the attention button The LED s...

Page 57: ...xample grub conf generated by anaconda Note that you do not have to rerun grub after making changes to this file NOTICE You have a boot partition This means that all kernel and initrd paths are relative to boot eg root hd0 0 kernel vmlinuz version ro root dev sda3 initrd initrd version img boot dev sda default 1 timeout 5 serial unit 0 speed 115200 terminal timeout 5 serial console title Enterpris...

Page 58: ...SFP PCIe 2 0 PCIe ExpressModule Workaround 1 Add the following entry to the etc modprobes conf file options ixgbe InterruptType 1 1 1 1 1 1 1 1 It might be necessary to temporarily remove the Sun Dual 10GbE I2 SFP PCIe 2 0 PCIe ExpressModule to prevent the kernel from crashing long enough for you to modify this file Workaround 2 Install drivers from Novell 1 Navigate to the following page http dri...

Page 59: ...n systems where many PCIe ExpressModule cards are installed Workaround 1 In the GRUB configuration set pci nomsi 2 Boot a non Xen kernel 3 Edit boot grub menu lst 4 Add the following text to the Xen kernel init line stanza pci nomsi For example title Xen SUSE Linux Enterprise Server 11 2 6 27 19 5 root hd0 1 kernel boot xen gz module boot vmlinuz 2 6 27 19 5 xen nn nn n other text parameters etc p...

Page 60: ...60 Sun Fire X4800 Server Product Notes July 2017 ...

Page 61: ...ilures to allocate I O resources might appear in POST and in log files For example you might see Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resource 2 20 0 for 0000 8d 00 0 Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resource 2 20 0 for 0000 8d 00 1 Workaround BIOS tries to allocate I O resources If it is unable to allocate I O resources successfully it displays error ...

Page 62: ...ameters Required on Sun Fire X4800 CR 7094126 To allow the megaraid_sas driver to load correctly you must add parameters to Oracle VM When installing Oracle VM 2 x add the following kernel parameters mboot c32 xen gz extra_guest_irqs 64 2048 nr_irqs 2048 vmlinuz initrd img If booting from the installation media press F2 when the initial boot screen is displayed and add the above parameters to the ...

Page 63: ...Hot Plugging of PCIe ExpressModules Is Not Supported by Windows 2008 on page 66 No Unspecified CPU Fault After Warm Reset CR 7054657 When the server undergoes a reset because of a CPU accessing memory from an uncorrectable DIMM a reboot or warm reset from the server might result in an unspecified CPU fault This has been observed with Windows 2008 R2 SP1 and Windows 2008 SP2 The sequence of events ...

Page 64: ...erver 2008 R2 does not support hot plugging on certain PCIe ExpressModules These include SG XPCIE2FCGBE Q Z Sun StorageTek Dual FC Dual GbE HBA Express Module SG XPCIE2FCGBE E Z Sun StorageTek Dual FC Dual GbE HBA Express Module SG XPCIEFCGBE Q8 Z Sun StorageTek Dual 8GB FC Dual GbE HBA Express Module SG XPCIEFCGBE E8 Z Sun StorageTek Dual 8GB FC Dual GbE HBA Express Module X7284A Z Sun Quad GbE E...

Page 65: ... Find the system date and time and set it 4 Save your work and exit Messages Complain of Failure to Allocate I O Resources CR 6984329 Messages that complain about failures to allocate I O resources might appear in POST and in log files For example you might see Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resource 2 20 0 for 0000 8d 00 0 Sep 8 15 50 49 nsg14 28 kernel PCI Failed to al...

Page 66: ...e OS cannot see the disks Workaround During the installation load the drivers from the USB drive twice Loading the drivers twice is necessary because there are two devices on the Combo GbE 8Gb FC ExpressModule HBA Hot Plugging of PCIe ExpressModules Is Not Supported by Windows 2008 With Windows 2008 SP2 and 2008 R2 the following PCIe ExpressModules cannot be hot plugged Fibre Channel 4 Gigabit Sec...

Page 67: ... 17013064 on page 69 Yes VMware ESXi 5 5 Runs Out of Interrupts With PCIe Cards 16494653 on page 70 Yes Messages Complain of Failure to Allocate I O Resources CR 6984329 Messages that complain about failures to allocate I O resources might appear in POST and in log files For example you might see Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resource 2 20 0 for 0000 8d 00 0 Sep 8 15 50...

Page 68: ... the Service Console device drivers and other data structures The BIOS utilizes 2 GB of this memory area for its own uses With the large number of supported I O devices this memory can become exhausted in some situations VMware describes this problem in their Knowledge Base at https vmware com Workaround Use the following procedure to reduce the amount of memory reserved by the Service Console 1 R...

Page 69: ...klinux26 dma_alloc_coherent Out of memory Sep 24 04 13 21 mpk12 3214 189 114 vmkernel 0 00 00 38 761 cpu0 4096 WARNING PCI 1861 No such device Sep 24 04 13 21 mpk12 3214 189 114 last message repeated 9 times Sep 24 04 13 22 mpk12 3214 189 114 vmkernel 0 00 01 19 828 cpu16 4166 WARNING vmklinux26 dma_alloc_coherent Out of memory VMware ESXi 5 5 Does Not Support MMIO Regions Above 4GB 16480679 17013...

Page 70: ...nough left are unavailable for use VMware ESXi 5 5 Runs Out of Interrupts With PCIe Cards 16494653 In certain configurations VMware ESXi can run out of interrupts for devices this can include storage and networking For more information refer to VMware s Configuration Maximums document for ESXi 5 5 under host maximums http www vmware com pdf vsphere5 r55 vsphere 55 configuration maximums pdf 70 Sun...

Page 71: ...t Allows IPv4 Only IPv6 Only or Dual Stack on page 74 N A New Procedures for Updating CPLD CR 7043418 on page 75 N A Update Oracle ILOM and BIOS Firmware Before Updating Other Device Firmware CR 6537282 on page 75 N A Use the Locate Button to Prove Physical Presence CR 6881237 on page 76 N A NEM Expander Firmware Update Procedure CR 6979140 on page 76 Yes Network Management Port 1 Does Not Work CR...

Page 72: ... Is Not Available CR 6867060 and CR 6904922 on page 85 Yes File Transfer Using URI Fails if Target Password Contains Certain Special Characters 25917655 This problem is fixed in System Software Release 1 11 0 When using Oracle ILOM to transfer files using a Uniform Resource Identifier URI the transfer fails if the target host s password contains any of the following special characters Examples of ...

Page 73: ...rs create snmpv3 user create SP services snmp users newuser authenticationpassword sleep 10 seconds to give snmp enough time to make the change sleep 10 verify user show SP services snmp users newuser do a snmpget with that user to verify it configure alert set SP alertmgmt rules 1 type snmptrap sleep 10 seconds to give snmp enough time to make the change sleep 10 verify alert show SP alertmgmt ru...

Page 74: ...ck ILOM Administration Connectivity Network b Modify the settings on the Network Settings page as required For further details about how to configure the properties on the Network Setting page click the More Details link c Click Save to save your network property changes in Oracle ILOM Note When you save your network settings it might end your Oracle ILOM session If this happens use the new settin...

Page 75: ... 1 6 or newer use set SP network state ipv6 only To commit the IPv4 and IPv6 pending network changes type set SP network commitpending true Note If you are logged in to Oracle ILOM using an Ethernet connection your connection is terminated when you set commitpending to true When this happens log back in using the new settings New Procedures for Updating CPLD CR 7043418 The procedures for updating ...

Page 76: ...LOM Web GUI on page 76 Workaround 2 Using Oracle ILOM CLI on page 79 Workaround 1 Using Oracle ILOM Web GUI Before You Begin Perform the following actions before upgrading your NEM firmware Download the firmware image for your server or CMM from the platform s product web site For details see https support oracle com Copy the firmware image to the system on which the web browser is running using a...

Page 77: ... to Connect to the ILOM Web Interface in Sun Fire X4800 Server Installation Guide 3 Select System Information Components The Component Management page appears 4 Highlight NEM0 5 From the Actions drop down menu select Update Firmware A screen asks for download details Oracle ILOM Enhancements and Issues 77 ...

Page 78: ... URL Then type the URL for the firmware image into the text box b Select a transfer method from the drop down list c Click the Update button to upload the file and update the firmware The Update Status display appears providing details about the update progress When the update indicates 100 the firmware upload is complete When the update is finished it displays the message Firmware Update Successf...

Page 79: ...vileges to update the firmware on the system 1 Restart the server and enter the BIOS screen When POST messages appear press F2 to enter the BIOS the BIOS Setup Utility Note You do not need to configure anything in the BIOS the BIOS Setup Utility It is used in this procedure to ensure that the NEMs are powered on but the OS does not boot 2 Log in to the Oracle ILOM CMM CLI 3 Use the cd command to n...

Page 80: ...ip_address rom_nem pkg SCP load_uri scp username password ip_address rom_nem pkg HTTP load_uri http username password ip_address rom_nem pkg HTTPS load_uri https username password ip_address rom_nem pkg SFTP load_uri sftp username password ip_address rom_nem pkg Where ip_address is the IP address of the system where the file is stored username is the login user name to the system where the file is...

Page 81: ...etwork use an external Ethernet switch Note Oracle ILOM allows you to select which management port to use Even if you select port 1 it does not switch ports start SYS and stop SYS Commands Cause Power Button Pressed Event In Log CR 6906176 When you enter the start SYS and stop SYS commands it causes a Power Button Pressed event to be logged This log entry is incorrect You can ignore these log entr...

Page 82: ... its ID should be 8 and the IDs of subsequent messages should be decremented accordingly Incorrect Error Message in Event Log After Restore When Serial Console or JavaRconsole Session Are Open CR 6917474 If you do a restore while a serial console or a JavaRconsole session are open you might see an error message in the event log For example 409 Restore Log major Fri Feb 26 19 42 40 2010 Config rest...

Page 83: ...ccount to log into Oracle ILOM the start SP console command does not work Workaround Log into Oracle ILOM using a non LDAP account if you want to use the console Fixed in SW 1 1 Power Cycling the Host Using the Web Interface Generates an Error CR 6909374 If you use the Oracle ILOM web interface to power cycle the server it might display an error message stating that the operation failed even thoug...

Page 84: ...cessful it looks like this start SP console Are you sure you want to start SP console y n y Disabling external host serial connection Serial console started To stop type ESC When the command fails it looks like this start SP console Are you sure you want to start SP console y n y Workaround 1 Reboot the SP Use the command reset SP 2 Wait for the SP to fully reboot 3 Try the start SP console comman...

Page 85: ...is might happen When you hotswap a power supply However you can preserve the information by creating copies of the other faults Occasionally a power supply fault clears itself When this happens the information about other faults might be lost Workaround Perform this procedure before hotswapping a power supply 1 Save your fault information immediately when you see a power fault and before hot swapp...

Page 86: ...ilable CR 6867060 and CR 6904922 Workaround Use other methods to update Oracle ILOM as described in the Oracle Integrated Lights Out Manager ILOM 3 0 Supplement for the Sun Fire X4800 Server 86 Sun Fire X4800 Server Product Notes July 2017 ...

Page 87: ...277 The following ipmitool functions are not supported Set BIOS boot options to persistent For example the following command does not work ipmitool H sp_ip_address U username P password I lanplus chassis bootdev bios options persistent Set chassis policy For example the following command does not work ipmitool H sp_ip_address U username P password I lanplus chassis policy There is no workaround Or...

Page 88: ...88 Sun Fire X4800 Server Product Notes July 2017 ...

Page 89: ... 90 Yes Messages Complain of Failure to Allocate I O Resources CR 6984329 on page 90 Yes DIMM Failure Causes Other DIMMs To Be Disabled CR 6929978 on page 91 N A Auto Boot Host On Power Loss Control Is Deactivated The control to Auto Boot Host On Power Loss in the BIOS Setup Utility has been deactivated To set Auto Boot Host On Power Loss use Oracle ILOM instead Update Oracle ILOM and BIOS Firmwar...

Page 90: ... card fails to operate correctly reboot the server Messages Complain of Failure to Allocate I O Resources CR 6984329 Messages that complain about failures to allocate I O resources might appear in POST and in log files For example you might see Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resource 2 20 0 for 0000 8d 00 0 Sep 8 15 50 49 nsg14 28 kernel PCI Failed to allocate I O resour...

Page 91: ...4 D15 D3 or D7 D3 D7 D11 D15 D8 or D12 D0 D4 D1 D5 D8 D9 D12 D13 D9 or D13 D1 D5 D9 D13 D10 or D14 D2 D6 D3 D7 D10 D11 D14 D15 D11 or D15 D3 D7 D11 D15 In a 4 socket system a faulted DIMM causes the BIOS to disable either two or four DIMMs depending on the socket number Faulted DIMM Disabled DIMMs D0 or D4 D0 D4 D1 D5 D1 or D5 D1 D5 D2 or D6 D2 D6 D3 D7 D3 or D7 D3 D7 D8 or D12 D8 D9 D12 D13 D9 or...

Page 92: ...sts the faulted DIMMs In 8 socket systems Oracle ILOM also lists disabled DIMMs These are listed as fault memory intel dimm population invalid If all the DIMM are listed as fault memory intel dimm population invalid then the configuration is invalid 2 Replace the faulted DIMMs Refer to Removing and Installing DIMMs CRU in Sun Fire X4800 Server Service Manual 3 Clear the DIMM faults in Oracle ILOM ...

Reviews: