
Use the CLI

As an alternative to using the PowerVault Manager, you can run the show system CLI command to view the health of
the system and its components. If any component has a problem, the system health is in a Degraded, Fault, or Unknown
state, and those components are listed as Unhealthy Components. Follow the recommended actions in the component Health
Recommendation field to resolve the problem.
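
The health check described above can be sketched in a few lines of Python. This is only an illustration: the list-of-dictionaries input and the key names "name", "health", and "recommendation" are assumptions made for the example, not the actual output format of the show system command, and the sample components are invented.

```python
# Health states that the text says indicate a problem.
PROBLEM_STATES = {"Degraded", "Fault", "Unknown"}


def unhealthy_components(components):
    """Return (name, recommendation) pairs for components needing attention.

    `components` is a hypothetical list of dicts with "name", "health",
    and "recommendation" keys. It does not mirror real CLI output.
    """
    return [
        (c["name"], c["recommendation"])
        for c in components
        if c["health"] in PROBLEM_STATES
    ]


# Invented sample data, used only to exercise the function.
sample = [
    {"name": "Controller A", "health": "OK", "recommendation": ""},
    {"name": "PSU 1", "health": "Fault", "recommendation": "Replace the PSU."},
]

for name, action in unhealthy_components(sample):
    print(f"{name}: {action}")
```

In practice the input would come from whatever health data you collect from the CLI or the PowerVault Manager. The filter itself only encodes the rule that any state other than OK needs follow-up.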

Monitor event notification

With event notification configured and enabled, you can view event logs to monitor the health of the system and its
components. If a message tells you to check whether an event has been logged, or to view information about an event, use the
PowerVault Manager or the CLI. Using the PowerVault Manager, view the event log and then click the event message to see
detail about that event. Using the CLI, run the show events detail command to see the detail for an event.

View the enclosure LEDs

You can view the LEDs on the hardware to identify component status. If a problem prevents access to the PowerVault Manager
or the CLI, viewing the enclosure LEDs is the only option available. However, monitoring and management are often done at a
management console using storage management interfaces, rather than relying on line-of-sight to the LEDs of racked hardware
components.

Performing basic steps

You can use any of the available options that are described in the previous sections to perform the basic steps comprising the
fault isolation methodology.

Gather fault information

When a fault occurs, gather as much information as possible. Doing so helps determine the correct action that is needed to
remedy the fault.

Begin by reviewing the reported fault:

Is the fault related to an internal data path or an external data path?

Is the fault related to a hardware component such as a disk drive module, controller module, or power supply unit?

By isolating the fault to one of the components within the storage system, you are able to determine the necessary corrective
action more quickly.

Determine where the fault is occurring

When a fault occurs, the Module Fault LED illuminates. Check the LEDs on the back of the enclosure to narrow the fault to a
CRU, connection, or both. The LEDs also help you identify the location of a CRU reporting a fault.

Use the PowerVault Manager to verify any faults found while viewing the LEDs. If the LEDs cannot be viewed due to the
location of the system, use the PowerVault Manager to determine where the fault is occurring. This web application provides
you with a visual representation of the system and where the fault is occurring. The PowerVault Manager also provides more
detailed information about CRUs, data, and faults.

Review the event logs

The event logs record all system events. Each event has a numeric code that identifies the type of event that occurred, and has
one of the following severities:

Critical – A failure occurred that may cause a controller to shut down. Correct the problem immediately.

Error – A failure occurred that may affect data integrity or system stability. Correct the problem as soon as possible.

Warning – A problem occurred that may affect system stability, but not data integrity. Evaluate the problem and correct it if
necessary.
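
As a rough illustration of how these severities might drive triage, the following Python sketch orders events so that the most urgent come first. It covers only the three severities listed above. The (code, severity) tuple layout and the event codes in the usage lines are hypothetical and are not taken from a real event log.

```python
from enum import IntEnum


class Severity(IntEnum):
    # Lower value means more urgent; the order follows the list above.
    CRITICAL = 0
    ERROR = 1
    WARNING = 2


# Response guidance quoted from the severity descriptions above.
GUIDANCE = {
    Severity.CRITICAL: "Correct the problem immediately.",
    Severity.ERROR: "Correct the problem as soon as possible.",
    Severity.WARNING: "Evaluate the problem and correct it if necessary.",
}


def triage(events):
    """Return (code, severity) pairs sorted from most to least urgent.

    sorted() is stable, so events of equal severity keep their log order.
    """
    return sorted(events, key=lambda event: event[1])


# Hypothetical event codes, for illustration only.
log = [(101, Severity.WARNING), (202, Severity.CRITICAL), (303, Severity.ERROR)]
for code, severity in triage(log):
    print(code, severity.name, GUIDANCE[severity])
```

Using an ordered enumeration keeps the urgency ranking in one place, so any further severity levels can be added without changing the sort logic.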


Summary of Contents for PowerVault ME5 Series

Page 1: ...Dell PowerVault ME5 Series Storage System Owner s Manual April 2022 Rev A01 ...

Page 2: ...damage to hardware or loss of data and tells you how to avoid the problem WARNING A WARNING indicates a potential for property damage personal injury or death 2022 Dell Inc or its subsidiaries All rights reserved Dell Technologies Dell and other trademarks are trademarks of Dell Inc or its subsidiaries Other trademarks may be trademarks of their respective owners ...

Page 3: ...e Ops panel 20 5U enclosure Ops panel 21 Controller modules 21 12 Gb s controller module LEDs 22 Cache status LED details 24 Controller failure when a single controller is operational 25 Chapter 2 Troubleshooting and problem solving 26 Fault isolation methodology 26 Options available for performing basic steps 26 Performing basic steps 27 Host I O 28 2U enclosure LEDs 28 2U enclosure Ops panel 28 ...

Page 4: ...s 46 Attach or remove the front bezel of a 2U enclosure 47 Replacing a drive carrier module in a 2U enclosure 48 Replacing a DDIC in a 5U enclosure 52 Replacing a controller module or IOM 60 Replacing a power cooling module PCM in a 2U enclosure 65 Replacing a power supply unit PSU in a 5U enclosure 66 Replacing a fan cooling module FCM in a 5U enclosure 68 Completing the component installation pr...

Page 5: ...ators and deployment personnel Related publications The following documentation provides additional information about ME5 Series storage systems Dell PowerVault ME5 Series Administrator s Guide Dell PowerVault ME5 Series ME5012 and ME5024 Getting Started Guide Dell PowerVault ME5 Series ME5084 Getting Started Guide Dell PowerVault ME5 Series Deployment Guide Preface About this guide 5 ...

Page 6: ...a sticker on the back of the storage system chassis This information is used to route support calls to appropriate personnel NOTE Quick Resource Locator QRL The QRL code contains information unique to your system It can be found on the information tag and the Setting Up Your Dell PowerVault ME5 Series Storage System document provided with your ME5 Series enclosure Scan the QRL to get immediate acc...

Page 7: ...oller module A does not report controller module B as missing 2 Remove the controller blank from slot B 3 Grasp the controller module with both hands and with the latch in the open position orient the module and align it for insertion into slot B 4 Ensuring that the controller module is level slide it into the enclosure as far as it will go A controller module that is only partially seated will pr...

Page 8: ... correct type ready for insertion Disconnect power from a PCM or power supply unit PSU to be replaced before removing the PCM or PSU Read the hazardous voltage warning label that is affixed to PCMs CAUTION 5U84 enclosures only To prevent overturning drawer interlocks stop users from opening both drawers simultaneously Do not attempt to force open a drawer when the other drawer in the enclosure is ...

Page 9: ...closure system rear orientation Figure 3 2U24 enclosure system front orientation The 2U24 controller enclosure is equipped with dual controllers Figure 4 2U24 enclosure system rear orientation Storage system hardware 9 ...

Page 10: ...e key components of 2U and 5U enclosures are described in the following sections Although many CRUs differ between the form factors the controller modules and IOMs are common to 2U12 2U24 and 5U84 chassis The controller modules and IOMs are introduced in 2U enclosure core product and cross referenced from 5U84 enclosure core product 2U12 2U12 enclosures consist of 12 LFF Large Form Factor disk dri...

Page 11: ...28 4 port 10Gbase T iSCSI 4 port mini SAS HD The supported IOMs are used in expansion enclosures for adding additional storage 3 In single controller module configurations the controller module is installed in slot A and a controller module blank is installed in slot B 5U84 5U84 enclosures consist of 84 LFF or 84 SFF disk drives held in two 42 slot drawers Table 3 5U84 enclosure variants Product C...

Page 12: ...out the optional 2U enclosure front bezel see Attach or remove the front bezel of a 2U enclosure 2U enclosure rear panel The controller modules and IOMs use alphabetic designators and the Power Cooling Modules PCMs use numeric designators to identify the slots in a 2U enclosure There are two redundant controller modules that use a series of LEDs to reflect host connectivity status You can monitor ...

Page 13: ... the bottom of the module and it is in a closed locked position See 12 Gb s controller module LEDs Figure 11 Controller module details 1 Host ports 2 USB serial port service only 3 USB serial port CLI 4 Ethernet port for management network 5 SAS expansion port Expansion enclosure IOM The following figure shows the IOM used in supported expansion enclosures for adding storage Ports A B C ship confi...

Page 14: ...as occurred For a detailed description of PCM LED behavior see 2U enclosure PCM LEDs 5U84 enclosure core product The following figures show component locations and CRU slot indexing on the 5U84 enclosure front panel with drawers and on the rear panel The 5U84 supports up to 84 DDIC modules populated within two drawers 42 DDICs per drawer 14 DDICs per row NOTE The 5U84 does not ship with DDICs inst...

Page 15: ...Direction into the enclosure drawer slot slot 0 or 1 5U84 enclosure rear panel The controller modules and IOMs use alphabetic designators and the Power Supply Units PSUs and Fan Control Modules FCMs use numeric designators to identify the slots in a 5U84 enclosure NOTE Controller modules IOMs PSUs and FCMs are available as CRUs 5U84 controller enclosures support dual controller module configuratio...

Page 16: ...ng module Controller modules The 5U84 controller enclosure uses the same controller modules that are used by 2U12 and 2U24 enclosures The top slot for holding controller modules is designated slot A and the bottom slot is designated slot B The face plate details of the controller modules show a module aligned for use in slot A In this orientation the controller module latch is shown at the bottom ...

Page 17: ...enclosure Figure 19 Expansion enclosure IOM details 1 3 5 mm serial port service only 2 SAS expansion port A 3 SAS expansion port B disabled 4 SAS expansion port C 5 Ethernet port disabled Power supply module This figure shows the power supply unit that is used in 5U controller enclosures and optional 5U84 expansion enclosures Figure 20 Power supply unit PSU 1 Module release latch 2 Handle 3 PSU F...

Page 18: ...rrier adapter 5U84 empty chassis with midplane module runner system and drawers The chassis has a 19 inch rack mounting that enables it to be installed onto standard 19 inch racks and uses five EIA units of rack space 8 75 At the front of the enclosure two drawers can be opened and closed Each drawer provides access to 42 slots for Disk Drive in Carrier DDIC modules DDICs are top mounted into the ...

Page 19: ...osure Each drawer can be locked shut by turning both anti tamper locks clockwise using a screwdriver with a Torx T20 bit included in your shipment The anti tamper locks are symmetrically placed on the left and right sides of the drawer bezel Drawer status and activity LEDs can be monitored by two drawer LEDs panels located next to the two drawer pull pockets located on the left and right side of e...

Page 20: ...ne PCM is supplying power Off system not operating regardless of AC present Status Health Blue On steady system is powered on and controller is ready Blinking 2 Hz Enclosure management is busy for example when booting or performing a firmware update Amber On steady module fault present may be associated with a Fault LED on a controller module IOM or PCM Blinking logical fault 2 s on 1 s off Unit i...

Page 21: ...dule IOM PSU FCM DDIC or drawer Logical status LED Amber Constant or blinking fault present from something other than the enclosure management system The logical status LED may be initiated from the controller module or an external HBA The indication is typically associated with a DDIC and LEDs at each disk position within the drawer which help to identify the DDIC affected Top Drawer Fault Amber ...

Page 22: ...sures the controller module labels appear upside down In each diagram the controller module is oriented for insertion into either slot of 5U84 enclosures Alternatively you can configure the 2U controller enclosure with a single controller module Install the controller module in slot A and install a blank plate in slot B Figure 26 ME5 Series controller module Table 6 Common controller module LEDs L...

Page 23: ...s Green On Connected link is up Green or amber Flashing Link activity Amber On Connected partial link is up None Off Not connected or link is down The following figure shows the host port LEDs on a 32Gb s Fibre Channel controller module Figure 27 32Gb s Fibre Channel ports LED Description Color Status Fibre Channel link activity Green On Connected link is up Flashing Link activity Off Not connecte...

Page 24: ...k is down Cache status LED details This section describes the behavior of the LEDs during powering on and off and cache status behavior Power on off behavior During power on discrete sequencing for power on display states of internal components is reflected by blinking patterns displayed by the Cache Status LED Table 7 Cache Status LED power on behavior Item Display states reported by Cache Status...

Page 25: ...can be flushed to eMMC Controller failure when a single controller is operational The following information applies to 2U and 5U dual controller enclosures when one of the controllers is down and the other controller fails Cache memory is flushed to eMMC in the case of a controller failure or power loss During the process of writing to eMMC only the components needed to write the contents of the c...

Page 26: ...ing basic steps Performing basic steps Host I O Cabling systems to replicate volumes is another important fault isolation consideration related to initial system installation See the ME5 Series Storage Systems Deployment Guide for more information about troubleshooting during initial setup Options available for performing basic steps When performing fault isolation and troubleshooting steps select...

Page 27: ... as much information as possible Doing so helps determine the correct action that is needed to remedy the fault Begin by reviewing the reported fault Is the fault related to an internal data path or an external data path Is the fault related to a hardware component such as a disk drive module controller module or power supply unit By isolating the fault to one of the components within the storage ...

Page 28: ...he affected disk groups from all hosts as a data protection precaution As an extra data protection precaution it is helpful to conduct regularly scheduled backups of your data 2U enclosure LEDs Use the LEDs on the 2U enclosure to help troubleshoot initial start up problems 2U enclosure Ops panel The front of the enclosure has an Ops panel that is located on the left ear flange of the 2U chassis Th...

Page 29: ...f On On On PCM fault above temperature above voltage above current Off Blinking Blinking Blinking PCM firmware download is in progress 2U enclosure Ops panel LEDs The Ops panel displays the aggregated status of all the modules The following table describes the Ops panel LED states Table 11 Ops panel LED states System Power Green Amber Module Fault Amber Identity Blue LED display Associated LEDs Al...

Page 30: ...ier module LEDs Disk drive status is monitored by a green LED and an amber LED mounted on the front of each drive carrier module as shown in the following figure The drive module LEDs are identified in the figure and the LED behavior is described in the table following the figure In normal operation the green LED are on and flicker as the drive operates In normal operation the amber LED will be Of...

Page 31: ...s located on the face plate The following table describes LED behaviors for expansion enclosure IOMs Table 13 Expansion enclosure IOM LEDs LED Description Color Status Module fault Amber On Ops panel undergoing 5s test Rear panel area module fault IOM fan PSU when paired with module fault LED Drive module hardware fault paired with drive fault LED Flashing Unknown invalid or mixed module type such...

Page 32: ...properly Flashing Part of standby sequence as the controller module comes online Off Controller module power is off controller module is offline or controller module has a fault condition Hardware fault Amber On Controller module hardware fault Off Controller module functioning properly OK to remove White On Ready for removal the cache is clear Off Do not remove the controller module cache still c...

Page 33: ...ule Figure 35 25 GbE iSCSI ports LED Description Color Status iSCSI link activity Green On Connected link is up Flashing Link activity Off Not connected or link is down The following figure shows host port LEDs on a 10Gbase T iSCSI controller module Figure 36 10Gbase T iSCSI ports LED Description Color Status iSCSI 10Gbase T link speed Green On 10GbE link speed Amber On 1GbE link speed None Off No...

Page 34: ...anel Indicator Description Color Status Unit Identification Display UID Green Dual seven segment display that shows the numerical position of the enclosure in the cabling sequence The UID is also called the enclosure ID The controller enclosure ID is 0 System Power On Standby Green On steady system power is available operational Amber Constant amber system in standby not operational Module Fault A...

Page 35: ...power missing PSU in standby other PSU is providing power On On On Firmware has lost communication with the PSU module On Off PSU has failed Follow the procedure in Replacing a PSU ME5084 FCM LEDs The following table describes the LEDs on the Fan Cooling Module FCM faceplate Table 17 FCM LED states LED Status description Module OK Constant green indicates that the FCM is working correctly Off indi...

Page 36: ...wer Fault Amber if a drawer component has failed If the failed component is a disk the LED on the failed DDIC will light amber Follow the procedure in Replacing a DDIC in a 5U enclosure on page 52 If the disks are OK contact your service provider to identify the cause of the failure and resolve the problem CAUTION The sideplanes on the enclosure drawers are not hot swappable or customer serviceabl...

Page 37: ...e fault is indicated if the Drive Fault LED is lit amber In the event of a disk failure replace the DDIC 5U84 controller module and IOM LEDs Controller module and IOM CRUs are common to the 2U and 5U84 enclosures For information about controller module LEDs see 12 Gb s controller module LEDs For information about IOM LEDs see IO Module LEDs Initial start up problems The following sections describe...

Page 38: ...ent temperatures are kept low and to also minimize acoustic noise Air flow is from the front to the back of the enclosure Table 23 Thermal monitoring recommended actions Symptom Cause Recommended action If the ambient air is below 25ºC 77ºF and the fans are observed to increase in speed then some restriction on airflow may be causing additional internal temperature rise NOTE This is not a fault co...

Page 39: ...he Ops panel lights amber to indicate a fault for the problems listed in the following table NOTE All alarms also report through SES Table 25 5U alarm conditions Status Severity PSU alert loss of DC power from a single PSU Fault loss of redundancy Cooling module fan failure Fault loss of redundancy SBB I O module detected PSU fault Fault PSU removed Configuration error Enclosure configuration erro...

Page 40: ...does not reorder the expansion enclosure IDs 1 To perform a rescan using the PowerVault Manager a Verify that both controllers are operating normally b Select Maintenance Hardware c Select Actions Rescan All Disks d Click Rescan 2 To perform a rescan using the CLI type the following command rescan Troubleshooting hardware faults Make sure that you have a replacement module of the same type before ...

Page 41: ...riginal port If the link status LED remains off you have isolated the fault to the controller module port Replace the controller module No Proceed to the next step 7 Verify that the switch if any is operating properly If possible test with another port 8 Verify that the HBA is fully seated and that the PCI slot is powered on and operational 9 Replace the HBA with a known good HBA or move the host ...

Page 42: ...port status LED on Yes You now know that the expansion cable is good Return the cable to the original port If the expansion port status LED remains off you have isolated the fault to the controller module expansion port Replace the controller module No Proceed to the next step 7 Move the expansion cable back to the original port on the controller enclosure 8 Move the expansion cable on the expansi...

Page 43: ... Avoid hand contact by transporting and storing products in static safe containers Keep electrostatic sensitive parts in their containers until they arrive at static protected workstations Place parts in a static protected area before removing them from their containers Avoid touching pins leads or circuitry Always be properly grounded when touching a static sensitive component or assembly Remove ...

Page 44: ...e PFU setting For a dual controller system the partner firmware update PFU setting Settings System Properties Firmware Properties controls how updates impact the partner controller Automatic PFU is enabled the default When you activate controller module firmware on one controller the firmware is automatically copied over and activated on the partner controller first and then activated on the curre...

Page 45: ...ick Browse For File and navigate to the downloaded firmware bundle 4 Follow the on screen directions to install the firmware Activating a firmware bundle After a firmware bundle is available to the system activate the firmware to complete the firmware update 1 Go to Maintenance Firmware System and click its Activate this Version link to display the Activate Firmware dialog 2 Follow the on screen d...

Page 46: ...ther ensure that everything is okay or to drill down to a problem component The PowerVault Manager uses health icons to show OK Degraded Fault or Unknown status for the system and its components If you discover a problem component follow the actions in its Recommendation field to resolve the problem As an alternative to using the PowerVault Manager you can run the CLI show system command to view t...

Page 47: ...Up to 84 2 5 SFF or 3 5 LFF drives ME5084 Mini SAS HD 12 Gb s 2 5U84 Up to 84 2 5 SFF or 3 5 LFF drives 1 This model supports 10 Gb s or 1 Gb s speeds used for iSCSI host connection 2 This model uses SFF 8644 connectors and qualified cable options for host connection Attach or remove the front bezel of a 2U enclosure The following figure shows a partial view of a 2U12 enclosure Figure 40 Attaching...

Page 48: ...TE Familiarize yourself with full disk encryption FDE considerations relative to disk drive installation and replacement When moving FDE capable disk drives for a disk group stop I O to the disk group before removing the drive carrier modules Import the keys for the disk drives so that the drive content becomes available See the Dell PowerVault ME5 Series Storage System Administrator s Guide or De...

Page 49: ...steps to install an LFF drive carrier module in a 2U enclosure 1 Press the latch on the drive module carrier to open the handle Figure 43 LFF drive carrier module in open position 2 Insert the drive carrier module into the enclosure 3 Gently slide the drive carrier module into the enclosure until it stops moving Figure 44 Installing an LFF drive carrier module 1 of 2 4 Push the drive carrier modul...

Page 50: ...Replacing an SFF drive carrier module The replacement procedures for SFF drive carrier modules are the same for LFF modules except that the SFF drive carrier modules are mounted vertically Removing an SFF drive carrier module Perform the following steps to remove an SFF drive carrier module from a 2U enclosure 1 Press the latch on the drive module carrier to open the handle Figure 46 Removing an S...

Page 51: ...sed drive slots Installing an SFF drive carrier module Perform the following steps to install an SFF drive carrier module in a 2U enclosure 1 Press the latch on the drive module carrier to open the handle Figure 48 SFF drive carrier module in open position 2 Insert the drive carrier module into the enclosure 3 Gently slide the drive carrier module into the enclosure until it stops moving Module re...

Page 52: ...Ensure optimal cooling throughout the enclosure by installing blank drive carrier modules into all unused drive slots To remove a blank drive carrier module press the latch on the module and pull the module out of the drive slot To install a blank drive carrier module insert the module into the drive slot and push the module into the drive slot to secure it in place Replacing a DDIC in a 5U enclos...

Page 53: ...roup stop I O to the disk group before removing the DDICs Import the keys for the disk drives so that the drive content becomes available See the Dell PowerVault ME5 Series Storage System Administrator s Guide or Dell PowerVault ME5 Series Storage System CLI Guide for more information Before you begin any of the procedures see the ESD precautions on page 43 Installing a replacement 2 5 disk drive ...

Page 54: ...m the upper assembly of the DDIC 5 Slide the upper assembly of the DDIC onto the mounting bracket with the 2 5 disk drive 6 Secure the upper assembly to the mounting bracket using the supplied screws 54 Module removal and replacement ...

Page 55: ...ed with a new disk drive in carrier DDIC Install the replacement disk drive in the DDIC before opening the drawer of the enclosure to remove the failed drive 1 Insert the SAS connector into the SAS interface on the disk drive 2 Slide the disk drive into the lower assembly of the DDIC 3 Remove the protective film from the upper assembly of the DDIC Module removal and replacement 55 ...

Page 56: ...f the DDIC onto the disk drive 5 Secure the upper assembly to the disk drive using the supplied screws 6 Attach the appropriate drive size label to the label location on top of the upper assembly 56 Module removal and replacement ...

Page 57: ...using a Torx T20 bit Figure 51 Drawer front panel details 1 Left side 2 Right side 3 Anti tamper lock 4 Sideplane OK Power Good 5 Drawer Fault 6 Logical Fault 7 Cable Fault 8 Drawer Activity 9 Drawer pull handle 2 Push the drawer latches inward and hold them as shown in the following figure Figure 52 Opening a drawer 1 of 2 3 Pull the drawer outward until it locks at the drawer stops as shown in t...

Page 58: ...from a 5U enclosure Remove a DDIC only if a replacement DDIC is available NOTE Closing a drawer with one or more DDICs missing can potentially cause cooling problems See Populating drawers 1 Determine which drawer contains the disk drive to remove If the disk drive has failed a fault LED is lit on the front panel of the affected drawer If the disk drive has failed the Drive Fault LED on the DDIC i...

Page 59: ...igure 56 Installing a DDIC 1 Slide latch slides left 2 Latch button shown in locked position 3 Drive Fault LED 3 Verify the following a The latch button is in the locked position b The Drive Fault LED is not lit 4 Close the drawer Populating drawers The general guidelines for populating a drawer with DDICs are provided in the Dell PowerVault ME5 Series Storage System Deployment Guide Additional gu...

Page 60: ...h DDICs installed in the drawers An enclosure is configured with either 42 disk drives half populated or 84 disk drives fully populated for customer delivery If half populated the rows containing disk drives should be populated with a full complement of DDICs no blank slots in the row The following list identifies rows in drawers that should contain DDICs when the enclosure is configured as half p...

Page 61: ... enclosure 3 Install the replacement controller module in the enclosure 4 Wait 30 minutes and then use the PowerVault Manager or CLI to check the system status and event logs to verify that the system is stable NOTE If the Partner Firmware Update PFU feature is not enabled update the firmware on the replacement controller module For more information updating the firmware see the Dell PowerVault ME...

Page 62: ...ement controller module in a dual controller module enclosure Before you begin any procedure see ESD precautions 1 Examine the replacement controller module for damage and closely inspect the interface connector Do not install the replacement controller module if the pins are bent 2 Grasp the controller module using both hands and with the latch in the open position orient the controller module an...

Page 63: ... module from the storage system enclosure 4 Install the replacement controller module in the storage system enclosure and configure the replacement controller module Removing a controller module from a single controller module enclosure Perform the following steps to remove a controller module from a single controller module enclosure Before you begin any procedure see ESD precautions 1 Shut down ...

Page 64: ...the firmware that was on the failed controller module 7 Configure the system settings and perform storage setup CAUTION If the disk groups go into quarantine mode during the storage setup contact technical support before proceeding to the next step 8 Reconfigure the connections to the host systems and remap the volumes 9 Set up replications between storage systems Removing an IOM Before you begin ...

Page 65: ...le replacement within the right slot as you view the enclosure rear panel To replace a PCM in the left slot rotate the module 180º so that it properly aligns with its connectors on the back of the midplane Removing a power cooling module CAUTION Removing a power supply unit significantly disrupts the enclosure s airflow Do not remove a power cooling module until you have received the replacement m...

Page 66: ...enclosure taking care to support the base and weight of the module with both hands 4 Close the power cooling module handle to secure the PCM You should hear a click as the latch handle engages and secures the power cooling module to its connector on the back of the midplane 5 Connect the power cable to the power source and the power cooling module 6 Using the management interfaces the PowerVault M...

Page 67: ...red when replacing both PSUs at once 3 Verify the Power OK LED is lit then switch off the faulty PSU and disconnect the power supply cable 4 If replacing a single PSU via hot swap proceed to step 6 5 If replacing both PSUs verify that the enclosure was shut down using management interfaces and that the enclosure is powered off 6 Verify that the power cable is disconnected 7 Push the release latch ...

Page 68: ...a 5U enclosure This section provides procedures for removing and installing an FCM in a 5U enclosure The images in the FCM removal and installation procedures show rear panel views of the 5U enclosure Before you begin any procedure see ESD precautions Removing a fan cooling module You can change all fan cooling modules as long as they are removed and inserted one at a time We recommend that you sh...

Page 69: ... Module OK LED does not illuminate verify that the FCM is properly inserted and seated in the slot If properly seated the module may be defective Check the PowerVault Manager and the event logs for more information Using the management interfaces the PowerVault Manager or CLI determine if the health of the new FCM is OK Verify that the Module OK LED is green and that the Ops panel states show no a...

Page 70: ...e LEDs are located on the enclosure front and rear panels 1 Verify front panel LEDs Front panel LEDs reside on the Ops panel located on the left ear flange Disk LEDs are located on the carrier modules Verify that the System Power On Standby LED is illuminated green and that the Module Fault LED is not illuminated Verify that the enclosure ID LED located on the left ear is illuminated green Verify ...

Page 71: ...mmand with additional parameters to filter the output to see the detail for an event See the CLI Reference Guide for more information about command parameters and syntax Performing updates in PowerVault Manager after replacing an FC or SAS HBA After replacing an FC or SAS HBA in an attached host perform the following tasks 1 For an FC HBA update the zoning if a switch is used then update the host ...

Page 72: ...ediate action is required In this document this severity is abbreviated as Info Resolved The condition that caused an event to be logged has been resolved An event message might specify an associated error code or reason code which provides additional detail for technical support Error codes and reason codes are outside the scope of this guide Topics Event descriptions Events Event descriptions Th...

Page 73: ... Warning The disk group is online but cannot tolerate another disk failure If the indicated disk group is RAID 6 it is operating with degraded health due to the failure of two disks If the indicated disk group is not RAID 6 it is operating with degraded health due to the failure of one disk A dedicated spare or global spare of the proper size and type is being used to automatically reconstruct the...

Page 74: ...ely The user was given immediate feedback that it failed at the time they attempted to add the disk group Recommended actions No action is required 7 Error In a testing environment a controller diagnostic failed and reports a product specific diagnostic code Recommended actions Perform failure analysis 8 Warning One of the following conditions has occurred A disk that was part of a disk group is d...

Page 75: ... the CLI trust command may be able to recover some or all of the data in the disk group However trusting a partially reconstructed disk may lead to data corruption See the CLI help for the trust command Contact technical support for help to determine if the trust operation applies to your situation and for help to perform it If the associated disk group is offline and you do not want to use the tr...

Page 76: ...in the disk group have logged SMART events or unrecoverable read errors If so and the disk group is a non fault tolerant RAID level RAID 0 or non RAID copy the data to a different disk group and replace the faulty disks If so and the disk group is a fault tolerant RAID level check the current state of the disk group If it is not FTOL then back up the data as data may be at risk If it is FTOL then ...

Page 77: ...r may be unpredictable in this temperature range Check the event log to determine if more than one disk has reported this event If multiple disks report this condition there could be a problem in the environment If one disk reports this condition there could be a problem in the environment or the disk has failed Recommended actions Check that the storage system s fans are running Check that the am...

Page 78: ...s has been cleared. This event indicates that a problem reported by event 39, 40, or 524 is resolved. Recommended actions: No action is required. 48 (Info): The indicated disk group has been renamed. Recommended actions: No action is required. 49 (Info): A lengthy SCSI maintenance command has completed. This typically occurs during disk firmware update. Recommended actions: No action is required. 50 (Error): A correcta...
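
The pairing between a sensor event and its "cleared" event can be sketched in code. The example below is a minimal illustration, assuming the clearing event is number 47 (as stated in the description of event 524) and that log entries are available as (code, component) tuples in oldest-first order. The tuple layout and the function name are hypothetical, not the real event log format.

```python
# Resolution pairing described in the event table: event 47 reports that a
# problem raised by event 39, 40, or 524 has cleared for that component.
RESOLVED_BY = {47: {39, 40, 524}}


def open_sensor_events(events):
    """Return the (code, component) pairs that have not been cleared yet.

    `events` is an iterable of (code, component) tuples, oldest first.
    This layout is an assumption made for illustration only.
    """
    open_events = []
    for code, component in events:
        if code in RESOLVED_BY:
            cleared = RESOLVED_BY[code]
            # Drop earlier problems on the same component that this event clears.
            open_events = [
                (c, comp) for c, comp in open_events
                if not (comp == component and c in cleared)
            ]
        elif any(code in cleared for cleared in RESOLVED_BY.values()):
            # Track only the events that a clearing event can resolve.
            open_events.append((code, component))
    return open_events
```

Calling `open_sensor_events` on a parsed log leaves only the sensor problems that still need attention; unrelated event codes are ignored.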

Page 79: ...lacement disk should have performance that is the same as or better than the one it is replacing If disk group reconstruction starts wait for it to complete and then retry the expansion 55 Warning The indicated disk reported a SMART event A SMART event indicates impending disk failure Recommended actions Resolve any non disk hardware problems especially a cooling problem or a faulty power supply I...

Page 80: ...ller reset a disk channel to recover from a communication error This event is logged to identify an error trend over time Recommended actions If the controller recovers no action is required View other logged events to determine other action to take 62 Warning The indicated dedicated spare disk or global spare disk has failed Recommended actions Replace the disk with one of the same type SSD enter...

Page 81: ... Warning The controller could not use an assigned spare for a disk group because the spare s capacity is too small This occurs when a disk in the disk group fails there is no dedicated spare available and all global spares are too small or if the dynamic spares feature is enabled all global spares and available disks are too small or if there is no spare of the correct type There may be more than ...

Page 82: ...e Guide 88 Warning The mirrored configuration retrieved by this controller from the partner controller is corrupt The local flash configuration will be used instead Recommended actions Restore the default configuration by using the restore defaults command as described in the CLI Reference Guide 89 Warning The mirrored configuration retrieved by this controller from the partner controller has a co...

Page 83: ...d for the indicated volume Recommended actions No action is required 106 Info The indicated volume has been added to the indicated pool Recommended actions No action is required 107 Error A serious error has been detected by the controller In a single controller configuration the controller will restart automatically In an Active Active configuration the partner controller will kill the controller...

Page 84: ... the data in the partner controller s cache but if the other controller does not restart successfully the data will be lost Recommended actions To determine if data might have been lost check whether this event was immediately followed by event 56 Storage Controller booted up closely followed by event 71 failover started The failover indicates that the restart did not succeed 117 Warning This cont...

Page 85: ...ent probably has a hardware fault Replace the module If you are not able to access the management interfaces of the controller that logged this event do the following Shut down that controller and reseat the module If you are then able to access the management interfaces check the version of the controller firmware and update to the latest firmware if needed If the problem recurs replace the modul...

Page 86: ...both controller modules have been replaced or moved while the system was powered off One or both controller modules have had their flash configuration cleared this is where the previously used WWNs are stored The controller module recovers from this situation by generating a WWN based on its own serial number Recommended actions If the controller module was replaced or someone reprogrammed its FRU...

Page 87: ... accessible to allow reading from and writing to the disk group it will be dequarantined automatically with a resulting status of FTDN or CRIT If a spare disk is available reconstruction will begin automatically When the disk group has been removed from quarantine event 173 is logged For a more detailed discussion of dequarantine see the SMC or CLI documentation CAUTION Avoid using the manual dequ...

Page 88: ...ware update fails the user will be notified about the problem immediately and should take care of the problem at that time so even when there is a failure this event is logged as Informational severity Recommended actions No action is required 175 Info The network port Ethernet link has changed status up or down for the indicated controller Recommended actions If this event is logged indicating th...

Page 89: ...action is required 188 Info Write back cache has been disabled Event 187 is the corresponding event that is logged when write back cache is disabled Recommended actions No action is required 189 Info A disk channel that was previously degraded or failed is now healthy Recommended actions No action is required 190 Info The controller module's supercapacitor pack has started charging This change met ...

Page 90: ...e through feature to disable write back cache and put the system in write through mode When the fault is resolved event 199 is logged to indicate that write back mode has been restored Recommended actions If event 199 has not been logged since this event was logged the power supply probably does not have a health of OK and the cause should be investigated Another power supply event was probably lo...

Page 91: ...edly such as when a power failure occurs This event is generated when the Storage Controller SC detects a problem with the eMMC as it is booting up Recommended actions Restart the Storage Controller that logged this event If this event is logged again shut down the Storage Controller and replace the controller module Info The system has come up normally and the NV device is in a normal expected st...

Page 92: ...and the disk group is a fault-tolerant RAID level, check the current state of the disk group. If it is not FTOL, then back up the data, as data may be at risk. If it is FTOL, then replace the indicated disk. If more than one disk in the same disk group has logged a SMART event, back up the data and replace each disk one at a time. In virtual storage it may be possible to remove the affected disk group, whic...
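
The recommended actions for a disk that logged a SMART event amount to a small decision tree. The sketch below is one way to express it, combining the fault-tolerant branch above with the non-fault-tolerant branch (RAID 0 or non-RAID) given in the parallel description earlier in the event table. The function name, its parameters, and the order in which the checks are applied are assumptions for illustration.

```python
def smart_event_action(fault_tolerant, status, disks_with_smart_events=1):
    """Suggest an action for a disk group whose disk logged a SMART event.

    fault_tolerant: False for RAID 0 or non-RAID disk groups.
    status: disk group status string such as "FTOL", "FTDN", or "CRIT".
    disks_with_smart_events: number of disks in the group reporting SMART events.
    """
    if not fault_tolerant:
        return "copy the data to a different disk group and replace the faulty disks"
    if disks_with_smart_events > 1:
        # Assumed precedence: the multi-disk advice overrides the single-disk case.
        return "back up the data and replace each disk one at a time"
    if status != "FTOL":
        return "back up the data; data may be at risk"
    return "replace the indicated disk"
```

For example, `smart_event_action(True, "FTDN")` returns the back-up advice, because a fault-tolerant group that is not in the FTOL state is already degraded.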

Page 93: ...the problem persists, see Troubleshooting and problem solving. (Info): SAS topology has changed. The number of SAS expanders has increased or decreased. The message specifies the number of elements in the SAS map, the number of expanders detected, the number of expansion levels on the native (local controller) side and on the partner (partner controller) side, and the number of device PHYs. Recommended actions: N...

Page 94: ... module Info An enclosure management processor EMP reported an event Recommended actions No action is required 236 Error A special shutdown operation has started These special shutdown types indicate an incompatible feature Recommended actions Replace the indicated controller module with one that supports the indicated feature Info A special shutdown operation has started These special shutdown ty...

Page 95: ...d be investigated Another eMMC event was probably logged at approximately the same time as this event such as event 239 240 or 481 See the recommended actions for that event 243 Info A new controller enclosure has been detected This happens when a controller module is moved from one enclosure to another and the controller detects that the midplane WWN is different from the WWN it has in its local ...

Page 96: ...ot mount either volume until the copy is complete as indicated by event 268 Recommended actions No action is required 253 Info A license was uninstalled Recommended actions No action is required 255 Info The PBCs across controllers do not match as PBC from controller A and PBC from controller B are from different vendors This may limit the available configurations Recommended actions No action is ...

Page 97: ... to determine if unwritable cache data is present 4 Incompatible firmware versions in the controller modules 5 Incompatible firmware is present in the system Recommended actions For variant 1 2 or 3 You must resolve this condition before the firmware update will proceed Log into the system and run the show system command to identify unhealthy components and find recommendations for restoring syste...

Page 98: ...ess port If the indicated PHY type is Ingress replace the cable in the module s ingress port For other indicated PHY types or if replacing the cable does not fix the problem replace the indicated module If the problem persists check for other events that may indicate faulty hardware such as an event indicating an over temperature condition or power supply fault and follow the recommended actions f...

Page 99: ...anged to high Recommended actions No action is required 303 Info DDR memory clock frequency has changed to low Recommended actions No action is required 304 Info The controller has detected I2C errors that may have been fully recovered Recommended actions No action is required 305 Info A serial number in Storage Controller SC flash memory was found to be invalid when compared to the serial number ...

Page 100: ... other EMPs in the system Recommended actions No action is required 311 Info This event is logged when a user initiates a ping of a host using the iSCSI interface Recommended actions If the ping operation failed check connectivity between the storage system and the remote host 313 Error The indicated controller module has failed This event can be ignored for a single controller configuration Recom...

Page 101: ...dump data have been cleared Recommended actions No action is required 354 Warning SAS topology has changed on a host port At least one PHY has gone down For example the SAS cable connecting a controller host port to a host has been disconnected Recommended actions Check the cable connection between the indicated port and the host Monitor the log to see if the problem persists Info SAS topology has...

Page 102: ...r or Warning The scheduler experienced a problem with the indicated schedule Recommended actions Take appropriate action based on the indicated problem Info A scheduled task was initiated Recommended actions No action is required 362 Critical Error or Warning The scheduler experienced a problem with the indicated task Recommended actions Take appropriate action based on the indicated problem Info ...

Page 103: ...logs before they are overwritten For example you might have enabled managed logs without configuring a destination to send logs to 412 Warning One disk in the indicated RAID 6 disk group failed The disk group is online but has a status of FTDN fault tolerant with a down disk If a dedicated spare linear only or global spare of the proper type and size is present that spare is used to automatically ...

Page 104: ...t link speed exceeded the capability of an FC SFP The speed has been automatically reduced to the maximum value supported by all hardware components in the data path Recommended actions Replace the SFP in the indicated port with an SFP that supports a higher speed 456 Warning The system IQN was generated from the default OUI because the controllers could not read the OUI from the midplane FRU ID d...

Page 105: ... take steps to reduce storage usage or add capacity Info The indicated virtual pool exceeded one of its thresholds for allocated pages There are three thresholds two of which are user settable The third and highest setting is set automatically by the controller and cannot be changed This event is logged with Warning severity if the high threshold is exceeded and the virtual pool is overcommitted O...

Page 106: ...or When the problem is resolved event 468 is logged 470 Warning Removal of the indicated disk groups completed with failure Removal of disk groups can fail for several reasons and the specific reason for this failure is included with the event Removal most often fails because there is no longer room in the remaining pool space to move data pages off of the disks in the disk group Recommended actio...

Page 107: ...ed actions No action is required 479 Error The controller reporting this event was unable to flush data to or restore data from non volatile memory This most likely indicates an eMMC failure but it could be caused by some other problem with the controller module The Storage Controller that logged this event will be killed by its partner controller which will use its own copy of the data to perfor...

Page 108: ...ater performance If the system contains a mix of disk types SSD enterprise SAS or midline SAS there should be at least one global spare of each type unless dedicated spares are used to protect every disk group of a given type which will only apply to a linear storage configuration 485 Warning The indicated disk group was quarantined to prevent writing invalid data that may exist in the controller ...

Page 109: ...outdated data. Recommended actions: Wait at least 5 minutes for the automatic recovery process to complete. Then sign in and confirm that both controller modules are operational. You can determine if the controllers are operational with the CLI show redundancy-mode command. In most cases the system will come back up and no action is required. If both controller modules do not become operational in 5 min...
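
The "wait at least 5 minutes, then confirm both controllers are operational" step can be sketched as a polling loop. The example below makes no assumption about the output format of the CLI: `both_operational` is a hypothetical callable that the reader would supply, for instance one that runs the redundancy mode command over a management session and interprets the result. The function name, the 15-second poll interval, and the injectable `sleep` and `clock` parameters are illustrative choices.

```python
import time


def wait_for_controllers(both_operational, timeout_s=300, poll_s=15,
                         sleep=time.sleep, clock=time.monotonic):
    """Poll until both controllers report operational or the timeout expires.

    both_operational: hypothetical zero-argument callable returning True
        once both controller modules are operational.
    timeout_s: 300 seconds matches the 5-minute wait in the recommended action.
    Returns True on success, False if the timeout passed first.
    """
    deadline = clock() + timeout_s
    while True:
        if both_operational():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_s)
```

A False result corresponds to the case where the controllers did not recover in the recommended window, at which point the remaining recommended actions for the event apply.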

Page 110: ... not resolve the problem the fault is probably in the enclosure midplane Replace the chassis FRU of the most upstream enclosure with reported failures If that does not resolve the problem and there is more than one enclosure with reported failures replace the chassis FRU of the other enclosures with reported failures until the problem is resolved 496 Warning An unsupported disk type was found Reco...

Page 111: ...the enclosure supports The disk is functional but I O performance is reduced This event may be logged for one disk channel or for both disk channels Recommended actions If the disk is a member of a non fault tolerant disk group RAID 0 or non RAID move the data to a different disk group Replace the disk with one of the same type SSD enterprise SAS or midline SAS and the same or greater capacity For...

Page 112: ...ions may continue as long as the system is not restarted Recommended actions If the system is restarted and access to data is intended the lock key must be reinstated 515 Info An FDE disk was repurposed by a user The disk was reset to its original factory state Recommended actions No action is required 516 Error An FDE disk has been placed in the unavailable state The related event message 518 whi...

Page 113: ...4 Error A temperature or voltage sensor reached a critical threshold A sensor monitored a temperature or voltage in the critical range When the problem is resolved event 47 is logged for the component that logged event 524 If the event refers to a disk sensor disk behavior may be unpredictable in this temperature range Check the event log to determine if more than one disk has reported this event ...

Page 114: ...ller EC firmware detected a level of incompatibility with the partner Expander Controller EC This incompatibility could be due to unsupported hardware or firmware As a preventative measure the local Expander Controller EC is holding the partner Expander Controller EC in a reset loop Recommended actions Remove the partner controller module from the enclosure Boot the partner controller module in si...

Page 115: ...t port configuration Recommended actions Replace the killed controller module with a controller module that has the same host port configuration as the surviving controller module 548 Warning Disk group reconstruction failed When a disk fails reconstruction is performed using a spare disk In this case the reconstruction operation failed because unreadable data uncorrectable media error exists in a...

Page 116: ...is resolved an event with the same code will be logged with Resolved severity Warning An EMP reported that a power supply unit PSU has been uninstalled Recommended actions Check that the indicated PSU is in the indicated enclosure If the PSU is not in the enclosure install a PSU immediately If the PSU is in the enclosure ensure that the power supply is fully seated in its slot and that its latch i...

Page 117: ...l temperature threshold in the indicated FRU The temperature sensor is not able to communicate with the EMP Recommended actions If temperature sensor is outside critical temperature threshold in the indicated FRU Check that the ambient temperature is not too warm For the normal operating range see your product s Hardware Installation and Maintenance Guide Check for any obstructions to the airflow ...

Page 118: ...dicated FRU Check that all modules in the enclosure are fully seated in their slots and that their latches if any are locked If this does not resolve the issue the indicated FRU has probably failed and should be replaced If the voltage sensor is not able to communicate with the EMP Wait for at least 10 minutes and check if the error resolves If the error persists check that all modules in the encl...

Page 119: ...will be logged with Resolved severity Warning An expander in a controller module expansion module or drawer is mated but is not responding or an expander in an expansion module has been removed Recommended actions Check that the indicated FRU is in the indicated enclosure If the FRU is not in the enclosure install the appropriate FRU immediately If the FRU is in the enclosure ensure that the FRU i...

Page 120: ...chnical support For all FRU types except the enclosure if the partner FRU is not degraded remove and reinsert the indicated FRU If the indicated FRU is the enclosure set up a preventive maintenance window and power cycle the enclosure at that time If these recommended actions do not resolve the issue the indicated FRU has probably failed and should be replaced If the current sensor is outside crit...

Page 121: ...s No action is required 568 Info A disk group has mixed physical sector size disks for example 512n and 512e disks in the same disk group This event is the result of the user selecting disks with sector formats that do not match or a global spare replacement with a different sector format than the disk group This could result in degraded performance for some workloads Recommended actions No action...

Page 122: ...longer needed Info Allocated snapshot space exceeded either the low or middle snapshot space threshold The threshold settings are intended to indicate that the pool is using a significant portion of configured snapshot space and should be monitored If the storage usage drops below any threshold event 572 is logged Recommended actions Reduce the snapshot space usage by deleting snapshots that are n...

Page 123: ...ss the peer connection which may be due to CHAP configuration changes or a pool out of space condition Recommended actions Resolve the issue specified by the error message included with this event Info A replication completed successfully Recommended actions No action is required 580 Info A replication was aborted Recommended actions No action is required 581 Warning A replication was suspended in...

Page 124: ...s modified Recommended actions No action is required 586 Error Resuming the replication was unsuccessful due to the condition specified within the event Reasons for replication failure include but are not limited to shutdown of the secondary system a loss of communication across the peer connection which may be due to CHAP configuration changes or a pool out of space condition Recommended actions ...

Page 125: ...me accessible the disk group will be dequarantined automatically with a resulting status of FTOL If not all of the disks become accessible but enough become accessible to allow reading from and writing to the disk group it will be dequarantined automatically with a resulting status of FTDN or CRIT If a spare disk is available reconstruction will begin automatically When the disk group has been rem...

Page 126: ...ments to minimize the risk of data loss in the event of drawer failure, so the system had to select a spare that did not meet the requirements. For a RAID 6 disk group, this means that more than two member disks are in the same drawer. For other RAID levels, this means that more than one member disk is in the same drawer. Recommended actions: Replace the indicated failed disk in the indicated enclosure t...
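
The drawer rule above is easy to check mechanically. The following sketch assumes the drawer of each member disk is known as a simple list of drawer identifiers; the function name, the "RAID6" label, and the input layout are hypothetical.

```python
from collections import Counter


def violates_drawer_rule(raid_level, member_drawers):
    """Return True if the disk group breaks the drawer distribution rule.

    raid_level: label such as "RAID6" or "RAID5" (naming is an assumption).
    member_drawers: one drawer identifier per member disk.
    RAID 6 tolerates at most two member disks per drawer; other RAID
    levels tolerate at most one.
    """
    limit = 2 if raid_level == "RAID6" else 1
    return any(count > limit for count in Counter(member_drawers).values())
```

For example, a RAID 6 group with member disks in drawers `[0, 0, 1]` passes, while the same placement for a RAID 5 group does not.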

Page 127: ... the volume An error is possible if the snapshot fails Recommended actions Monitor the health of the local system the replication set the volume and the peer connection A full storage pool may be the cause of this fault Check the peer connection system health and state Ensure that the Maximum Licensable Snapshots limit shown by the CLI show license command was not exceeded 605 Warning Inactive pro...

Page 128: ...ed on a door lock element The door lock element reports status associated with the enclosure drawer The drawer has been reporting as open for a long period of time This may reduce cooling potentially causing the enclosure to overheat Recommended actions Check that the drawer is fully closed and latched When the problem is resolved an event with the same code will be logged with Resolved severity I...

Page 129: ...on an IOM Recommended actions Either install the indicated IOM or attempt to reseat it If the problem persists replace the IOM Warning An alert condition was detected on an IOM Recommended actions If uninstalled install the indicated IOM otherwise attempt to reseat it If the problem persists replace the IOM Info An IOM was uninstalled Recommended actions No action is required Resolved A previous W...

Page 130: ... for the indicated enclosure Recommended actions No action is required 621 Info Degraded ADAPT rebalance operation started This operation takes fault tolerant stripe zones and makes them degraded so critical stripe zones can be made degraded Recommended actions No action is required 622 Info Degraded ADAPT rebalance operation completed This operation takes fault tolerant stripe zones and makes the...

Page 131: ... 635 Warning An I O controller PHY setting was changed by a user Recommended actions No action is required 636 Error The other controller killed this controller for an unknown reason The system will automatically recover Recommended actions Collect the logs and contact technical support for further action 637 Error The other controller killed this controller because it stopped responding via the i...

Page 132: ...nges to SupportAssist State changed Contact information changed Proxy settings changed or cleared Operation mode changed Settings changed Recommended actions No action is required 647 Error This Storage Controller is restarting due to an internal error This Storage Controller experienced a management interface hang and will restart to recover Recommended actions Collect the logs and contact techni...

Page 133: ...Table 28 Event descriptions and recommended actions continued Number Severity Description Recommended actions No action is required Events and event messages 133 ...

Page 134: ...rt 2 Verify the appropriate COM port for use with the CLI 3 If necessary press Enter to display login prompt a Type the user name of a user with the manage role at the login prompt and press Enter b Type the password for the user at the Password prompt and press Enter Topics Micro USB device connection Micro USB device connection The following sections describe the connection to the micro USB port...

Page 135: ... a device driver or special mode of operation. The following table displays the product and vendor identification information that is required for certain operating systems. Table 32. USB identification code. USB identification code type: Code. USB Vendor ID: 0x210C. USB Product ID: 0xA4A7. Microsoft Windows drivers: Windows Server 2016 and later operating systems provide a native USB serial driver that supp...
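
When several USB serial devices are attached to a management host, the identification codes in Table 32 can be used to pick out the enclosure CLI port. The sketch below only compares identifier values; how the vendor and product IDs are obtained depends on the operating system. The helper names are hypothetical, and the `vvvv:pppp` text form is the hexadecimal pair printed by tools such as `lsusb`.

```python
# Identification codes from Table 32.
ENCLOSURE_USB_VENDOR_ID = 0x210C
ENCLOSURE_USB_PRODUCT_ID = 0xA4A7


def parse_usb_id(text):
    """Parse a 'vvvv:pppp' hexadecimal pair into (vendor_id, product_id)."""
    vendor, product = text.split(":")
    return int(vendor, 16), int(product, 16)


def is_cli_port(vendor_id, product_id):
    """Return True if the IDs match the enclosure's micro USB CLI port."""
    return (vendor_id, product_id) == (
        ENCLOSURE_USB_VENDOR_ID,
        ENCLOSURE_USB_PRODUCT_ID,
    )
```

For example, `is_cli_port(*parse_usb_id("210c:a4a7"))` evaluates to True, while any other vendor and product pair is rejected.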

Page 136: ...F disks in the DDIC carrier It can also use 2 5 SFF disks with 3 5 adapter in the DDIC Enclosure weights Table 35 2U12 2U24 and 5U84 enclosure weights CRU component 2U12 kg lb 2U24 kg lb 5U84 kg lb Storage enclosure empty 4 8 10 56 4 8 10 56 64 141 Disk drive carrier 0 9 1 98 0 3 0 66 0 8 1 8 Blank disk drive carrier 0 05 0 11 0 05 0 11 Power Cooling Module PCM 3 5 7 7 3 5 7 7 Power Supply Unit PS...

Page 137: ...must be operated with low pressure rear exhaust installation Back pressure created by rack doors and obstacles not to exceed 5Pa 0 5 mm H2O Altitude operating 2U enclosures 0 to 3 000 meters 0 to 10 000 feet Maximum operating temperature is de rated by 5ºC above 2 133 meters 7 000 feet 5U84 enclosures 100 to 3 000 meters 330 to 10 000 feet Maximum operating temperature is de rated by 1ºC above 900...

Page 138: ... 80 10 load 87 20 load 88 20 load 90 50 load 92 50 load 87 100 load 88 100 load 85 surge 85 surge Harmonics Meets EN61000 3 2 Output 5 V 42A 12 V 38A 5 V standby voltage 2 7A Operating temperature 0ºC to 57ºC 32ºF to 135ºF Hot pluggable Yes Switches and LEDs AC mains switch and four status indicator LEDs Enclosure cooling Dual axial cooling fans with variable fan speed control Power supply unit Ta...

Page 139: ...cription 91 100 load Holdup time 5 ms from ACOKn high to rails out of regulation see SBB v2 specification Main inlet connector IEC60320 C20 with cable retention Weight 3 kg 6 6 lb Cooling fans Two stacked fans 80 mm x 80 mm x 38 mm 3 1 in x 3 15 in x 1 45 in Technical specifications 139 ...

Page 140: ...s not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment Unauthorized changes or modifications could void the user s authority to operate the equipment This device complies with Part 15 of the FCC Rules Operation is subject to the following two conditions 1 this device may n...

Page 141: ...requirements Chassis form factor 2U12 2U24 5U84 Cable type Harmonized H05VV F 3G1 0 Harmonized H05VV F 3G2 5 Plug AC source IEC 320 C14 250V 10A A suitable plug rated 250V 16A IEC 320 C20 250V 16A A suitable plug rated 250V 16A Socket IEC 320 C13 250V 10A IEC 320 C19 250V 16A NOTE The plug and the complete power cable assembly must meet the standards appropriate to the country and must have safety...
