
MK-90CRH002-01

 

 

Hitachi Compute Rack 210H 

User’s Guide 

 

 

 
 
 
 
 

FASTFIND LINKS

 

Document Organization 

Getting Help 

Contents

 

 

 

Summary of Contents for Compute Rack 220H

Page 1: ...MK-90CRH002-01 Hitachi Compute Rack 210H User's Guide FASTFIND LINKS Document Organization Getting Help Contents ...

Page 2: ...ailable this entire document will be updated and distributed to all registered users All of the features described in this document may not be currently available For information about features and product availability refer to the most recent product announcement or contact Supply info Notice Hitachi products and services can be ordered only under the terms and conditions of Hitachi s applicable ...

Page 3: ...ion 1-1
  Avoiding trouble 1-2
  Bundled Software 1-5
Monitoring operating status 2-1
  Fault detection 2-2
  Operation monitoring 2-8
System Unit Functions 3-1
  Disk arrays 3-2
  Redundancy 3-17
  Wake On LAN 3-18
  Memory RAS 3-20
  LAN extended functions 3-28
Operational precautions 4-1
  Precautions on LAN controller 4-2
Maintenance and service parts 5-1
  Daily maintenance items 5-2
  Cleaning 5-2
  Service parts 5-4...

Page 4: ...iv Contents Hitachi Compute Rack 210H User s Guide Troubleshooting 6 1 Solving problems 6 2 Corrective actions for error 6 3 Acronyms and Abbreviations Index ...

Page 5: ...or the Compute Rack 210H CR 210H Please read this document carefully and maintain a copy for reference This preface includes the following information Intended audience Document revision level Document organization Document conventions Getting help Comments Notice The use of the Compute Rack is governed by the terms of your agreements with Hitachi ...

Page 6: ...ocument organization The table below provides an overview of the contents and organization of this document Click the chapter title in the left column to go to that chapter The first page of each chapter provides links to the sections in that chapter Chapter Description Chapter 1 Before operation Describes what you should understand before operation Chapter 2 Monitoring operating status Describes ...

Page 7: ...are brackets: Indicates optional values. Example: [ a | b ] indicates that you can choose a, b, or nothing. { } braces: Indicates required or expected values. Example: { a | b } indicates that you must choose either a or b. | vertical bar: Indicates that you have a choice between two or more options or arguments. Examples: [ a | b ] indicates that you can choose a, b, or nothing. { a | b } indicates that you must choose either a or b. underlin...

Page 8: ...ges including reference codes and severity levels displayed and or logged at the Compute Rack The supply info support staff are available 24 hours a day seven days a week For technical support visit the portal site at supply URL For contact information visit supply URL Comments Please send us your comments on this document if any by e mail to supply e mail Make sure that the e mail includes the do...

Page 9: ...1 Before operation 1 1 Hitachi Compute Rack 210H User s Guide Before operation This chapter describes what you should understand before operation Avoiding trouble Bundled Software ...

Page 10: ... For verification of the event log of software attached to the system unit, see the individual software manuals. See "Bundled Software" on page 1-5. Consistency check: In a disk array, a reassign (data migration to a reserved area) is performed if a bad block (inaccessible area) is detected on the HDD during the read/write process. However, if a bad block exists in the mirror data part as well as in the area ...

Page 11: ... computer is infected, we recommend disconnecting your LAN cable immediately and isolating the computer from the network to prevent secondary infection. System scan: Periodically check whether computer viruses are hidden. It is convenient to use the anti-virus software's scheduling function, which enables automatic periodic scanning. Downloading the latest data: It is essential to always update anti v...

Page 12: ... be stored on tape or similar media to restore the system to its most previous working state if data loss occurs It is also possible to restore the system to normal by creating system information backup data if system data become lost or corrupted We recommend backing up the system data periodically For details on how to create and restore backup data see the Windows Help or visit the Microsoft we...

Page 13: ...ause a double failure and loss of data. For details about using this utility, see the MegaRAID Storage Manager Instruction Manual. Hardware Maintenance Agent: The Hardware Maintenance Agent is a utility necessary for maintenance of the system unit. If a failure occurs on the system unit, this utility analyzes the failure automatically, thus facilitating identification of the failure and shortening system ...

Page 14: ...1 6 Before operation Hitachi Compute Rack 210H User s Guide This page is intentionally left blank ...

Page 15: ...ng operating status 2-1 Hitachi Compute Rack 210H User's Guide Monitoring operating status: This chapter describes system unit fault detection and methods of monitoring operation. Fault detection; Operation monitoring ...

Page 16: ...ous code is cleared when the power is turned on (POWER switch is turned on). 3. OS event log: Records an event as a system log, a security log, or an application log. Stores an event log generated at a system level or an application level, as well as an error message, in the following directory: %SystemRoot%\SYSTEM32\config. The logs can be referenced by using an event viewer. 4. STOP message: Indicates if an erro...

Page 17: ...an choose what is displayed as the operation status (the event code, the POST code, or the power consumption) using the SERVICE switch. The combination of ON/OFF states of the MODE0 LED, MODE1 LED, and SERVICE LED determines what the MAINTENANCE LEDs currently indicate, as follows: Table 2-2 What th...

Page 18: ...git. LEDs 5 through 8 indicate the lower digit. The smaller the LED number, the higher the bit that the LED represents. If the Power-On Self-Test (POST) is successful, only LEDs 1, 3, 5, 6, and 7 are turned on, which means that AE is displayed. Figure 2-3 POST code indication. The MAINTENANCE LEDs keep displaying the POST code unless the AC power source is turned off. In addition, when an error has occurred, the MAINTEN...
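The LED-to-digit rule above can be sketched in a few lines of Python. This is an illustration only, not part of the guide: the function name is hypothetical, and it assumes that LEDs 1 through 4 form the upper hexadecimal digit (the excerpt only shows the lower-digit sentence in full) and that LED 1 is the most significant bit.

```python
def decode_maintenance_leds(lit_leds):
    """Return the two-digit hex code for a set of lit MAINTENANCE LEDs (1-8).

    Assumption: LED 1 is the most significant bit of the upper digit and
    LED 8 is the least significant bit of the lower digit.
    """
    value = 0
    for led in lit_leds:
        if not 1 <= led <= 8:
            raise ValueError(f"LED number out of range: {led}")
        # LED 1 maps to bit 7, LED 8 maps to bit 0.
        value |= 1 << (8 - led)
    return f"{value:02X}"


# The successful-POST pattern described in the text.
print(decode_maintenance_leds([1, 3, 5, 6, 7]))  # AE
```

LEDs 1 and 3 give binary 1010 (A) and LEDs 5, 6, and 7 give binary 1110 (E), which matches the AE code stated for a successful POST.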

Page 19: ... consumption
All Off | Less than 200 W
Only 1 is On | 200 W or more and less than 250 W
1 and 2 are On | 250 W or more and less than 300 W
1 through 3 are On | 300 W or more and less than 350 W
1 through 4 are On | 350 W or more and less than 400 W
1 through 5 are On | 400 W or more and less than 450 W
1 through 6 are On | 450 W or more and less than 500 W
1 through 7 are On | 500 W or more and less than 550 W
1 ...
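The power-consumption table follows a regular 50 W step, which the following Python sketch reproduces. The function name is hypothetical, and only the rows visible in this excerpt (zero through seven LEDs lit) are covered; the row for all eight LEDs is cut off in the source and is deliberately not guessed.

```python
def power_range_watts(leds_on):
    """Map the number of lit MAINTENANCE LEDs to a (lower, upper) range in watts.

    The lower bound is inclusive and the upper bound is exclusive.
    A lower bound of None means "less than the upper bound".
    """
    if not 0 <= leds_on <= 7:
        # The row for eight LEDs is truncated in the excerpt, so it is not modeled.
        raise ValueError("only 0 through 7 lit LEDs are covered by the visible table")
    if leds_on == 0:
        return (None, 200)
    lower = 150 + 50 * leds_on
    return (lower, lower + 50)


print(power_range_watts(0))  # (None, 200)
print(power_range_watts(3))  # (300, 350)
```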

Page 20: ...Online http support microsoft com STOP message and memory dump If the Windows OS has been trapped in a critical state for some reason a blue screen appears and a STOP message is displayed At this time Windows can save the STOP message information in a dump file This is referred to as a memory dump If Windows is hung up the cause of a failure can be narrowed down by analyzing the memory dump The fo...

Page 21: ...er s Guide Error notification by utilities Utility programs such as MegaRAID Storage Manager use a popup message or an internal event log to be displayed for error notification For more details on error notification of those utilities see the individual manuals ...

Page 22: ...eck the operating status of the system unit or internal options for errors, such as a lit ERROR LED or abnormal fan noise: every six months. 2. Check the OS event log for errors: every day. 3. Check for error notifications by utilities: every day. To monitor the operation of the system unit, see the following manuals. Operating status of system unit and internal options: See Maintenance and service parts...

Page 23: ...ons 3 1 Hitachi Compute Rack 210H User s Guide System Unit Functions This chapter describes the functions of the system unit that are useful in operation Disk arrays Redundancy Wake On LAN Memory RAS LAN extended functions ...

Page 24: ...ry in order to prevent data loss in case one of the HDDs in the disk array should fail NOTICE Even the disk array cannot prevent data loss that is caused by a non HDD failure software overrun or operation errors Just in case back up your system data Disk array A virtual drive consisting of multiple physical drives which cannot be recognized by the OS as it is Logical drive A logical drive configur...

Page 25: ...ler. The method and features of each RAID level are described below. RAID0 [Figure 3-1 RAID0 method: the host writes data through the array controller, which stripes it across five disks. Disk 1 holds Blocks 1 and 6, Disk 2 holds Blocks 2 and 7, Disk 3 holds Blocks 3 and 8, Disk 4 holds Blocks 4 and 9, and Disk 5 holds Blocks 5 and 10.] Data is striped on multiple HDDs. Advantage: The throughput is improved, particularly for a large number of files. Disadvantage: All data is lost if one of the hard ...
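The block placement in Figure 3-1 is a round-robin assignment, sketched below in Python. The function names are hypothetical, and the five-disk default simply mirrors the figure; real controllers stripe in configurable stripe sizes rather than single numbered blocks.

```python
def disk_for_block(block, num_disks=5):
    """Return the 1-based disk number that holds a 1-based block under striping."""
    if block < 1 or num_disks < 1:
        raise ValueError("block and num_disks must be positive")
    return (block - 1) % num_disks + 1


def stripe_layout(num_blocks, num_disks=5):
    """Return a dict that maps each disk number to the list of blocks it holds."""
    layout = {disk: [] for disk in range(1, num_disks + 1)}
    for block in range(1, num_blocks + 1):
        layout[disk_for_block(block, num_disks)].append(block)
    return layout


print(stripe_layout(10))
# {1: [1, 6], 2: [2, 7], 3: [3, 8], 4: [4, 9], 5: [5, 10]}
```

Because every disk holds part of every large write, losing any one disk makes the whole data set unreadable, which is the disadvantage noted above.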

Page 26: ...k Advantage Provides 100 data redundancy thus enabling easy switching between two disks to continue read or write processing if a disk failure occurs HDD interchange also allows data reconstruction without shutdown operations Disadvantage A double HDD capacity is required because a mirrored disk of the same capacity should be installed An attempt to read or write data during data rebuild causes a ...

Page 27: ...ntly Data striping on a block basis is suitable for transaction processing Even if one of the HDDs in the disk array fails read or write processing can be continued while array parity is computing the lost data HDD interchange enables data reconstruction without shutting down operations The array parity distributed on each HDD has the advantage that parallel processing by independent access to HDD...
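How parity lets the array recompute lost data can be illustrated with a small Python sketch. It assumes the conventional RAID5 scheme in which the parity block is the bytewise XOR of the data blocks in a stripe; the guide does not spell out the controller's internal method, and the block contents here are made-up sample bytes.

```python
from functools import reduce


def xor_blocks(blocks):
    """Return the bytewise XOR of equally sized byte blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# Three data blocks of one stripe (sample values) and their parity block.
data = [b"\x01\x02", b"\x10\x20", b"\xaa\x55"]
parity = xor_blocks(data)

# Simulate losing the second disk: XOR the survivors with the parity block.
rebuilt = xor_blocks([data[0], data[2], parity])
print(rebuilt == data[1])  # True
```

The same calculation runs on every read of a failed disk's blocks in degraded mode, which is why processing performance drops until the rebuild completes.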

Page 28: ...ently Data striping on a block basis is suitable for transaction processing Even if up to two of the HDDs in the disk array fail read or write processing can be continued while array parity is computing the lost data HDD interchange enables data reconstruction without shutting down operations The array parity distributed on each HDD has the advantage that parallel processing by independent access ...

Page 29: ... easy switching between two disks to continue read or write processing if a disk failure occurs HDD interchange also allows data reconstruction without shutdown operations Particularly the throughput is improved for a large number of files Typically RAID10 is superior to RAID5 in write performance because no array parity is generated Disadvantage A double hard disk capacity is required because a m...
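The capacity trade-offs described for each RAID level can be compared with a short Python sketch. The formulas are the conventional ones (mirroring halves capacity, RAID5 gives up one disk to parity, RAID6 gives up two); the function name and the minimum disk counts checked here are assumptions for illustration, not limits taken from the guide.

```python
def usable_capacity_gb(level, num_disks, disk_gb):
    """Return the usable capacity of an array of identical disks, in GB."""
    if level == "RAID0":
        if num_disks < 1:
            raise ValueError("RAID0 needs at least one disk")
        return num_disks * disk_gb
    if level == "RAID1":
        if num_disks != 2:
            raise ValueError("this sketch models RAID1 as a two-disk mirror")
        return disk_gb
    if level == "RAID5":
        if num_disks < 3:
            raise ValueError("this sketch assumes at least three disks for RAID5")
        return (num_disks - 1) * disk_gb
    if level == "RAID6":
        if num_disks < 3:
            raise ValueError("this sketch assumes at least three disks for RAID6")
        return (num_disks - 2) * disk_gb
    if level == "RAID10":
        if num_disks < 4 or num_disks % 2:
            raise ValueError("this sketch assumes an even count of four or more disks")
        return num_disks // 2 * disk_gb
    raise ValueError(f"unknown RAID level: {level}")


for level, disks in [("RAID0", 4), ("RAID5", 4), ("RAID6", 4), ("RAID10", 4)]:
    print(level, usable_capacity_gb(level, disks, 300))
```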

Page 30: ...BOD [Figure 3-6 JBOD method: the host writes data through the array controller to Disk 1, which holds Blocks 1 and 2.] Data is stored on only one HDD. JBOD provides no data redundancy and works like an HDD connected to a typical SAS/SATA controller. Number of HDDs required: 1. JBOD is RAID0 configured with one HDD ...

Page 31: ...n required causes a decrease in processing performance as compared to the normal status RAID6 can operate even if two HDDs fail If a HDD fails in the drive group having multiple logical drives all the disks under the drive group will operate in degraded mode Rebuilding data In the disk array of RAID1 5 6 or 10 after the failed disk is replaced the array controller restores and stores data on the r...

Page 32: ...ilure occurs on the logical drive of RAID1, 5, 6, or 10, the failed disk must be replaced to return the system to normal status. If a spare disk (reserve disk) is premounted on the array controller, data can be automatically restored on the reserve disk in case of a failure. Preparing this reserve disk so that it can be exchanged at any time is referred to as a hot spare. A function for the array controller to...

Page 33: ...ther HDD under rebuilding. However, processing performance is lower than in the normal state while data is being copied by SMART copyback. [Figure legend: Active disk, Reserved disk, Failed disk. Steps: A SMART error occurred in DISK1. Copy DISK1 data to reserve disk DISK6. DISK1 is registered as a failed disk after the copy finishes. Re-run with DISK2 to DISK6.] ...

Page 34: ...[Figure: the disk array (DISK 1 to DISK 6) at each stage. Legend: Active disk, Reserved disk, Failed disk. Steps: An error occurred in DISK1. Transition to degraded mode. Rebuild DISK1 data to reserved disk DISK6. Re-run ...]

Page 35: ...[Figure 3-9 A case without a reserve disk: the disk array (DISK 1 to DISK 5) at each stage. Legend: Active disk, Failed disk. Steps: An error occurred in DISK1. Transition to degraded mode. Replace DISK1, the failed disk (operator). Rebuild DISK1 data. Re-run with DISK1 to DISK5.] ...

Page 36: ...management install in a successive extension storage bay the HDDs to be used with a single disk array We also recommend to record disk array and logical drive configurations If a failure occurs however the installation location depends on whether a reserve disk is provided For this reason make a record each time 1 2 3 4 5 6 A 0 A 1 B 0 B 1 B 2 B 3 1 2 3 4 5 6 B 0 A 0 B 1 B 2 A 1 B 3 Extension stor...

Page 37: ...arrays with a RAID level of 1 5 6 or 10 The reserve disk functions as a hot spare on either disk array Depending on a HDD hot spare the following configuration is applied 1 2 3 4 5 6 A 0 A 1 B 0 B 1 B 2 Reserve A 0 Reserve B 0 B 1 B 2 A 1 Extension storage bay Rebuild Replace disk Becomes reserved disk Extension storage bay Rebuild Replace disk Becomes reserved disk 1 2 3 4 5 6 A 0 Reserve B 0 B 1...

Page 38: ...her in disk array A or B If the hot spare is used in disk array A A and C are applied to the case where If there are two or more disk arrays with a RAID level of 1 5 6 or 10 an unused area will exist on the rebuilt HDD because the reserve disk is greater in capacity than hard disk A x used in disk array A If the reserve disk has the same capacity as A x The reserve disk works as a hot spare only i...

Page 39: ...two power supplies to different power sources respectively if installed In case of a failure on one power source the system unit can operate without a shutdown as long as power is still supplied from the other source If a failure occurs in one power supply contact the sales representative or maintenance personnel Replace the failed power supply as soon as possible System fans The system unit has s...

Page 40: ...te management function only on the Web console Support conditions for the Wake On LAN The Wake On LAN function is supported under the following conditions Supported OS Windows Server 2008 R2 Supported LAN devices Onboard LAN controller 1 Only network interface connector 1 on the rear of the system unit is available Figure 3 14 Network interface connector 1 Network interface connector 2 management ...
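For readers who want to trigger Wake On LAN from another machine, the following Python sketch builds and sends the standard Wake-on-LAN magic packet: six 0xFF bytes followed by the target MAC address repeated 16 times. This is an illustration under assumptions, since the excerpt does not describe the wake-up packet itself: the function names, the broadcast address, UDP port 9, and the sample MAC address are all placeholders.

```python
import socket


def build_magic_packet(mac):
    """Build a Wake-on-LAN magic packet for a MAC such as '00:11:22:33:44:55'."""
    hex_digits = mac.replace(":", "").replace("-", "")
    if len(hex_digits) != 12:
        raise ValueError(f"not a 48-bit MAC address: {mac}")
    mac_bytes = bytes.fromhex(hex_digits)
    # Synchronization stream of six 0xFF bytes, then the MAC sixteen times.
    return b"\xff" * 6 + mac_bytes * 16


def send_magic_packet(mac, broadcast="255.255.255.255", port=9):
    """Send the magic packet as a UDP broadcast (address and port are assumptions)."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        sock.sendto(build_magic_packet(mac), (broadcast, port))


packet = build_magic_packet("00:11:22:33:44:55")  # placeholder MAC address
print(len(packet))  # 102
```

The MAC address to use is that of network interface connector 1, because the guide states that only that connector supports the function.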

Page 41: ... switch is pressed to turn off the power before the OS is activated during BIOS POST If the power is turned off using the UPS management software If the power is not supplied to the system unit such as the AC cable is disconnected or the circuit breaker is tripped or in case of a power failure If the system is shut down when connected to link partner the Wake On LAN enabled onboard LAN controller ...

Page 42: ...es preventive maintenance of memory switching before an uncorrectable memory error Uncorrectable Error 2 bit error occurs These are the following conditions for using the online spare memory function Two or more memory boards should be installed per channel 8 or 12 memory boards for 4 channels per processor All memory boards should have the same capacity and same model When two processors are inst...

Page 43: ... Memory capacity | Number of ranks
MJ702GL3 Y, MJ702GL3 R | 2048 MB | 1
MJ704GL3 Y, MJ704GL3 R | 4096 MB | 1
MJ708GL3 Y, MJ708GL3 R | 8192 MB | 2
MJ716GL3 Y, MJ716GL3 R | 16384 MB | 2
In case of a memory error, memory switching is performed on a rank basis. For instance, on the assumption that a memory board having two ranks is installed on memory slots 1 and 5 (channel 1 of processor 1) and a memory board of rank 0 in me...

Page 44: ...s used as a spare memory Therefore the memory capacity is displayed smaller than the actual one On the system BIOS setup menu confirm Chipset North Bridge Total Memory Depending on a memory board installed the spare memory capacity per channel is as follows Table 3 2 Spare memory capacity Installed memory boards Spare memory capacity per channel MJ702GL3 Y MJ702GL3 R 2048 MB MJ704GL3 Y MJ704GL3 R ...

Page 45: ...ry boards should be of the same configuration for channels 0 and 1 as well as channels 2 and 3 of each processor All memory boards should have the same capacity and same model When two processors are installed the same memory configuration should be applied to processor 1 and processor 2 Memory mirroring divides memory boards into a primary mirror depending on channels installed and configures two...

Page 46: ...primary slot The precautions on use of memory mirroring are as follows To validate the memory mirroring function set Chipset North Bridge Memory Mode to Mirroring on the system BIOS setup menu When the memory mirroring function is set valid the half of the memory boards installed are used as a mirror and thus the actual memory capacity used becomes half On the system BIOS setup menu confirm Chipse...

Page 47: ...detection of two 4 bit DRAM device failures These are the following conditions for using the lock step function Memory boards should be of the same configuration for channels 0 and 1 as well as channels 2 and 3 of each processor All memory boards should have the same capacity and same model When two processors are installed the same memory configuration should be applied to processor 1 and process...

Page 48: ...t down You cannot use the lock step function simultaneously with online spare memory memory mirroring and device tagging Device tagging Memory device tagging is a function for providing redundancy on a DRAM chip basis so that the system can operate without a shutdown even if one DRAM chip on a memory board breaks down Normally memory generates an ECC from data and stores on each DRAM chip separate...

Page 49: ... 1-bit error or single DRAM chip error, automatic correction is continued. At this time, the ERROR LED does not turn on. For the device tagging function, only one memory board is used for operation per channel. If DRAM chips on multiple memory boards in one channel fail, the device tagging function does not work and the system is shut down. If an uncorrectable memory error occurs, ECC based error correct...

Page 50: ...led network adapter is changed over to the backup adapter automatically to shift processing. LAN device load distribution function: Provides expanded bandwidth of a network by combining two network adapters. This function distributes the traffic load of transmit data to each adapter. Switch redundancy function: Provides high reliability of a network by combining two network adapters with two switching ...

Page 51: ...4 Operational precautions 4 1 Hitachi Compute Rack 210H User s Guide Operational precautions This chapter describes operational precautions Precautions on LAN controller ...

Page 52: ...th a function for their LAN controller to perform TCP/IP protocol checksum calculations. However, we recommend that you do not use this function and instead use the standard TCP/IP checksum calculation function of the OS. If the calculation by the OS is set up, the integrity of packet data received from a network is checked in the final stage of protocol processing of the OS, thus enabling construction of a mor...
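As background on what the OS computes when checksum offload is disabled, the following Python sketch implements the 16-bit ones' complement Internet checksum used by IP, TCP, and UDP. It is a simplified illustration: the function name is hypothetical, and a real TCP or UDP checksum also covers a pseudo-header that is omitted here.

```python
def internet_checksum(data: bytes) -> int:
    """Return the 16-bit ones' complement checksum of data."""
    if len(data) % 2:
        data += b"\x00"  # pad odd-length input with a zero byte
    total = 0
    for i in range(0, len(data), 2):
        total += (data[i] << 8) | data[i + 1]  # add big-endian 16-bit words
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)  # fold carries back into 16 bits
    return ~total & 0xFFFF


payload = bytes.fromhex("0001f203f4f5f6f7")
checksum = internet_checksum(payload)
print(f"{checksum:04x}")  # 220d

# A receiver recomputes over payload plus checksum; 0 means the data is intact.
print(internet_checksum(payload + checksum.to_bytes(2, "big")))  # 0
```

Running this check in the OS rather than on the LAN controller means the data is verified after it has crossed the controller and the system bus, which is the final-stage integrity check the text refers to.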

Page 53: ...roperties settings
Item | Broadcom Advanced Control Suite 4 not installed | Broadcom Advanced Control Suite 4 installed
IPv4 Checksum Offload | Rx Tx Enabled None | Rx Tx Enabled None
TCP/UDP Checksum Offload (IPv4) | Rx Tx Enabled None | Rx Tx Enabled None
TCP/UDP Checksum Offload (IPv6) | Rx Tx Enabled None | Rx Tx Enabled None
Large Send Offload (IPv4) | Enable Disable | Enable Disable
Large Send Offload v2 (IPv4) | Enable ...

Page 54: ...4 4 Operational precautions Hitachi Compute Rack 210H User s Guide This page is intentionally left blank ...

Page 55: ...ice parts 5 1 Hitachi Compute Rack 210H User s Guide Maintenance and service parts This chapter describes daily maintenance procedures service parts and consumables Daily maintenance items Cleaning Service parts Consumables ...

Page 56: ...al noise 6 months Internal DVD ROM Clean Clean pickup lens with cleaning kits In case of a media read error If the system unit is used in a dusty environment clean the unit once every month Cleaning This section describes how to clean the system unit and its standard devices For the cleaning of other optional devices see their individual manuals System unit For the following procedure see Hitachi ...

Page 57: ...Figure 5 1 Cleanup location of the system unit 5 Before connection remove dust from the connectors of interface cable and system unit using a dry cloth 6 Wipe dust off the plug of the power cable and connect the cable to an outlet and system unit Internal DVD ROM Clean the pickup lens in case of a media read error For purchasing a cleaning kit contact the sales representative For the cleaning meth...

Page 58: ...parts Part name Product code Remarks Internal DVD ROM Standard of system unit 1 Notes 1 If the drive is used under the installation environment provided in Hitachi Compute Blade 2000 1000 320 and Hitachi Compute Rack 220 210 Site Planning Guide the energization life time is approximately 13 000 hours If the drive is used for 24 hours a day and for 30 days a month its lifetime will be approximately...

Page 59: ...6 Troubleshooting 6-1 Hitachi Compute Rack 210H User's Guide Troubleshooting: This chapter describes troubleshooting of the system unit. Solving problems; Corrective actions for error ...

Page 60: ... the documents of devices. For any questions on a manual's description, contact the sales representative. 3. System unit does not work normally or an error occurs: See "Corrective actions for error" on page 6-3. 4. System unit infected with a computer virus: Disconnect the network cable and follow the instructions in the manual attached to your anti-virus software. 5. Want to change the disk array ...

Page 61: ...or message appears on page 6 5 4 The ERROR LED on the system unit lights See Errors during use on page 6 16 5 Memory capacity is smaller than the actual one Check if memory is installed normally See Hitachi Compute Rack 210H CRU Replacement Guide The available memory capacity might decrease due to the effect of a memory hole Check if the online spare memory function or memory mirroring function ha...

Page 62: ... its use. Geomagnetic effects or color shading might occur on the display: Turn off the power once and wait at least 30 minutes before restarting. Check whether the system unit is too close to the display. Keep the system unit at a proper distance from the display or increase the refresh rate setting. 11. Only the mouse cursor appears: The cause cannot be identified. Contact the sales representative or maintenance pe...

Page 63: ...d A BMC error has been detected 1 2714 Cannot set BMC network configuration A BMC error has been detected 1 2716 BMC is not ready A BMC error has been detected 1 2717 Power supply configuration error A power supply configuration error has been detected 1 3B01 System battery is dead A battery error has been detected 1 3B02 Check date and time settings A system clock error has been detected 2 BMC ha...

Page 64: ...essages during POST Error message Description Action Cache data was lost because of an unexpected power off or reboot during a write operation but the adapter has recovered This could be because of memory problems bad battery or you may not have a battery installed Press any key to continue or press C to load the configuration utility If this message appears even though an illegal power shutdown o...

Page 65: ...o disable this warning if your controller does not have a battery Cache backup module information is not set correctly Press D while this message is appearing 3 The battery hardware is missing or malfunctioning the battery is unplugged or the battery could be fully discharged If you continue to boot the system the battery backed cache will not function If the battery is connected and has been allo...

Page 66: ...ny key to continue or C to load the configuration utility Some of the disk array configurations have been removed 4 All of the disks from your previous configuration are gone If this is an unexpected message then power off your system and check your cables to ensure all disks are present Press any key to continue or press C to load the configuration utility All the HDDs with the disk array configu...

Page 67: ...placement. If you continue, data corruption can occur. Press X to continue or else power off the system and replace the DIMM module and reboot. If you have replaced the DIMM press X to continue. | A cache memory error has occurred in the disk array controller. | 1
Multibit ECC errors were detected on the RAID controller. If you continue, data corruption can occur. Contact technical support to resolve this issue...

Page 68: ...pter was recovered but cached data was lost Press any key to continue or press C to load the configuration utility Firmware version inconsistency has been detected 1 Firmware Failed Validation Adapter needs to be refreshed Firmware version inconsistency has been detected 1 The most recent configuration command could not be committed and must be retried Press any key to continue or press C to load ...

Page 69: ...S topology detected. Check your cable configurations, repair the problem, and restart your system. | An invalid SAS topology has been detected. | 1
Invalid SAS Address present in SBR. Contact your system support. Press any key to continue with the default SAS address. | An invalid SAS address exists. | 1
Invalid SAS Address present in MFC data. Program the valid SAS Address, and restart your system. | An invalid SAS add...

Page 70: ... or C to load the configuration utility The HDD security function is not supported 1 Invalid pass phrase If you continue there will be a drive security key error and all secure configurations will be marked as foreign Reboot the machine to retry the pass phrase or press any key to continue The HDD security function is not supported 1 Unable to communicate to EKMS If you continue there will be a dr...

Page 71: ...rational If VDs have not returned to write back mode after 30 minutes of charging then contact technical support for additional assistance The following VD is affected X Press any key to continue The cache backup module information is illegal 1 Two BBUs are connected to the adapter This is not a supported configuration Battery and caching operations are disabled Remove one BBU and reboot to restor...

Page 72: ...n result in inaccessible data unless it is addressed Reattach the upgrade key and reboot An upgrade key is not supported 1 Serial Boot ROM SBR device is corrupt or bad Please contact Tech Support Serial Boot ROM SBR device malfunction 1 Notes 1 Contact the sales representative or maintenance personnel 2 Set correct information on the RAID BIOS MegaRAID WebBIOS 3 If the system does not recover from...

Page 73: ...dia in the appropriate drive Missing OS OS not found If the above messages appear verify the setup of the system BIOS or RAID BIOS See Hitachi Compute Rack 210H 220H BIOS Guide If the system does not recover the boot information storage area of the HDD might be destroyed Re install the OS If the OS cannot be booted after re installation replace the HDD In this case contact the sales representative...

Page 74: ...personnel ERROR LED lights HDD status LED lights amber A failure has occurred on the internal HDD Replace the HDD Contact the sales representative or maintenance personnel ERROR LED lights Power supply LED lights amber A power supply failure has occurred Replace the power supply Contact the sales representative or maintenance personnel Power supply LED blinks amber A power supply is warning status...

Page 75: ...D: Digital Versatile/Video Disc
FCC: Federal Communications Commission
HDD: Hard Disk Drive
ID: Identity Document
IP: Internet Protocol
LAN: Local Area Network
OS: Operating System
PC: Personal Computer
POST: Power On Self Test
TPM: Trusted Platform Module
URL: Uniform Resource Locator
USB: Universal Serial Bus
UTP: Unshielded Twisted Pair
WAN: Wide Area Network
WEEE: Waste Electrical and Electronic Equipment ...

Page 76: ...2 Acronyms and Abbreviations Hitachi Compute Rack 210H User s Guide This page is intentionally left blank ...

Page 77: ...Index Index 1 Hitachi Compute Rack 210H User s Guide Index ...

Page 78: ...Index 2 Index Hitachi Compute Rack 210H User s Guide This page is intentionally left blank ...

Page 79: ...Hitachi Compute Rack 210H User s Guide ...
