
Initializing previously used good hard disk drives

A previously used good hard disk drive is defined in this document as a drive that
was previously a member of an array that was attached to a ServeRAID-8 series
controller. The drive is being reused within a new RAID configuration. Before you
can add a previously used good hard disk drive to a new array, either as a
replacement drive or to expand an array, you must first initialize the drive to remove
existing configuration information. Existing configuration information can cause the
ServeRAID controller to behave differently than expected and, in some rare cases,
can result in data loss.

You can initialize a hard disk drive by using the Array Configuration Utility (ACU)
(accessible by pressing Ctrl+A when you are prompted at system startup), the IBM
ServeRAID Support CD version 8 or 9, the ServeRAID Manager program (from
within the operating system), or the arcconf task command.
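As a rough illustration of the arcconf route, the following Python sketch only assembles the command line for initializing one physical drive; it does not run anything. The argument order (controller number, then the keyword device, then channel and device ID, then initialize) is an assumption based on general arcconf usage and is not confirmed by this section, and the function name build_initialize_command is hypothetical. Check the arcconf help output for your controller before relying on it, because initialization removes array and logical drive information from the disk.

```python
from typing import List


def build_initialize_command(controller: int, channel: int, device_id: int) -> List[str]:
    """Return an argv list for initializing one physical drive (nothing is executed)."""
    for name, value in (("controller", controller),
                        ("channel", channel),
                        ("device_id", device_id)):
        if value < 0:
            raise ValueError(f"{name} must be non-negative, got {value}")
    # Assumed syntax (verify against your arcconf version):
    #   arcconf task start <controller> device <channel> <id> initialize
    return ["arcconf", "task", "start", str(controller),
            "device", str(channel), str(device_id), "initialize"]


if __name__ == "__main__":
    # Print the command for review instead of running it.
    print(" ".join(build_initialize_command(1, 0, 3)))
```

Printing the command for review before running it by hand keeps a destructive operation under operator control.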

For more information, see the following RETAIN tips.

A rebuild does not start after replacing a defunct drive - IBM ServeRAID

http://www-304.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=MIGR-5074510
(Type 5074510 in the Search field at http://www.ibm.com.)

Lost configuration when drive added - ServeRAID 8k, 8k-l

http://www-304.ibm.com/systems/support/supportsite.wss/docdisplay?brandind=5000008&lndocid=MIGR-5073723
(Type 5073723 in the Search field at http://www.ibm.com.)

Synchronizing logical drives after upgrading firmware from build 8263 or earlier

ServeRAID firmware builds 7777 through 8263 use a legacy method to track
bad-stripe unit errors. Starting with firmware build 8264, bad-stripe units are
managed with a bad-stripe table. The firmware upgrade process does not import
existing bad-stripe information from the legacy method into the new table, so the
new code perceives the legacy bad-stripe markers as disk errors when the affected
stripe units are read. In most cases, these errors are corrected automatically
without user intervention. However, in rare cases, if a single drive reports a large
number of these errors, the drive might be marked Defunct prematurely.

To avoid this situation, start a synchronization on each logical drive. The
synchronization process scrubs the physical drives, corrects the errors as they are
detected, and, when applicable, creates an equivalent bad-stripe table entry.
Note that the ServeRAID-8 series controllers are designed to correct (self-heal)
bad-stripe table entries when the next successful write occurs to that stripe unit.
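The build-number rule described above can be written as a small check. This is a minimal sketch that follows the section heading (an upgrade from build 8263 or earlier to build 8264 or later); the function and constant names are hypothetical, and the sketch does not query a controller for its firmware build.

```python
# Last firmware build that uses the legacy bad-stripe tracking method (from the text above).
LAST_LEGACY_BUILD = 8263


def should_synchronize_after_upgrade(old_build: int, new_build: int) -> bool:
    """Return True when the upgrade crosses from legacy tracking to the bad-stripe table."""
    upgraded_from_legacy = old_build <= LAST_LEGACY_BUILD
    upgraded_to_table = new_build > LAST_LEGACY_BUILD  # build 8264 or later
    return upgraded_from_legacy and upgraded_to_table


if __name__ == "__main__":
    # Each pair is (build before the upgrade, build after the upgrade).
    for old, new in [(8263, 8264), (8264, 15407), (7777, 8200)]:
        print(old, new, should_synchronize_after_upgrade(old, new))
```

An upgrade that stays within the legacy builds, or one that starts from build 8264 or later, does not cross the boundary, so the check returns False for it.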

You can start synchronizations by using the IBM ServeRAID Support CD, the
ServeRAID Manager installable application (from within the operating system), or
the arcconf task command.
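Because the synchronization must be started on each logical drive, it can help to generate one arcconf command per drive. The sketch below only builds the command lines and does not run them. Both the argument order and the use of verify_fix as the synchronization task name are assumptions based on general arcconf usage and are not confirmed by this section; the function name build_sync_commands is hypothetical. Confirm the exact task name with the arcconf help output for your controller.

```python
from typing import Iterable, List


def build_sync_commands(controller: int, logical_drives: Iterable[int]) -> List[List[str]]:
    """Return one argv list per logical drive (nothing is executed)."""
    commands = []
    for drive in logical_drives:
        if drive < 0:
            raise ValueError(f"logical drive number must be non-negative, got {drive}")
        # Assumed syntax (verify against your arcconf version):
        #   arcconf task start <controller> logicaldrive <number> verify_fix
        commands.append(["arcconf", "task", "start", str(controller),
                         "logicaldrive", str(drive), "verify_fix"])
    return commands


if __name__ == "__main__":
    # Print the commands for review; run them by hand after checking them.
    for argv in build_sync_commands(1, [0, 1]):
        print(" ".join(argv))
```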


ServeRAID-8 Series: Best Practices and Maintenance Information


Second Edition (September 2011)
Copyright IBM Corporation 2008, 2011


Part Number: 46M1375. Printed in USA.
