Summary of Contents for B6191-90015a

Page 1: ...Diagnostic IPR Media User s Guide PA RISC Computer Systems B6191 90015a June 1999 Copyright 1999 Hewlett Packard Company ...

Page 2: ... FAR 52 227 19 c 1 2 Use of this manual and flexible disc s compact disc s or tape cartridge s supplied for this pack is restricted to this product only Additional copies of the programs may be made for security and back up purposes only Resale of the programs in their present form or with alterations is expressly prohibited A copy of the specific warranty terms applicable to your Hewlett Packard ...

Page 3: ... Window System is a trademark of the Massachusetts Institute of Technology. MS-DOS and Microsoft are U.S. registered trademarks of Microsoft Corporation. OSF/Motif is a trademark of the Open Software Foundation, Inc. in the U.S. and other countries. Certification for conformance with OSF/Motif user environment pending ...

Page 5: ...PR Media 30 4 Support Tools Manager STM Running STM 34 Three Interfaces 35 System Map and Device Icons 36 System Map in xstm 36 System Map in mstm 37 System Map in cstm 38 Kinds of Support Tools 39 Menus and Commands 40 xstm Menus and Commands 40 mstm Menus and Commands 41 cstm Menus and Commands 45 Getting Result Information Logs 46 Remote Execution 47 Distributed Structure 48 Improving Performan...

Page 6: ...ocess Works 58 What HP Predictive Support Covers 58 LIF LOAD HP UX 9000 Series 800 and 700 Systems 60 Installing Online Support Tools from the Diagnostic IPR Media CD ROM 61 A Disk Copy Utility To make an image of a disk after install or upgrade HP UX 10 x to 11 x Operating Environment 72 Quick Start Instructions 73 Executing COPYUTIL 73 B EMS Hardware Monitors Enabling Hardware Monitoring 78 ...

Page 7: ...ncorporated at reprint do not cause the date to change The part number changes when extensive technical changes are incorporated New editions of this manual will incorporate all material updated since the previous edition Internal Date May 28 1999 HP Printing Division Systems Supportability Lab Hewlett Packard Co 19091 Pruneridge Ave Cupertino CA 95014 Printing History June 1999 Edition 1 ...

Page 9: ...nctions The following is a summary of the contents of the chapters in this manual Chapter 1 Diagnostic IPR Media product overview Chapter 2 Hardware support tools overview NOTE Chapter 2 Hardware Support Tools Overview provides a simplified approach to starting the hardware problem solving process using the tools provided on the Diagnostic IPR Media This is not intended as a comprehensive troubles...

Page 10: ...wing information in your message Title of the manual you are referencing Manual part number from the title page Edition number or publication date from the title page Your name Your company s name SERIOUS ERRORS such as technical inaccuracies that may render a program or a hardware device inoperative should be reported to your HP Response Center or directly to a Support Engineer Current Informatio...

Page 11: ...also the update format distribution media for the 9 04 and 9 07 HP UX systems Online diagnostics subsystem Support Tools Manager HP UX 10 x and 11 x EMS Hardware Monitors HP UX 10 20 and 11 x only HP Predictive Support tools Series 800 only LIF resident offline diagnostics IPR patches It is the platform for running offline diagnostics for all PA RISC systems The Diagnostic IPR Media is primarily i...

Page 12: ... of the online tools to be loaded onto your system including the following Online diagnostics subsystem Support Tools Manager HP UX 10 x and 11 x EMS Hardware Monitors HP UX 10 20 and 11 x only HP Predictive Support tools Series 800 only LIF resident offline diagnostics IPR patches In general terms the Diagnostic IPR Media is organized as follows details of organization for your particular media t...

Page 13: ...elp facility to assist users in getting started, determining what tools are available, how to run them, etc. For the HP-UX 10.01, 10.10, 10.20, 10.30, and 11.x releases, the Support Tools Manager (STM) diagnostic system is available. The STM diagnostic system is the diagnostic system used for information, verification, and diagnosis. The STM system provides a map of the system and lets you know what tools a...

Page 14: ...ications use the device No license is required to run the verifiers The STM exercisers are designed to stress devices in order to facilitate the reproduction of intermittent problems The exercisers on HP UX 9 X systems require a license to run The STM information modules create a log of information specific to one device including The product identifier A description of the device The hardware pat...

Page 15: ...ffline Diagnostics Environment is an offline support tools platform that is run from ISL and is available on series 700 800 and 900 machines ODE provides a common user friendly interface for diagnostics and utilities developed to run in this environment Diagnostics and utilities provided under ODE include MAPPER a utility for mapping out the physical layout of the SPU and its peripherals IOTEST a ...

Page 16: ...on may be very valuable later on in determining what action to take to isolate the problem cause The following key decisions drive the troubleshooting strategy as outlined in the flow diagram Will the OS boot This step determines whether or not the online versus offline support tools can be used If the OS cannot boot the offline tools are the only option If the OS can boot the recommendation is to...

Page 17: ...ot If the machine will not boot to ISL from the main disk or even if it will boot ISL but still won t boot the OS the user has little choice but to either start swapping suspect hardware using the error codes displayed on the front panel LEDs and console error messages for guidance or attempt to boot from the Diagnostic IPR Media This decision is likely to be ...
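The boot decision described above can be summarized as a small function. This is only an illustrative sketch: the function name and the wording of the returned steps are assumptions made for this example and are not part of the Diagnostic IPR Media.

```python
from typing import List


def next_steps(os_boots: bool) -> List[str]:
    """Return the options the text describes for a given boot outcome.

    Hypothetical helper: the step strings paraphrase the manual.
    """
    if os_boots:
        # The OS is up, so the online support tools can be used.
        return ["run the online support tools (STM)"]
    # The OS will not boot (whether or not ISL boots from the main disk):
    # the text leaves two choices.
    return [
        "swap suspect hardware, guided by front-panel LED codes and console messages",
        "boot the offline diagnostics from the Diagnostic IPR Media",
    ]
```

Note that the outcome is the same whether the machine fails before ISL or boots ISL but not the OS, which is why the sketch takes a single flag.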

Page 18: ...e tools should be used If the system can be booted to the OS the user has several online tools available to troubleshoot problems The following are the strategic uses for each set of tools 1 STM verifiers are useful primarily for finding reproducible problems that are causing a particular device to fail They will run a quick verification on selected devices and indicate whether they are basically ...

Page 19: ...nd a menu oriented interface for less experienced users In the command line interface users can select and run specific tests and or utilities In the menu oriented interface users select specific hardware modules to test and do not have to know which diagnostic is associated with a particular module NOTE ODE utilities like MAPPER and FUPDATE formerly UPDATE can only be run from the command line in...

Page 20: ...computer systems. For 64-bit systems like N-Class, there will be a different version of the offline diagnostic programs. The 64-bit version will have a "2" appended to its name. For example, the 64-bit version of MAPPER is MAPPER2. The ODE module TMMGR (TM Manager) will only be updated to support new 32-bit systems such as the J5000, J7000, C3000, and B1000. TMMGR will NOT be updated to support new 64-bit syste...
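The naming rule for 64-bit offline diagnostics can be written as a one-line helper. This is a sketch of the rule as stated in the text; the function name is hypothetical, and the text does not say that every module has a 64-bit counterpart (TMMGR, for instance, is 32-bit only).

```python
def ode_module_name(base_name: str, is_64bit: bool) -> str:
    """Return the ODE module name for a 32-bit or 64-bit system.

    Per the text, the 64-bit version has a "2" appended to its name.
    Whether a 64-bit version actually exists for a given module is
    not checked here.
    """
    return base_name + "2" if is_64bit else base_name
```

For example, `ode_module_name("MAPPER", True)` yields the MAPPER2 name given in the text.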

Page 21: ...UN Run a module after setting desired environment variables Control Y Control C Abort an ODE command pause a module run RESUME Restart a paused module DISPLOG After running a module display contents of a log EXIT Return to next higher level prompt Environmental Variables SHOWSTATE Display the value of the following environment variables LOOP Run a test this many times ERRPRINT ON OFF Print low lev...

Page 22: ... non interactively 8 If you wish to run interactively type the following at the ODE prompt ODE module_name This command loads the module from LIF into memory and initializes it displaying the module_name prompt MODULE_NAME To run interactively type help for a list of commands which are valid for use with selected module MODULE_NAME help 9 To run non interactively type the following at the ODE prom...

Page 23: ...ns Following is a brief summary of each screen System Screen The system screen is the first screen displayed to the user It provides a map of all hardware modules in the system and allows the user to select and test sets of modules Test Screen A test screen displays a list of tests for a particular module It allows the user to select and run a set of tests for a specific module Logging Screen The ...

Page 24: ...ted 0 Command The system screen shows a map of the computer system displaying a short description of each HP architected module its architected path and its status The status field gives information about the test state of the hardware module The status field may have the following values N A The TM for this module is not available Either the TM does not exist on the LIF volume or the TM is unable...

Page 25: ...ompleted 0 Command The test screen displays all the tests for a selected module and indicates which ones are selected for testing By default not all tests may be tagged For example tests that require special loopback connectors may not be selected by default Therefore if one wants to ensure that a specific set of tests is run for a particular module one should enter the test screen for that module...

Page 26: ...nsole text rows 24 SCROLL Activates scrolling during screen redraws ON TERM Terminal type UNKNOWN LOOP Number of times to loop test execution 1 Page 1 of 1 Command The environment screen displays each environment variable an associated description and its current state Use the HELP command to get more online information about each environment variable Environment Variables This section gives a bri...

Page 27: ...les will be executed If test execution is launched from a test screen the LOOP state indicates how many times the tagged tests will be executed TERM TERM HP UNKNOWN The TERM environment variable indicates the current type of the console Currently only HP serial terminals are fully supported by TMMGR If one is using a non HP terminal or a graphics display TERM should be set to UNKNOWN ERRCOUNT ERRC...

Page 28: ...modify all environment variables SELECT SELECT num is an integer indicating any valid selection number range is a range of selection numbers of the form A B A B Use this command to select a group of modules to test or a group of tests to execute A group of modules or tests is specified by explicitly specifying a list of selection numbers and or a range of selection numbers All tests or testable mo...

Page 29: ...dules HELP HELP name name is the name of a valid command for the current page in the current screen Use this command to display help information for the various commands valid for the current page in the current screen Executing this command with no parameter will display a short summary of each valid command INFO INFO num NOTE num is a valid module or test selection number Use this command to dis...

Page 30: ...s the machine you are loading PDC update script onto the default is the machine on which you are running the Diagnostic IPR Media 4 Enter the appropriate information in the Depot field this is the media that is the tape or CD ROM from which you are loading the PDC update script 5 Move the cursor to Change Software View and select it 6 Select Products 7 Move the highlight bar to PROC_FIRMWARE 8 Mov...

Page 31: ... ISL or not If the system prompts you for such information you should type y Next you should see the ISL prompt 21 At the ISL prompt you should type ODE and hit Return 22 At the ODE prompt you should type README The README utility will display an index indicating which image files on the LIF are for which systems along with the versions associated with those image files 23 If you would like to see...

Page 32: ...32 Chapter3 Using the Diagnostic IPR Media to Run Offline Diagnostics Updating Processor Firmware Using the Diagnostic IPR Media ...

Page 33: ...as a verifier, exerciser, or expert tool on the selected device(s). Results are displayed on the system map. NOTE: For the most current information on Support Tools, see our Web site, Systems Hardware Diagnostics and Monitoring, at http://www.docs.hp.com/hpux/systems. This Web site also contains additional documents such as tutorials, quick reference guides, and release information. This chapter introduces STM a...

Page 34: ...han the version running on the system you are connecting from 3 Select one or more devices from the system map that is displayed 4 Choose a support tool for example a verifier to run on the selected device s 5 Results appear on the system map for example on xstm a green icon indicates that a device successfully passed the test 6 If the device fails see the device Failure Log for the cause of the f...

Page 35: ...es and resources If possible use xstm xstm X Window graphical X Window graphics terminals or workstations Command usr sbin xstm or usr sbin stm ui bin stm x mstm menu based Non graphics terminals Command usr sbin mstm or usr sbin stm ui bin stm m cstm command line For running scripts Command usr sbin cstm or usr sbin stm ui bin stm c Displays from the three interfaces are shown in the rest of this...
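The choice among the three interfaces can be sketched as follows. The function name and its two flags are hypothetical. The command paths are the first alternative listed for each interface, with the slashes (lost in this extraction) restored on the assumption that they are ordinary absolute paths; the second alternative for each interface is left out because its exact spelling cannot be recovered here.

```python
def pick_stm_interface(has_x_display: bool, scripted: bool) -> str:
    """Pick an STM interface command following the guidance in the text."""
    if scripted:
        # cstm: command-line interface, intended for running scripts.
        return "/usr/sbin/cstm"
    if has_x_display:
        # xstm: X Window graphical interface; preferred when possible.
        return "/usr/sbin/xstm"
    # mstm: menu-based interface for non-graphics terminals.
    return "/usr/sbin/mstm"
```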

Page 36: ...ct the devices to test After a test runs the system map displays the results For example a device icon in xstm is green for Successful and red for Failure System Map in xstm In xstm the system map is composed of device icons see Figure 4 1 You left click on a device icon to select that device and unselect all others Do a Control left click on a device icon to toggle the select of that device and l...

Page 37: ...erify Query Pending 8 16 5 2 0 SCSI Disk Information Successful 8 16 6 LAN Interface Exercise Abort Pending 8 16 7 Built in Keyboard Mouse Information Successful 8 20 Core I O Adapter Information Successful 8 20 2 RS 232 Interface Information Successful 8 20 5 EISA Adapter Information Successful 10 Bus Adapter Information Successful 32 CPU Exercise Successful 34 CPU Exercise Successful 49 MEMORY E...

Page 38: ...t Active Tool Status 1 8 Bus Adapter Information Successful 2 8 0 Bus Adapter Information Successful 3 8 0 0 NIO Terminal Multiplexor Information Successful 4 8 4 Fast Wide SCSI Interface Information Successful 5 8 4 3 0 SCSI Disk Exercise Aborted 6 8 4 4 0 SCSI Disk Verify Successful You select devices by using the select command with a device number or path modifier A minus sign in front of a pa...

Page 39: ...the device or subsystem This function is useful in providing very high confidence verification and in detecting intermittent errors Firmware Update Initiates the firmware update process for a selected device While the user interface to the firmware update tools is generic the tools themselves are device specific Expert Tools Are device specific troubleshooting utilities for use by sophisticated us...

Page 40: ...tm and cstm interfaces all have different ways to display menus and accept commands from the user The following subsections detail the xstm mstm and cstm menus and commands xstm Menus and Commands In xstm commands are accessed by means of pull down menus Figure 4 4 xstm Menus and Commands ...

Page 41: ... Menus and Commands mstm Menus and Commands In mstm you traverse screens and menus and select commands from pulldown menus which are similar to those found in xstm Figure 4 5 mstm Menu Bar and Softkeys Figure 4 6 mstm Pulldown Menu Example ...

Page 42: ... Bar The following table summarizes the use of the menu bar Table 4 1 Menu Bar Navigation To do this Do this Position the cursor on the menu bar Use the Tab key or the Menu Bar on off function key Move to a pulldown menu Use the cursor arrow keys Expand a menu sub menu Use the Return key Highlight a command or sub menu Use the cursor keys Perform a command Use the Return key Invoke a menu directly...

Page 43: ...Chapter 4 43 Support Tools Manager STM Menus and Commands Figure 4 7 mstm Menus and Commands ...

Page 44: ... Device Tools Utilities a all devices v verify l logtool d disks e exercise m memory i information p processors s current device status t tapes Other Shortcut Keys this help page cr execute shortcut key commands and exit from this help page back space delete the last shortcut key entered and undo its selection Notes The first device selection will unselect all currently selected devices Subsequent ...

Page 45: ...ssd sysactlog sal resetsysactlog rsa daemonstartup dsu localmaplog lml daemonshutdown dsd localsysactlog lsal daemonkill dk syslog sl daemonactlog dacl os os map map exit ex Tools Tools Continued Options information info fwupdatefaillog ffl infooptions iop infolog il fwupdateinfo finf veroptions vop infoactlog ial experttool xt diagoptions dop infofaillog ifl expactlog xal exeroptions eop infoinfo...

Page 46: ... a message identifying likely causes for the failure and suggesting possible actions Figure 4 8 shows a sample Failure Log in xstm Figure 4 8 Sample Failure Log in xstm If a test result is anything other than Successful or Failure look at the Test Activity Log for the device For example if a test results in an Incomplete status the Test Activity Log will explain whether the problem is due to mallo...
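The rule for which log to consult after a test can be expressed as a short lookup. This is a sketch only: the function name is hypothetical, and the status strings are the ones that appear in the text.

```python
from typing import Optional


def log_to_check(result: str) -> Optional[str]:
    """Return the log the text says to read for a given test result."""
    if result == "Successful":
        # Nothing further to investigate.
        return None
    if result == "Failure":
        # Likely causes and suggested actions are recorded here.
        return "Failure Log"
    # Any other status (for example "Incomplete") is explained here.
    return "Test Activity Log"
```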

Page 47: ...e machines See Figure 4 9 for a display of possible connections The computer running the user interface is the UI system and the computer running the support tools is the Unit Under Test or UUT As always you can run the user interface and support tools on the same machine In this case the UI machine and the UUT are one and the same Figure 4 9 Possible UI and UUT Connections ...

Page 48: ...erlies them The UI also contains the text to be displayed message catalogs and help volumes The Unit Under Test UUT contains the binaries for the support tools Diagnose FW Update Exercise Expert etc and the libraries which support them Improving Performance This distributed design makes for good performance This is because data and code reside on the machine that makes use of them The UI system ha...

Page 49: ... Use the Help menu at the far right of the menu bar. mstm: Press the Help function key. cstm: Enter the command help. Kinds of Help: Help for the main STM user interface covers the following. On Item: Context-sensitive information on parts of the interface (xstm only). On Tasks: Cookbook procedures for performing common system tasks using STM. On Application: General information about the Support Tools Manager. O...

Page 50: ...50 Chapter4 Support Tools Manager STM Getting Help Figure 4 11 Sample Online Help in xstm for FW Update Tool ...

Page 51: ...s stopped executing Only rarely would the tool stop executing Typically this behavior would be caused by a kernel driver that has stopped responding or something of that nature The tool may eventually start executing again if it becomes unblocked at which time its state will be changed back to Running You may choose to abort the tool at this point If the tool is blocked due to a kernel resource su...

Page 52: ...g mstm file admin local uut logs sys act log xstm File Administration Local UUT Logs System Activity Logs A device in the STM map is Unknown or its icon is blank This problem typically occurs if the driver that is associated with the device is not recognized by STM Please report all unknown devices through STARS Include the following information The contents of the Scan HW Log The output of the De...

Page 53: ...lable in subsequent releases Some tools may not be available because they require a license Currently the only tools that require a license are expert tools To see a list of the tools installed for a device and the license level if any required to run the tool use the Device Current Device Status command The following is a display for a SCSI disk device Installed tools Diagnostic None Verifier dis...

Page 54: ...54 Chapter4 Support Tools Manager STM Common Problems ...

Page 55: ...10 20 10 30 and 11 x HP UX systems it is also the update format distribution media for the 9 04 and 9 07 HP UX systems Online diagnostics subsystem STM Support Tools Manager HP UX 10 x and 11 x EMS Hardware Monitors HP UX 10 20 and 11 x only HP Predictive Support tools Series 800 only LIF resident offline diagnostics IPR patches This chapter summarizes the hardware support products listed above th...

Page 56: ...pport Tools Manager STM on HP UX 10 x and 11 x systems provides a simple interface to online diagnostics and support tools For information on STM see Chapter 4 Support Tools Manager STM in this manual The Web page for HP 9000 3000 Systems Hardware Diagnostics and Monitoring at http www docs hp com hpux systems see the topics under Diagnostics The manpages xstm 1M mstm 1M and cstm 1M ...

Page 57: ...20 or 11 x IPR 9902 or later 9902 refers to the February 1999 release Hardware event monitoring provides a high level of protection against system hardware failure By using hardware event monitoring you can virtually eliminate undetected hardware failures that could interrupt system operation or cause data loss The EMS Hardware Monitors are installed with the Support Tools Manager Once the monitor...

Page 58: ...ort sends any information on potential problems to the HP Response Center 2 HP Response Center analyzes the problem An HP Response Center Engineer RCE analyzes the data and adds it to the system s history If further action is needed the RCE may ask the system administrator to perform remote diagnostics or may forward the information to the HP Account CE 3 HP Account CE may provide backup support T...

Page 59: ...Support The inventory of software and patches installed on your systems is tracked by the PROACTIVE patching utility The HP Response Center Engineers use this information to recommend patches for customers with high end support agreements such as Personalized System Support or Business Continuity ...

Page 60: ...pported IODC tests on modules PERFVER PERFVER is an ODE based test module that runs supported IODC tests on devices The LIF LOAD product is structured as follows SD product LIF LOAD for Series 800 and 700 systems After the LIF LOAD product is installed the ODE tools will be available at the ISL level at the next reboot of the computer system For further information about ODE refer to Chapter 3 Usi...

Page 61: ...d to move the new kernel into place and reboot Move the new kernel as indicated and reboot your system The dart32 CD ROM should now be accessible You may now execute the procedure immediately following the next note NOTE For further information on installing Predictive Support tools using the terminal version of swinstall see the HP Predictive Support UX User s Guide part number H2571 90008 NOTE A...

Page 62: ...OM has been mounted by entering the mount command and look for diagtemp The following is a sample output diagtemp on dev dsk c0t3d0 ro on Thu Mar 25 15 46 16 1999 6 Determine which patches must be installed first before you install the diagnostics Without these patches some versions of the Support Tools cannot be installed or will not operate correctly See the section Required Patches in the DIAGN...

Page 63: ...y you can obtain the patches through the HP Electronic Support Centers For Americas and Asia Pacific http us support external hp com For Europe http europe support external hp com Log onto the appropriate URL and follow the directions given One problem with loading individual patches from these patch machines is that a system reboot is required for every patch that requires a reboot for example pa...

Page 64: ...64 Chapter5 Using the Diagnostic IPR Media to Install Diagnostics on Your System Installing Online Support Tools from the Diagnostic IPR Media CD ROM Figure 5 1 Specify Source Window ...

Page 65: ...agnostic IPR Media to Install Diagnostics on Your System Installing Online Support Tools from the Diagnostic IPR Media CD ROM 10 Click on the OK button The Software Selection Window appears Figure 5 2 Software Selection Window ...

Page 66: ...icate that the highlighted product is to be installed Repeat the selection process until all the desired products have been marked for installation It is not necessary to select EMS Config and EMS Core products since these are automatically installed when you install Support Tools Manager for HP UX 10 20 or 11 x IPR 9902 or later Most users select the offline diagnostics LIF LOAD and the Support T...

Page 67: ...tem Installing Online Support Tools from the Diagnostic IPR Media CD ROM Figure 5 4 Marking Products to Be Installed 12 To install the selected products use the Actions menu option Install analysis to begin the analysis phase of the installation Figure 5 5 Install Analysis Window ...

Page 68: ...rs Figure 5 6 First Confirmation Window 14 Click the Yes button to continue with the installation A second Confirmation Window appears for 10 20 systems This window does not appear for 11 x systems since a reboot is not required Figure 5 7 Second Confirmation Window 15 Click the Yes button to continue with the installation 16 Once installation is complete unmount the CD ROM with the umount command...

Page 69: ...should enable the EMS hardware monitors (the monitors are automatically enabled on June 1999 and later releases). Follow this procedure to enable the monitors: a. Run the monitoring request manager by typing /etc/opt/resmon/lbin/monconfig b. From the main menu selection prompt, enter (E)nable Monitoring c. See the EMS Hardware Monitors User's Guide and other monitor documentation for information about confi...

Page 70: ...70 Chapter5 Using the Diagnostic IPR Media to Install Diagnostics on Your System Installing Online Support Tools from the Diagnostic IPR Media CD ROM ...

Page 71: ...can be used to make a system disk image so that the system can be dropped back to 10 x if for some reason 11 x is not desired The utility uses two commands BACKUP and RESTORE to perform the copying of information between devices Both commands in effect perform the same function copying However the BACKUP command copies data from disk to tape and can tolerate read errors but will halt on a write er...

Page 72: ...veral higher level commands like LOG HELP and so forth COPYUTIL is a TM running under ODE This means COPYUTIL must first satisfy all of ODE s requirements Since COPYUTIL is not a diagnostic all the hardware except the disk drive must be working properly and the disk drive can only have software read errors that will not cause the system to halt NOTE An online version of COPYUTIL running under STM ...

Page 73: ...l not be liable for any damages resulting from the use of this program Version A 00 22 Please wait while I scan the device busses Ty Indx Path Product ID Bus Size Rev D 0 2 0 1 0 0 QUANTUMLPS270S disc drive SCSI 258 MB 5909 D 1 2 0 1 1 0 SEAGATEST3600N disc drive SCSI 499 MB 9686 D 2 2 0 1 3 0 HPC2244 disc drive SCSI 1 0 GB 0B04 D 3 2 0 1 4 0 QUANTUMLPS525S A2565A disc drive SCSI 499 MB 3100 T 4 2...

Page 74: ...eted 40 completed 50 completed 60 completed 70 completed 80 completed 90 completed 100 completed COPYUTIL You can now exit the program switch off the power replace the bad disk and start the program again COPYUTIL exit Replace the SUPPORT MEDIA now if you removed it earlier ODE Assume you have restarted the program and it looks just like the last time You now want to restore the data back onto the...

Page 75: ...Restored Successful COPYUTIL You have now restored the data onto your new disk drive You may wish to verify that the data on the tape and the disk are the same You now have restored data and can exit the program and power cycle the machine Remember that if the BACKUP command could not read a block the RESTORED block is all nulls ...
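The closing reminder about unreadable blocks can be illustrated with a toy model. This is not how COPYUTIL is implemented: the block size, the function names, and the decision to substitute nulls at backup time rather than at restore time are all assumptions made for this sketch. It only shows the observable effect the text describes, namely that a block BACKUP could not read comes back from RESTORE as all nulls.

```python
from typing import Callable, List

BLOCK_SIZE = 512  # hypothetical block size, chosen for the example


def backup(read_block: Callable[[int], bytes], n_blocks: int) -> List[bytes]:
    """Copy blocks into an in-memory 'tape', tolerating read errors."""
    tape = []
    for index in range(n_blocks):
        try:
            tape.append(read_block(index))
        except IOError:
            # Read error: keep going and record a block of nulls instead.
            tape.append(bytes(BLOCK_SIZE))
    return tape


def restore(tape: List[bytes]) -> List[bytes]:
    """Return the blocks as they would be written to the new disk."""
    return list(tape)
```

After a restore, comparing the restored blocks against `bytes(BLOCK_SIZE)` is one way to spot candidates for blocks that were lost, although a block that legitimately contained only nulls would look the same.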

Page 76: ...76 AppendixA Disk Copy Utility To make an image of a disk after install or upgrade HP UX 10 x to 11 x Quick Start Instructions ...

Page 77: ...dware failure By using hardware event monitoring you can virtually eliminate undetected hardware failures that could interrupt system operation or cause data loss NOTE Complete Information For complete information on installing and using EMS hardware event monitors as well as a list of supported hardware refer to the EMS Hardware Monitors Users Guide An electronic copy of this book is included on ...

Page 78: ...sent to SYSLOG Serious and Critical events sent to EMAIL address root The Hardware Monitoring Request Manager etc opt resmon lbin monconfig can be used to customize the monitoring requests and add new ones At the time of publication monitors are provided to support the following HP Disk Arrays Fibre Channel Interconnect Fibre Channel Interface Cards High Availability Storage System Enclosures SCSI...

Page 79: ...Appendix B 79 EMS Hardware Monitors Enabling Hardware Monitoring ...

Page 81: ...ing 60 menu command user interface 23 running the ODE command line interface 21 Test Module Manager TMMGR 23 online tools 33 getting result information 46 installing 61 logs 46 tool types 18 39 Operating System OS will it boot 16 P PERFVER 60 Predictive Support 11 58 processor firmware updating 30 S support tools installing 55 overview 13 usage 16 17 Support Tools Manager STM 14 33 56 blank icon 5...
