background image

38

Chapter 3

Teststation

Teststation file system

Teststation file system

The /spp directory located on the workstation’s local disk (HP-UX version
10.20) contains all the necessary files necessary for the workstation to
function as the V-Class server’s teststation.

 Figure 10

Teststation file system

/spp/etc

The /spp/etc directory contains many of the unique daemons which run
on the teststation. These daemons provide services in the management of
the V-Class node. Two daemons that are always running on the
teststation are:

cmd

The complex configuration management daemon that
builds the configuration file /spp/data/node_0.cfg.

conserver

The console-server that directs RS-232 console traffic
from the Utility Board to the upper right console
window on the teststation.

/spp

etc

(spp specific daemons)

bin

(compiled executables)

scripts

(executable scripts)

data

(log and config files)

firmware

(firmware directory)

est

(support for scan testing)

man

(Man pages for /spp commands)

unsupported (unsupported utilities)

Summary of Contents for 9000 V-Class

Page 1: ...Operator s Guide HP 9000 V Class Server First Edition A3725 96005 Customer Order Number A3725 90005 October 1997 Printed in USA ...

Page 2: ...to this material including but not limited to the implied warranties of merchantability and fitness for a particular purpose Hewlett Packard shall not be liable for errors contained herein or for incidental or consequential damages in connection with the furnishing performance or use of this material FCC Notice This equipment has been tested and found to comply with the limits for a Class A digita...

Page 3: ... the user will be required to correct the interference at his own expense The user is cautioned that changes or modifications not expressly approved by the party responsible for compliance Hewlett Packard could result in the equipment being noncompliant with FCC Class A requirements and void the user s authority to operate the equipment Do not connect external equipment to the utility outlets in t...

Page 4: ......

Page 5: ...logic bus 8 Node configurations 9 Shared memory 10 2 Indicators switches and displays 13 Operator panel 14 Key switch panel 15 Key switch 15 DC ON LED 15 TOC 16 CDROM drive 17 Busy indicator 17 Eject button 17 Emergency eject 18 Optional DAT drive 19 LEDs 19 Eject button 20 System Displays 21 LCD liquid crystal display 22 Attention light bar 24 3 Teststation 25 Teststation features 26 Teststation ...

Page 6: ...nsole s connection 37 Accessing system logs 37 Teststation file system 38 spp etc 38 spp bin 39 spp scripts 39 spp data 39 spp firmware 40 spp est 40 spp man 40 4 Firmware OBP and PDC 41 Boot sequence 42 Boot process output 44 HP mode boot menu 46 Enabling Autoboot 49 Syntax 49 Examples 50 HElp command 51 Syntax 51 Examples 51 5 Starting and stopping HP UX 53 Terms 54 Starting your HP 9000 V Class...

Page 7: ...tion 68 Performance problems 69 System hangs 70 System panics 72 Peripheral problem 73 Interface card and system problem 74 File system problem 75 LAN communication problem 75 Logical Volume Manager LVM related problem 76 Recovery from other situations 76 Rebooting the system 77 Monitoring the system after a system panic 77 Abnormal system shutdowns 79 Fast dump 79 Overview of the dump and save cy...

Page 8: ...e kernel 89 Using HP UX commands to configure dump devices into the kernel90 Runtime dump device definitions 91 The etc fstab file 92 The crashconf command 92 Dump order 93 What happens when the system crashes 93 Operator override options 94 The dump 94 The reboot 95 savecrash processing 95 Dual mode devices dump swap 95 What to do after the system has rebooted 95 Using crashutil to complete the s...

Page 9: ...able 1 Node configurations 9 Table 2 Indicator LED operation 19 Table 3 Processor status codes 23 Table 4 POST status codes 23 Table 5 HP UX processor status codes 24 Table 6 sppconsole commands 34 Table 7 Boot menu commands 47 ...

Page 10: ...x List of Tables ...

Page 11: ... interconnection 7 Figure 3 Operator panel 14 Figure 4 Key switch panel 15 Figure 5 CDROM drive 17 Figure 6 DDS 3 DAT drive front panel 19 Figure 7 System displays 21 Figure 8 Front panel LCD 22 Figure 9 Teststation sppuser windows 28 Figure 10 Teststation file system 38 Figure 11 Boot process 43 ...

Page 12: ...xii List of Figures ...

Page 13: ...lls and data structures and types In command examples monospace identifies command output including error messages italic In paragraph text italic identifies titles of documents In command syntax diagrams italic identifies variables that you must provide The following command example uses brackets to indicate that the variable output_file is optional command input_file output_file Brackets In comm...

Page 14: ...y to avoid damage to equipment damage to software loss of data or invalid test results Horizontal ellipses In command examples horizontal ellipses show repetition of the preceding items Vertical ellipses Vertical ellipses show that lines of code have been left out of an example Keycap Keycap indicates the keyboard keys you must press to execute the command example ...

Page 15: ...o 16 HP PA 8200 Precision Architecture Reduced Instruction Set Computer PA RISC processors and can support memory configurations of 256 Mbytes to 16 Gbytes This chapter provides a block diagram of the V Class server and discusses its features and configurations ...

Page 16: ...e refer to that document for detailed information about the features of the PA 8200 This Operator s Guide does not attempt to duplicate information in that manual Instead it presents only V Class server specific information The processors of the system are supported by several Application Specific Integrated Circuits ASIC hardware controllers an enhanced memory system and a high bandwidth I O subs...

Page 17: ...Agent Controllers EPACs An EPAC allows the processor and the I O subsystem the Exemplar PCI bus Interface controller EPIC access to the Hyperplane crossbar Also connected to the Hyperplane crossbar are the Exemplar Memory Access controllers EMAC Up to two processors are located on each EPAC Memory is controlled by the EMAC Input and output devices connect to the node through EPIC which is connecte...

Page 18: ...y I O Hyperplane crossbar ERAC Proc I O I O I O I O I O EPAC agent Proc EPAC Proc agent EPAC agent Proc EPAC Proc agent EPAC Proc agent agent EPAC Proc Proc Proc EPAC Proc agent EMAC Memory control access Memory Memory access control EMAC Memory EMAC control access Memory Memory EMAC Memory access control Memory Memory control access EMAC Memory EMAC control access Memory Memory EMAC Memory access...

Page 19: ... an I O only interface The EPAC has the following buses Runway bus 0 1 Two each 64 bit bidirectional buses for processor 0 and processor 1 respectively These buses have a raw bandwidth of 960 Mbytes sec Hyperplane crossbar port bus 0 1 Four 32 bit unidirectional buses connected to two Hyperplane crossbar ERACs two in each direction These buses have a total raw bandwidth of 1 9 Gbytes sec I O port ...

Page 20: ...four ERACs that provide an interconnect for each processor and I O device to memory Each of the four ERACs has the following buses EPAC Port A B C D Eight 32 bit unidirectional interfaces to four EPAC ports four in each direction Each port has simultaneous input and output bandwidth of 960 Mbytes sec EMAC Port A B C D Eight 32 bit unidirectional interfaces to four EMAC ports four in each direction...

Page 21: ...dom Access Memory SDRAM The EMAC has the following buses ERAC Port A B Four unidirectional 32 bit interfaces two in each direction This interface supports a total simultaneous read write bandwidth of 1 9 Gbytes sec Even Memory Bidirectional 88 bit interface to the even memory banks associated with the EMAC Odd Memory Bidirectional 88 bit interface to the odd memory banks associated with the EMAC E...

Page 22: ...AC which forwards the response to the EPAC Utilities board and core logic bus The ECUB or Utilities board connects to the core logic bus and contains two field programmable gate arrays FPGAs the EPUC and EMUC The EPUC allows processors access to the system core logic and booting firmware and the EMUC processes the environmental state of the node and interrupts the processors when appropriate V Cla...

Page 23: ...sor agents Memory boards Total memory EMB I O chassis 4 2 2 256 1 4 2 2 512 1 4 2 2 1024 1 4 2 4 2048 1 8 4 4 512 1 or 2 8 4 4 1024 1 or 2 8 4 4 2048 1 or 2 8 4 8 4096 1 or 2 12 6 8 1024 1 2 or 3 12 6 8 2048 1 2 or 3 12 6 8 3072 1 2 or 3 12 6 8 4096 1 2 or 3 16 8 8 1024 1 2 3 or 4 16 8 8 2048 1 2 3 or 4 16 8 8 3072 1 2 3 or 4 16 8 8 4096 1 2 3 or 4 ...

Page 24: ...l memory references linearly ascending physical address to shared memory are interleaved across up to eight memory boards on a 32 byte basis See V Class Architecture for information regarding memory interleaving With all processor references to memory copies of the accessed data are encached into either the instruction or data caches of each processor If the processor making the memory reference m...

Page 25: ... write data store into a cache line the cache line must be owned exclusively by the processor This implies that any other copies must be invalidated Modified cache lines must be written back to memory from the cache before being overwritten ...

Page 26: ...12 Chapter1 Overview Shared memory ...

Page 27: ...Chapter 2 13 2 Indicators switches and displays This section describes indicators switches and displays of the HP 9000 V Class server ...

Page 28: ...of the server and contains the key switch panel CDROM drive optional DAT tape drive and the LCD display Figure 3 shows the location of the operator panel and its components Figure 3 Operator panel 9 24 97 IOEXS094 TOC DC ON DC OFF CON SOL E ENA BLE CON SLO LE SEC URE CDROM drive Key switch panel LCD display Optional DAT drive ...

Page 29: ...y switch The key switch has four positions Placing the key switch in any position other than DC OFF powers up the V Class server DC OFF DC power is not applied to the system Placing the keyswitch in this position is the normal method for turning off power to the system ON DC power is applied to the system POST Power On Self Test begins executing and brings up the system from an indeterminate state...

Page 30: ...16 Chapter2 Indicators switches and displays Operator panel TOC The TOC Transfer Of Control button is a recessed switch that resets the system ...

Page 31: ...indicator The busy indicator LED flashes to indicate that a read operation is occurring CAUTION Do not push the eject button while this LED is flashing If you do the operation in progress is aborted and the CDROM is ejected possibly causing a loss of data Eject button Push the eject button to open the CD tray to remove CDROMs from the drive 9 18 97 IOEXS096 Headphone jack Volume control Busy indic...

Page 32: ...18 Chapter2 Indicators switches and displays Operator panel Emergency eject Insert a paper clip into the emergency eject hole to open a jammed CD tray ...

Page 33: ...3 DAT drive front panel LEDs The two LEDs provide operating information for normal as well as error conditions Table 2 shows the meaning of the different LED patterns Table 2 Indicator LED operation Activity LED Attention LED Eject button Tape Clean Tape Activity LED green Clean Attention LED amber Meaning Flashing slowly Off A load or unload of a cartridge is in progress Flashing rapidly Off A ca...

Page 34: ...tape 3 The tape is then rewound to Beginning of Media BOM unthreaded from the mechanism and ejected CAUTION Do not push the eject button while the LED is flashing If you do the operation in progress is aborted and the cartridge is ejected possibly causing a loss of data Any Flashing slowly Media caution signal Indicates that a cartridge is near the end of its life or that the heads need cleaning A...

Page 35: ...ays System Displays The V Class servers provide two means of displaying status and error reporting an LCD and an Attention light bar Figure 7 System displays 9 18 97 IOLM010 TOC DC ON DC OFF CON SOL E ENA BLE CON SLO LE SEC URE LCD display Attention light bar ...

Page 36: ...utput to the LCD POST is described in the HP Diagnostics Guide S Class X Class and V Class Servers The following explains the output shown in Figure 8 The first row displays the serial number of the system The second row displays processor status The third row displays the revision of POST and the POST status codes The forth row displays which processor is in control While HP UX is running the LCD...

Page 37: ...nel I Idle F FORTH OBP S Server U User E Emulator M Monarch D Installed but not responding during POST d User disabled Processor down Code Meaning a Monarch CPU is selected b POST code checksum c Test controller code checksum e Configuration parameter checksum Rebuilding configuration parameter f Starting core logic hardware initialization g EPUC EMUC CSR initialization h Core logic SRAM pattern t...

Page 38: ...he front of the HP 9000 V Class server as shown in Figure 7 on page 21 This light bar displays system status in three ways Off system powered down Steady on system powered up Flashing error condition j Core logic SRAM address uniqueness test k Zero filling core logic SRAM l Starting C runtime environment Code Meaning Code Meaning F01F The processor is least busy F05F The processor is more busy F0A...

Page 39: ...d workstation that serves as a teststation for the V Class server An ethernet bus called the test bus connects the teststation to the utilities board located in the V Class server chassis A teststation is required to enable service personnel to verify and troubleshoot the V Class system ...

Page 40: ... workstation as a console and the teststation software Product number B6044AA This option provides the special hardware and teststation software to connect an existing workstation as a console There are two ethernet LAN cards installed within the teststation one is on the core I O card and the other is a plug in LAN card Additional software which provides all the necessary functions of the teststa...

Page 41: ...nmental and system level events Configuration of hardware and boot parameters Booting the Operating System The teststation is connected directly to the V Class server via a dedicated LAN connection as well as a special RS 232 serial connection Together the LAN and the serial connection provide control status and HPMC error information to the teststation where operators and CEs may view status ...

Page 42: ...mer the new passwords must be supplied to the Hewlett Packard Customer Engineer CE upon request Teststation sppuser windows When the sppuser is logged on to the teststation the windows appear in the configuration shown in Figure 9 Figure 9 Teststation sppuser windows Message window Console window sppuser tsh shell sppuser tsh shell Commands and scripts executing on the teststation ccmd daemon stat...

Page 43: ...for the V Class server All POST Power On Self Test status is displayed here The user can type commands while in HP mode boot menu to boot and configure the node See Chapter 4 Firmware OBP and PDC for more information about HP mode boot menu HP service personnel can also enter a special mode called forth mode OBP to perform special configuration commands tcsh shell windows tcsh The tsh windows LOWE...

Page 44: ...d Step 3 Release the mouse button to select the option The Root Menu options include sspdsh window Creates a spp diagnostic shell window on your screen This should be used by service personnel only ksh window Creates a new ksh window on your screen test station console Creates a new teststation window on your screen sppconsole menu Creates a new sppconsole window by selecting node 0 complex from t...

Page 45: ...tstation Teststation logons shuffle down Moves the top window in a stack of windows to the bottom refresh all Refreshes your entire X display restart Stops and restarts mwm logout Closes all open windows and stops mwm ...

Page 46: ...dies or stops running restart it from the teststation using one of the following methods Using the Root Menu To restart the console using the Root Menu complete the following steps Step 1 Press and hold down any mouse button over the root window The root window is any part of your display that is not covered by a window The Root Menu appears Step 2 Select sppconsole menu The menu displays Step 3 S...

Page 47: ...owing steps Step 1 Press and hold down any mouse button over the root window The root window is any part of your display that is not covered by a window The Root Menu appears Step 2 Select logout The teststation closes all open windows stops mwm and returns a HP UX login prompt Step 3 Log into the teststation as sppuser The new sppconsole window displays ...

Page 48: ...ssed simultaneously The e does not have to be an uppercase E Example Performing a E command To execute the Ecf command complete the following steps 1 Press the Cntrl key and the e key simultaneously 2 Release the Cntrl key and the e key 3 Press the c key 4 Press the f key Command Description Ecw Display a list of other users connected to the console Ecf Force control of the console interface Ecs R...

Page 49: ...g steps Step 1 Remotely log in to the teststation as sppuser default password spp user with the following command rlogin hostname login sppuser Password spp user Step 2 Access the system console with the following command spp scripts sppconsole At this point you are in spy mode meaning that you can only monitor what is going on at the system console If you try to enter a command the following mess...

Page 50: ... spp user with the following command rlogin hostname login sppuser Password spp user Step 2 Access the system console with the following command spp scripts sppconsole At this point you are in spy mode meaning that you can only monitor what is going on at the system console If you try to enter a command the following message is displayed read only use Ecf to attach Ec for help Step 3 Assume contro...

Page 51: ...ive console window to a watch window enter CTRL Ecs Accessing system logs You can monitor system status via two logs event_log and consolelog located in spp data on the teststation The event_log file periodically logs system status Once the file reaches 4MB the system renames it to event_log old and creates a new event_log file The consolelog file logs data from the sppconsole window When the file...

Page 52: ...e daemons provide services in the management of the V Class node Two daemons that are always running on the teststation are cmd The complex configuration management daemon that builds the configuration file spp data node_0 cfg conserver The console server that directs RS 232 console traffic from the Utility Board to the upper right console window on the teststation spp etc spp specific daemons bin...

Page 53: ...HPMC High Priority Machine Checks spp scripts The spp scripts directory contains scripts that perform a variety of functions dcm Dump Configuration Manager hard_logger The hard error logger HP Machine Check HPMC data collection script sppconsole The console utility spp data The spp data directory contains node_0 cfg Configuration file with scan rings and configured hardware Built by the ccmd daemo...

Page 54: ...CIS failures event_log Log of all event information A read only file which captures information generated by the ccmd daemon spp firmware The spp firmware directory is where firmware files are loaded temporarily before they are moved to the Utility Board ECUB spp est The est directory contains files used during scan testing spp man The spp man directory contains the man manual pages and many of th...

Page 55: ...d PDC OpenBoot PROM OBP and SPP Processor Dependent Code SPP_PDC make up the firmware on HP V Class servers that makes it possible to boot HP UX This chapter discusses the boot sequence and the commands available from the boot menu ...

Page 56: ...n Self Test POST runs POST is described in the HP Diagnostics Guide S Class X Class and V Class Servers 2 OBP probes all the devices 3 OBP loads SPP_PDC in RAM 4 OBP starts the HP UX loader which in turns calls SPP_PDC to set up CPU s memory and I O devices in a way that HP UX understands 5 The next action depends on whether Autoboot is enabled 1 If autoboot is enabled the operating system boots u...

Page 57: ... Figure 11 Boot process NO NO YES YES Boot menu Prompt displays To discontinue press any key within 10 seconds Continue Automatically Press any key HP UX boots Processor is starting the autoboot process displays Autoboot Enabled to display boot menu ...

Page 58: ... CPUs Completing core SRAM initialization Initializing main memory Probing memory MB0L MB1L MB2R MB3R MB4L MB5L MB6R MB7R Parallel memory initialization in progress PB3R MB0L PB0R MB1L PB1R MB2R PB1L MB3R PB2L MB4L PB2R MB5L PB0L MB6R PB3L MB7R Nodemask 00000001 Booting OBP OBP Power On Boot on 0 4 PDC Firmware Version Information PDC_ENTRY version 3 1 0 29 POST Revision 3 1 0 1 OBP Fieldtest Rele...

Page 59: ...tive 2048 KB 2048 KB HP PA82000 8 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 9 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 10 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 11 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 12 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 13 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 14 4 0 200 MHz Active 2048 KB 2048 KB HP PA82000 15 4 0 200 MHz Active 2048 ...

Page 60: ...he Forth password PAth PRI ALT CON path Display or modify a path PDT CLEAR DEBUG Display clear Non Volatile PDT state PIM_info cpu HPMC TOC LPMC Display PIM of current or any CPU RESET hard debug Force a reset of the system RESTrict ON OFF Display Select restricted access to Forth SCSI INIT RATE bus slot val List Set SCSI controller parms SEArch path Search for boot devices SECure ON OFF Display o...

Page 61: ...storage after a system crash CAUTION this command can delete important troubleshooting information do not enter the CLEARPIM command unless directed to CPUconfig proc ON OFF Displays or sets the configuration of processors DEfault Sets the system environment variables to defined values and changes certain HP variables so that HP UX can boot DIsplay Displays this menu ForthMode Switches to the Fort...

Page 62: ...PU RESET hard debug Resets the system state RESTrict ON OFF Displays or sets restricted access to Forth mode SCSI INIT RATE bus slot val Displays or sets SCSI controller initiator ID or transfer rate SEArch path Displays pathnames for devices with bootable media in the system SECure ON OFF Displays or sets secure boot mode If secure mode is set the boot process cannot be interrupted Only useful if...

Page 63: ...h Autoboot and Autosearch is OFF Syntax AUto BOot SEArch ON OFF Used alone this command displays the current status of the Autoboot and Autosearch flags BOot If ON the OS is automatically loaded from the primary boot path after a power up or reset Otherwise the system displays the boot menu and waits for interactive boot commands During an autoboot the process pauses for 10 seconds to allow the op...

Page 64: ...t Examples au This command displays the status of the Autoboot and Autosearch flags Autoboot ON Autosearch ON au bo This command displays the current setting of the Autoboot flag Autoboot ON au bo on This command sets the Autoboot flag ON Autoboot ON ...

Page 65: ... named command Examples The following example illustrate use of this command help au This command displays information for the auto command AUto BOot SEArch ON OFF Display or set the specified flag AUto boot on Enable auto boot on next boot AUto boot off Disable auto boot on next boot AUto search on Enable auto search on next boot AUto search off Disable auto search on next boot Auto search enable...

Page 66: ...52 Chapter4 Firmware OBP and PDC HElp command ...

Page 67: ...Chapter 5 53 5 Starting and stopping HP UX This chapter provides information about starting and stopping HP UX Starting your HP 9000 V Class Server Using the console Starting HP UX Stopping HP UX ...

Page 68: ...a single image of the operating system A V Class server consists of one node multiuser mode Allows multiple users to access the system simultaneously See also single user mode run level An HP UX mode of operation The etc inittab file defines which terminals and processes are active at each run level run level s Restricts user input to the console Also known as single user mode single user mode Res...

Page 69: ...ardware errors Debugs a hung system Runs HP UX V Class node Hosts OpenBoot PROM OBP software Runs HP UX The boot procedure differs according to the value of the Autoboot flag See Enabling Autoboot on page 49 for information on how to set Autoboot After you power up your V Class server if Autoboot is set to True OBP automatically starts HP UX Press the ESC key within 10 seconds to interrupt the boo...

Page 70: ...tem OBP then passes control to the sbin init process which sequentially executes the contents of etc inittab The inittab file executes the sbin bcheckrc and sbin rc scripts to check your file system and initialize the system The operating system continues to boot and displays additional information about your system After the boot process completes an HP UX login prompt appears in the sppconsole w...

Page 71: ... for example unable to open a specified device file the start up process ends and you need to solve the problem Restarting HP UX To restart the system after a reboot or hang you must bring the system to single user state Characteristics of the single user state include The only access to the system is through the sppconsole The only processes running on the system are the Shell on the console Back...

Page 72: ...m Step 1 Select the sppconsole window on the teststation or assume control remotely See Accessing the console remotely on page 34 Step 2 Log in as root Step 3 Change to the root directory Enter cd Step 4 Bring the system to single user state with the shutdown command Enter shutdown Step 5 Reboot the system with the reboot command Enter reboot ...

Page 73: ... Use shutdown Shutdown considerations Only the system administrator or a designated superuser can shut down the system The sbin shutdown command Warns all users to log out of the system within a grace period you specify Halts daemons Kills unauthorized processes Unmounts file systems Puts the system in single user mode Writes the contents of the I O buffers to a disk CAUTION Do not run shutdown fr...

Page 74: ...oot Step 3 Change to the root directory Enter cd Step 4 Shut down the system using the shutdown command Enter shutdown Progress messages detailing system shutdown activities print to your terminal Upon reaching run level 0 the system Restarts in single user mode Displays the root prompt Step 5 Bring the system to a complete stop with the reboot command Enter reboot h CAUTION Turn power off to the ...

Page 75: ...Power This chapter describes how to configure the UPS and what to do after a power failure Tasks covered include Configuring a PowerTrust UPS on V Class systems Monitoring the PowerTrust UPS Handling ac power failures ...

Page 76: ...creasing the value of shutdown_delay_mins if the site commonly experiences momentary power interruptions greater than one 1 minute for which recovery of power is expected The parameter shutdown_timeout_mins This parameter specifies the number of minutes to monitor the shutdown h operation before initiating reboot with the halt option In this way a reboot is executed even if the shutdown process ch...

Page 77: ...switches and indicators on the front panel of the PowerTrust UPS Look at the teststation Some messages from the UPS are displayed on the console Look in a log file UPS messages are recorded in a system log file On HP UX the log files is usr adm syslog a text file which can be viewed with vi or another editor For more information see the PowerTrust System Guide PN 3329 90002 that accompanied the Po...

Page 78: ...red for UPS you will see UPS messages on the console See Appendix A in PowerTrust System Guide PN 3329 90002 for explanations of the messages 3 The indicators on the PowerTrust UPS front panel labeled AC Output and Battery Power will be lit If ac power does not return If ac power does not return within the time specified by the parameter shutdown_delay_mins default 1 minute the shutdown 1M command...

Page 79: ...minutes if the battery is fully charged 1 You will hear an auditory alarm a single beep every 10 seconds If desired you can turn off the auditory alarm by pressing the switch on the PowerTrust UPS labeled Silence Alarm CAUTION Do NOT press the switch on the PowerTrust UPS labeled Output On Output Off Pressing this switch would shut off power to the node and other units connected to the UPS 2 With ...

Page 80: ... full battery discharge If the unit comes within approximately two 2 minutes of full battery discharge the PowerTrust UPS will issue three beeps every 10 seconds Prepare for an immediate loss of battery power For more information see the PowerTrust System Guide PN 3329 90002 that accompanied the PowerTrust UPS The operating system should have been configured for the UPSs as part of the system inst...

Page 81: ...pond to user input This lack of response indicates either a performance problem or system interruption Performance problems are generally characterized by The system responds to one or more programs users but not all or sluggishly to others The system seems to be very slow System interruptions usually result in a total loss of CPU resources for all users programs due to a System hang System panic ...

Page 82: ...llect troubleshooting information Step 1 If an error message is displayed on the system console record it Step 2 Record the information displayed on the system LCD See LCD liquid crystal display on page 22 for more information Step 3 Record any relevant information contained in the log files in the spp data directory on the teststation event_log Main log file hard_hist Filtered output from the har...

Page 83: ...m Step 1 At the console window of the teststation use one or both of the following commands to check for active processes making heavy use of system resources ps top See Managing Systems and Workgroups and the ps and top man pages for more information about options and usage Step 2 Enter a Ctrl C from the terminal exhibiting the problem to abort an executing command Step 3 Check another terminal t...

Page 84: ... both of the following utilities to communicate with the server ping telnet See the ping and telnet man pages for more information about options and usage Step 5 If possible wait about 15 minutes to see if the computer is really hung or if it has a performance problem With some performance problems a computer may not respond to user input for 15 minutes or longer Step 6 If the computer is really h...

Page 85: ...y board reset 4 TOC reset Used to produce a crash dump Step 7 Save the core dump file and contact the HP Response Center to have the core dump file analyzed Refer to your service contract for the phone number of your HP Response Center See Fast dump on page 79 for more information ...

Page 86: ... data in the event of a disk head crash or similar situation How frequently you update these backups depends on how much data you can afford to lose For information on how to back up data refer to Managing Systems and Workgroups After HP UX experiences a system panic the system May display an HPMC tombstone on the console if panic was caused by an HPMC A tombstone is a list of register values used...

Page 87: ...wording of the panic message should allow you to classify the problem into one of the following areas Peripheral problem Server or I O card problem File system problem LAN communication problem Logical Volume Manager LVM related problem Other Peripheral problem Use the following procedure to troubleshoot an apparent peripheral hardware failure Step 1 Check to ensure the device is powered on and on...

Page 88: ...ave an interface card or system problem See Interface card and system problem If the problem reappears it might be necessary to have the problem fixed by Hewlett Packard service personnel Interface card and system problem Use the following procedure if a hardware failure appears to be associated with an interface card or with the an internal component of the system Step 1 If an HPMC tombstone is d...

Page 89: ...e one with the directory has problems it is important to use the n option to the reboot command right after fsck completes fsck is normally run automatically at boot time See Rebooting the system on page 77 LAN communication problem Use the following procedure if the panic messages indicate a problem with LAN communication Step 1 Check LAN cable and media access unit MAU connections Step 2 Ensure ...

Page 90: ...show up immediately It will occur when the truncated part of the file system is overwritten by something else such as a new logical volume or the extension of a logical volume in the same volume group as the truncated file system For more information on LVM see the Managing Systems and Workgroups Recovery from other situations When you suspect the problem was something other than previously discus...

Page 91: ...ve saved a system core file to disk See Abnormal system shutdowns on page 79 Step 2 If the system panicked due to a corrupted file system fsck will report the errors and any corrections it makes If fsck terminates and requests to be run manually refer to the Managing Systems and Workgroups for further instructions If the problems were associated with your root file system fsck will ask you to rebo...

Page 92: ...ting the system Check applications to be sure that they are running properly and monitor the system closely for the next 24 hours For a short while backups may be done more frequently than normal until confidence in the system has been restored ...

Page 93: ...y without operator interaction By default fast dump selectively dumps only the parts of memory that are expected to be useful in debugging It improves system availability in terms of both the time and space needed to dump and analyze a large memory system The following commands allow you to configure save and manipulate the fast core dump crashconf Configures the destination and contents of a cras...

Page 94: ...X 11 0 dump devices had to be defined in the kernel configuration and they still can be using Release 11 0 Beginning with Release 11 0 however a new more flexible method for defining dump devices is available using crashconf Beginning with HP UX Release 11 0 there are three places where dump devices are configured 1 In the kernel same as releases prior to Release 11 0 2 During system initializatio...

Page 95: ... the dump is complete or no more defined space is available To ensure enough dump space define a dump area that is at least as big as your computer s physical memory plus 1 Mbyte Setting the amount of memory dumped and the classes of the memory pages determines the size of the dump The content can be configured while the system is running and changed without rebooting the system The larger the siz...

Page 96: ... levels of core dumps no dump selective dump or full dump Selective dump causes only the selected memory pages to get dumped see the crashconf 1M man page for more information NOTE In some specific cases HP UX will override the selective dump and request a full dump The operator is given ten seconds to override HP UX and continue with a selective dump The fewer pages dumped to disk and on reboot c...

Page 97: ...h paging swapping and as a dump device If system recovery time is critical do not configure the primary paging device as a dump device When the primary paging device is not used as one of the dump devices or after the crash image on the primary paging device has been saved by default savecrash runs in the background This reduces system boot time by running the system with only the primary paging d...

Page 98: ...your memory dump to someone else for analysis you first have to finish moving the pages on the dedicated dump devices to the HP UX file system area so that you can use a utility such as tar to bundle them for shipment To do that use the command usr sbin crashutil instead of savecrash to complete the copy ...

Page 99: ...isk space A large amount of time and disk space is needed to dump the entire contents of memory in a system with a large memory configuration or to copy a large memory image to the HP UX file system area during the reboot process The amount of dump area should at least be equal to the amount of memory in your system depending on a number of factors additional disk space greater than the amount of ...

Page 100: ...conds so the window of vulnerability for this situation is practically nonexistent Using a device for both paging and as a dump device It is possible to use a specific device for both paging purposes and as a dump device If however crash dump integrity is critical this is not recommended If savecrash determines that a dump device is already enabled for paging and that paging activity has already t...

Page 101: ...tical for a given type of crash and save only those pages Choosing this option can save a lot of disk space on the dump devices and again later on the HP UX file system area For instructions on how to do this see Defining dump devices on page 88 Compressed save vs noncompressed save Regardless of whether you choose full or selective dump whatever is saved on the dump devices needs to be copied to ...

Page 102: ...rocess edit the file etc rc config savecrash and comment out the line that sets the environment variable SAVE_PART 1 Defining dump devices When defining dump devices it is important to accurately determine the amount of space needed to hold the dump without wasting disk space To save a full dump the amount of dump space needed is equal to the size of the system s physical memory For selective dump...

Page 103: ... for a margin of safety In the above example the calculation would be 6208 x 4 Kbytes x 1 25 approx 30 Mbytes Kernel dump device definitions Capturing dumps for crashes that occur during early stages of the boot process requires sufficient dump space in the kernel configuration Using SAM to configure dump devices into the kernel The easiest way to configure dump devices is to use SAM A screen for ...

Page 104: ...nel using HP UX commands Step 1 Edit system file the file that config uses to build your new kernel This is usually the file stand system but can be another file if you prefer Dump to Hardware Device For each hardware dump device you want to configure into the kernel add a dump statement in the area of the file designated Kernel Device info immediately prior to any tunable parameter definitions Fo...

Page 105: ...imary paging device swap device as the dump device Step 2 Once you have edited the system file build a new kernel file using the config command Step 3 Save the existing kernel file probably stand vmunix to a safe place such as stand vmunix safe in case the new kernel file can not be booted Step 4 Boot the system from the new kernel file to activate the new dump device definitions Runtime dump devi...

Page 106: ...ical volumes from volume groups other than the root volume group The crashconf command You can also use the sbin crashconf command to add to remove or redefine dump devices The following are two ways to do this Reread the etc fstab file using the crashconf a option Use device arguments with crashconf to configure the devices With either method you can use the crashconf r option to specify that new...

Page 107: ...ices was built from a kernel build from the etc fstab file from use of the crashconf command or any combination of the these dump devices are used dumped to in the reverse order from which they were defined The last dump device in the list is the first one used and the first device in the list is the last one used Place devices that are used for both paging and dumping early in the list of dump de...

Page 108: ...N for no dump at the system console within the 10 second override period If disk space is limited but the operator feels that a dump is important the operator can enter S for selective dump regardless of the currently defined dump level The dump After the operator overrides the current dump level or the 10 second override period expires HP UX writes the physical memory contents to the dump devices...

Page 109: ...paging and dumping do not disable savecrash boot processing Loss of the dumped memory image to subsequent system paging activity can occur What to do after the system has rebooted After the system reboots make sure that the physical memory image dumped to the dump devices is copied to the HP UX file system area then you can either package and send it in for analysis or analyze it yourself using a ...

Page 110: ...AVECRASH 0 If you have delayed copying the physical memory image from the dump devices to the HP UX file system area run savecrash manually to do the copy when the system is running and when you have configured enough space to hold the copy in your HP UX file system area If you chose to do a partial save the only pages copied to your HP UX file system area during the boot process are those that we...

Page 111: ...sion is usr sbin crashutil v version source destination version Designates the version of the destination format source Designates the pathname of the crash dump to be converted destination Designates the pathname where the converted file will be written If no destination is specified the source will be overwritten See the crashutil 1M manpage for more information Analyzing crash dumps Analyzing c...

Page 112: ...98 Chapter7 Recovering from failures Abnormal system shutdowns ...

Page 113: ...ow 29 consolelog 39 consolelog file system status log 37 control and status registers discussed 5 8 core dump 79 core logic 3 8 bus 3 4 5 8 crashdump 79 crashutil 79 creating new windows 30 crossbar 3 5 6 csh window 30 D DAT drive eject button 20 indicators 19 data mover 10 dc off 15 dc on 15 dcm 39 DDS 3 DAT drive 19 default passwords 28 diag tool menu 30 display status 29 displays 13 do_reset 39...

Page 114: ...g file 37 logging in 28 logins 28 logons 28 logout 30 M Media CDROM 17 DAT 19 Tape 19 memory access 8 10 banks 7 8 interleaved 10 Message Passing Interface MPI 10 message window 29 mwm 29 30 N node environmental monitoring functions 24 node_0 cfg 39 O OBP 41 commands auto 47 49 boot 47 boottimer 47 clearpim 47 cpuconfig 47 default 47 display 47 forthmode 47 io 47 ls 47 os 47 password 48 path 48 pd...

Page 115: ...startup process overview 56 stopping 59 spy mode 34 sspdsh window 30 status 29 switches 13 symmetric multiprocessor SMP 3 Synchronous Dynamic Random Access Memory SDRAM 7 system reboot procedure 60 restarting the 57 shutting down 59 system displays 21 LCD Liquid Crystal Display 22 system hangs 70 system logs consolelog file 37 event_log file 37 system panics 72 file system problem 75 interface car...

Page 116: ...102 Index X X tool menu 30 ...

Reviews: