Operating System Surveillance
Operating system surveillance provides the service processor with a means to detect hang conditions, as
well as hardware or software failures, while the operating system is running. It also provides the operating
system with a means to detect a service processor failure caused by the lack of a return heartbeat.
Operating system surveillance is not enabled by default, allowing you to run operating systems that do not
support this service processor option.
You can also use service processor menus and AIX service aids to enable or disable operating system
surveillance.
For operating system surveillance to work correctly, you must set these parameters:
v
Surveillance enable/disable
v
Surveillance interval
The maximum time the service processor should wait for a heartbeat from the operating system before
timeout.
v
Surveillance delay
The length of time to wait from the time the operating system is started to when the first heartbeat is
expected.
Surveillance does not take effect until the next time the operating system is started after the parameters
have been set.
If desired, you can initiate surveillance mode immediately from service aids. In addition to the three options
above, a fourth option allows you to select immediate surveillance, and rebooting of the system is not
necessarily required.
If operating system surveillance is enabled (and system firmware has passed control to the operating
system), and the service processor does not detect any heartbeats from the operating system, the service
processor assumes the system is hung and takes action according to the reboot/restart policy settings.
See “Service Processor Reboot/Restart Recovery” on page 52.
If surveillance is selected from the service processor menus which are only available at system boot, then
surveillance is enabled by default as soon as the system boots. From service aids, the selection is
optional.
Call-Out (Call-Home)
Note:
The call-out function is handled by the Service Focal point application on the Hardware
Management Console (HMC).
The service processor can call out (call-home) when it detects one of the following conditions:
v
System firmware surveillance failure
v
Operating system surveillance failure (if supported by operating system)
v
Restarts
v
Critical hardware failure
v
Abnormal operating system termination
To enable the call-out feature, do the following:
1. Connect a modem to any serial port.
2. Set up the following using the service processor menus or diagnostic service aids:
v
Enable call-out for the serial port where the modem is connected.
60
pSeries 610 Model 6C3 and Model 6E3 User’s Guide
Summary of Contents for pSeries 615 Model 6C3
Page 1: ...pSeries 615 Model 6C3 and Model 6E3 User s Guide SA38 0629 00 ERserver ...
Page 2: ......
Page 3: ...pSeries 615 Model 6C3 and Model 6E3 User s Guide SA38 0629 00 ERserver ...
Page 14: ...xii pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 16: ...xiv pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 20: ...xviii pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 26: ...6 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 46: ...26 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 86: ...66 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 110: ...90 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 146: ...126 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 164: ...144 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 168: ...148 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 172: ...152 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 204: ...184 pSeries 610 Model 6C3 and Model 6E3 User s Guide ...
Page 207: ......