
4. Make sure there is proper air flow, and no cables or other obstructions are blocking the front or rear of the array.
5. Try replacing each PCM one at a time.
Sensor locations
The storage system monitors conditions at different points within each enclosure to alert you to problems. Power,
cooling fan, temperature, and voltage sensors are located at key points in the enclosure. Controller modules actively
manage the enclosure. Each module has a SAS expander with its own storage enclosure processor (SEP), which monitors
the status of these sensors and performs SCSI enclosure services (SES) functions according to the ANSI SES Standard.
If one of these modules fails, the other module continues to operate.
Refer to a module's specification or the SES interface specification for definitions of the module's functions and its SES
control.
The following sections describe each element and its sensors.
Power supply sensors
Each enclosure has two fully redundant Power and Cooling Modules (PCMs) with load-sharing capabilities. The power
supply sensors described in the following table monitor power or system driven voltage, current, temperature, and fan
status in each PCM. If the power supply sensors report a voltage that is under or over the threshold, check the input
voltage.
Description       Event/PCM fault LED condition
Power supply 1    Voltage, current, temperature, fan fault. Power or system driven
Power supply 2    Voltage, current, temperature, fan fault. Power or system driven
Table 23 Power supply sensor descriptions
Cooling fan sensors
Each PCM includes two fans. The normal range for fan speed is 4,000 to 13,000 RPM. Under normal operation, the
cooling fans are spinning with no fail states. When a fan speed is outside the allowable threshold, the enclosure
management software records a failure and posts an alarm. Replace the PCM reporting the fan failure.
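The fan speed check described above can be sketched as a simple range test. This is only an illustration, not the enclosure management software's actual logic: it assumes the stated normal range of 4,000 to 13,000 RPM is also the allowable threshold, and the fan labels and the `failed_fans` helper are hypothetical names.

```python
# Assumption: the documented normal range doubles as the allowable threshold.
FAN_MIN_RPM = 4000
FAN_MAX_RPM = 13000


def fan_speed_ok(rpm: int) -> bool:
    """Return True when the reading lies inside the normal range."""
    return FAN_MIN_RPM <= rpm <= FAN_MAX_RPM


def failed_fans(readings: dict) -> list:
    """Return the labels of fans whose speed is outside the normal range.

    `readings` maps a hypothetical fan label to its speed in RPM.
    """
    return [label for label, rpm in readings.items() if not fan_speed_ok(rpm)]
```

For example, with readings of 5,200 RPM, 0 RPM, and 13,000 RPM, only the stalled fan would be reported, which points to the PCM that needs replacing.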
Temperature sensors
Extremely high or low temperatures can cause significant damage if they go unnoticed. When a temperature fault is
reported, remedy it as quickly as possible to avoid system damage, for example by warming or cooling the installation
location.
Description                                         Normal operating range  Warning operating range  Failure threshold
CPU temperature (internal digital thermal sensor)   2ºC–98ºC                0ºC–1ºC, 99ºC–104ºC      >104ºC
SAS3008 internal digital sensor                     2ºC–104ºC               0ºC–1ºC, 105ºC–115ºC     <0ºC, >115ºC
Supercapacitor pack thermistor                      0ºC–50ºC                None                     None
SAS35x36 onboard temperature                        2ºC–104ºC               0ºC–1ºC, 105ºC–115ºC     <0ºC, >115ºC*
ASIC onboard temperature                            2ºC–100ºC               0ºC–1ºC, 101ºC–105ºC     <0ºC, >110ºC*
Controller module inlet                             6ºC–62ºC                1ºC–5ºC, 63ºC–67ºC       <0ºC, >68ºC*
SAS 12Gb FRAMs                                      2ºC–104ºC               0ºC–1ºC, 105ºC–115ºC     <0ºC, >115ºC*
Table 24 Controller platform temperature sensor descriptions
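As a rough illustration of how the ranges in Table 24 might be applied, the following sketch classifies a whole-degree Celsius reading for three of the listed sensors. The `Limits` structure, the `classify` function, and the "unspecified" result are assumptions made for this example and are not part of the product firmware or its interfaces. Readings that the table does not assign to any range (for example, exactly 68ºC at the controller module inlet) are reported as "unspecified" rather than guessed.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Limits:
    """Ranges copied from Table 24; all bounds are inclusive, in ºC."""
    normal: Tuple[int, int]
    warning: Tuple[Tuple[int, int], ...] = ()
    fail_below: Optional[int] = None  # failure when reading < this value
    fail_above: Optional[int] = None  # failure when reading > this value


# Three rows of Table 24, keyed by the sensor description.
LIMITS = {
    "CPU temperature": Limits((2, 98), ((0, 1), (99, 104)), None, 104),
    "Controller module inlet": Limits((6, 62), ((1, 5), (63, 67)), 0, 68),
    "Supercapacitor pack thermistor": Limits((0, 50)),
}


def classify(sensor: str, temp_c: int) -> str:
    """Return "normal", "warning", "failure", or "unspecified"."""
    lim = LIMITS[sensor]
    if lim.fail_below is not None and temp_c < lim.fail_below:
        return "failure"
    if lim.fail_above is not None and temp_c > lim.fail_above:
        return "failure"
    if lim.normal[0] <= temp_c <= lim.normal[1]:
        return "normal"
    if any(lo <= temp_c <= hi for lo, hi in lim.warning):
        return "warning"
    # The table does not cover this value for this sensor.
    return "unspecified"
```

Under these assumptions, `classify("CPU temperature", 100)` falls in the warning range and `classify("Controller module inlet", -1)` is a failure.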