Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
for the server to function as a sealed system. If internal cooling areas are compromised, the
server cooling system, which relies on the movement of cool air through the server, cannot
function properly, and the airflow inside the server becomes chaotic and non-directional.
Action: Inspect the server interior to ensure that the air baffle is properly installed. Ensure that
all external-facing slots (storage drive, DVD, PCIe) are occupied with either a component or a
component filler panel. Ensure that the server top cover is in place and sits flat and snug on top
of the server.
Prevention: When servicing the server, ensure that the air baffle is installed correctly and that
the server has no unoccupied external-facing slots. Never operate the server without the top
cover installed.
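The inspection items above can be expressed as a simple checklist. The following is a minimal sketch only: the ChassisState record, its field names, and the slot labels are hypothetical and are not part of any Oracle tool or interface. It shows how a technician's notes could be checked for the three conditions named in this section (air baffle, external-facing slots, top cover).

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ChassisState:
    """Hypothetical record of what a technician observed during inspection."""
    air_baffle_installed: bool
    top_cover_seated: bool
    # Maps an external-facing slot label (storage drive, DVD, PCIe) to its
    # occupant: "component", "filler", or "empty". Labels are illustrative.
    slots: Dict[str, str] = field(default_factory=dict)


def airflow_seal_problems(state: ChassisState) -> List[str]:
    """Return every condition that would break the sealed airflow path."""
    problems = []
    if not state.air_baffle_installed:
        problems.append("air baffle missing or not properly installed")
    if not state.top_cover_seated:
        problems.append("top cover missing or not seated flat")
    for label, occupant in sorted(state.slots.items()):
        # A slot is sealed only by a component or a component filler panel.
        if occupant not in ("component", "filler"):
            problems.append(f"slot {label} is open")
    return problems


observed = ChassisState(
    air_baffle_installed=True,
    top_cover_seated=False,
    slots={"PCIe 1": "component", "Drive 3": "empty", "DVD": "filler"},
)
for problem in airflow_seal_problems(observed):
    print(problem)
```

An empty result means that none of the three conditions described in this section was observed. It does not replace the physical inspection.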
Hardware Component Failure
Components such as power supplies and fan modules are an integral part of the server cooling
system. When one of these components fails, the server internal temperature can rise. This rise
in temperature can cause other components to enter an over-temperature state. Additionally,
some components, such as processors, might overheat when they are failing, which can also
generate an over-temperature event.
To reduce the risk related to component failure, power supplies and fan modules are installed
in pairs to provide redundancy. Redundancy ensures that if one component in the pair fails,
the other functioning component can continue to maintain the subsystem. For example, power
supplies serve a dual function; they provide both power and airflow. If one power supply fails,
the other functioning power supply can maintain both the power and the cooling subsystems.
Action: Investigate the cause of the over-temperature event, and replace failed components
immediately. For hardware troubleshooting information, see “Troubleshooting Server Hardware
Prevention: Component redundancy is provided to allow for component failure in critical
subsystems, such as the cooling subsystem. However, once a component in a redundant
system fails, the redundancy no longer exists, and the risk for server shutdown and component
failures increases. Therefore, it is important to maintain redundant systems and replace failed
components immediately.
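The redundancy reasoning in this section can be illustrated with a small model. This is a sketch under stated assumptions, not a representation of how Oracle ILOM reports component health: the PairStatus names and both functions are hypothetical. It shows why a pair with one failed unit still maintains its subsystem but should be treated as needing immediate replacement.

```python
from enum import Enum


class PairStatus(Enum):
    """Hypothetical states for a redundant pair (power supplies or fan modules)."""
    REDUNDANT = "redundant"  # both units working
    DEGRADED = "degraded"    # one unit working: subsystem maintained, no redundancy
    FAILED = "failed"        # no unit working: subsystem no longer maintained


def pair_status(unit_a_ok: bool, unit_b_ok: bool) -> PairStatus:
    """Classify a two-unit redundant pair from the health of each unit."""
    working = int(unit_a_ok) + int(unit_b_ok)
    if working == 2:
        return PairStatus.REDUNDANT
    if working == 1:
        return PairStatus.DEGRADED
    return PairStatus.FAILED


def needs_immediate_replacement(status: PairStatus) -> bool:
    """Any loss of redundancy calls for replacing the failed unit right away."""
    return status is not PairStatus.REDUNDANT


# One power supply has failed: the other maintains power and cooling,
# but a second failure would now take the subsystem down.
status = pair_status(unit_a_ok=True, unit_b_ok=False)
print(status.value, needs_immediate_replacement(status))
```

The model treats the degraded state as actionable rather than healthy, which mirrors the guidance to replace failed components immediately.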
Troubleshooting Power Issues
If your server does not power on, the cause of the problem might be:
■ “AC Power Connection” on page 34
Oracle Exadata Storage Server X5-2 High Capacity Service Manual, January 2018