136
Maintenance
The following information describes the guidelines and tasks for daily server maintenance.
Guidelines
•
Keep the equipment room clean and tidy. Remove unnecessary devices and objects from the
equipment room.
•
Make sure the temperature and humidity in the equipment room meet the server operating
requirements.
•
Regularly check the server from HDM for operating health issues.
•
Keep the operating system and software up to date as required.
•
Make a reliable backup plan:
{
Back up data regularly.
{
If data operations on the server are frequent, back up data as needed in shorter intervals
than the regular backup interval.
{
Check the backup data regularly for data corruption.
•
Stock spare components on site in case replacements are needed. After a spare component is
used, prepare a new one.
•
Keep the network topology up to date to facilitate network troubleshooting.
Maintenance tools
The following are major tools for server maintenance:
•
Hygrothermograph
—
Monitor the operating environment of the server.
•
HDM and FIST
—
Monitor the operating status of the server.
Maintenance tasks
Observing LED status
Observe the LED status on the front and rear panels of the server to verify that the server modules
are operating correctly. For more information about the status of the front and rear panel LEDs, see
front panel and rear panel in "
Appendix A Server specifications
Reviewing the logs maintained by HDM
You can review the logs in HDM to identify the operating health status of the server, troubleshoot
server issues, and audit user behaviors.
HDM maintains the following logs:
•
Event
log
—Records events reported by server sensors, including fan events, overheating
events, power supply overloaded events, and processor, memory, and PCIe error events.
•
HDM
log
—Includes audit log entries and firmware update log entries.
{
Audit log entries record HDM administrative events, including access to HDM and remote
console startup.