
4
System Monitoring
4.1
System Monitoring Tools
Urika-GX provides many tools for monitoring system components, resources and services.
Table 13. Monitoring Tools
Monitoring Tool
Monitored Component/Service/Resource
HSS
Helps monitor physical system components, such as
PCIe channels and Dual Aries Network Card (dance).
iSCB
Helps monitor sub-rack level attributes, such as power
supply, amperage, fan status, and temperature info are
monitored via the iSCB.
CAUTION: iSCB CLI commands other than
the
status
command should NOT be
executed on the Urika-GX system, unless
advised by Cray Support. For more
information, contact Cray support.
The
capmc
,
ux-nid-cobbler-status
and
ux-
nid-status
commands
Helps monitor node status.
Nagios
Helps monitor:
●
CPU - per node and aggregated
●
Memory - per node and aggregated
●
Storage
○
Lustre
○
SSD and HDD per node and aggregated
●
Management, operational and Aries network
bandwidth used
●
Node status
Grafana
Provides information about utilization of system
resources, such as CPU, memory, I/O, etc.
urika-state
command
Retrieves the status of analytic applications.
System Monitoring
S3016
65