Quick Start and Basic Operation
DGX A100 System
DU-09821-001_v06
| 24
4.3
Obtaining an NGC Account
NVIDIA GPU Cloud (NGC) provides simple access to GPU-optimized software for deep
learning, machine learning and high-performance computing (HPC). An NGC account grants
you access to these tools as well as the ability to set up a private registry to manage your
customized software.
Work with NVIDIA Enterprise Support to set up an NGC enterprise account if you are the
organization administrator for your DGX A100 purchase. See the “NVIDIA GPU Cloud
Documentation” for detailed instructions on getting an NGC enterprise account.
4.4
Turning DGX A100 On and Off
DGX A100 is a complex system, integrating a large number of cutting-edge components with
specific startup and shutdown sequences. Observe the following startup and shutdown
instructions.
4.4.1
Startup Considerations
To keep your DGX A100 running smoothly, allow up to a minute of idle time after reaching the
login prompt. This ensures that all components can complete their initialization.
4.4.2
Shutdown Considerations
WARNING: Risk of Danger - Removing power cables or using Power Distribution Units (PDUs) to
shut off the system while the Operating System is running may cause damage to sensitive
components in the DGX A100 server.
When shutting down DGX A100, always initiate the shutdown from the operating system,
momentary press of the power button, or by using Graceful Shutdown from the BMC, and wait
until the system enters a powered-off state before performing any maintenance.
4.5
Verifying Functionality – Quick Health
Check
NVIDIA provides customers a diagnostics and management tool called NVIDIA System
Management, or NVSM. The nvsm command can be used to determine the system's health,
identify component issues and alerts, or run a stress test to make sure all components are in
working order while under load. The use of Docker is key to getting the most performance out
of the system since NVIDIA has optimized containers for all the major frameworks and
workloads used on DGX systems.
Содержание DGX A100
Страница 1: ...DU 09821 001_v06 May 2022 DGX A100 System User Guide ...
Страница 74: ...Using the BMC DGX A100 System DU 09821 001_v06 69 7 Select Server CA Configuration 8 Select Enroll Cert ...
Страница 76: ...Using the BMC DGX A100 System DU 09821 001_v06 71 ...
Страница 107: ...Redfish APIs Support DGX A100 System DU 09821 001_v06 102 Korea RoHS Material Content Declaration ...
Страница 108: ...Redfish APIs Support DGX A100 System DU 09821 001_v06 103 ...