A.4 RAS Features
The server supports a variety of Reliability, Availability, and Serviceability (RAS)
features. You can configure these features to improve the server's RAS.
For details about how to configure RAS features, see the
Platform BIOS Parameter Reference.
Table A-3 RAS features
| Module | Feature | Description |
|--------|---------|-------------|
| CPU | Corrected Machine Check Interrupt | Triggers an interrupt when corrected errors are detected. |
| DIMM | Failed DIMM Isolation | Identifies a faulty DIMM and isolates it from others before it is replaced. |
| DIMM | Memory Thermal Throttling | Automatically adjusts DIMM temperatures to avoid damage due to overheating. |
| DIMM | Rank Sparing | Allocates some memory ranks as backup ranks to prevent the system from crashing due to uncorrectable errors. |
| DIMM | Memory Address Parity Protection | Detects memory command and address errors. |
| DIMM | Memory Demand and Patrol Scrubbing | Corrects correctable memory errors upon detection. If these errors are not corrected promptly, uncorrectable errors may occur. |
| DIMM | Memory Mirroring | Maintains a mirrored copy of memory data to improve system reliability. |
| DIMM | Single Device Data Correction | Provides a single-device multi-bit error correction capability to improve memory reliability. |
| DIMM | Device Tagging | Degrades and rectifies DIMM device faults to improve DIMM availability. |
| DIMM | Data Scrambling | Optimizes data stream distribution and reduces the probability of errors, improving the reliability of data streams in memory and the ability to detect address errors. |
| PCIe | PCIe Advanced Error Reporting | Provides detailed PCIe error reporting to improve server serviceability. |
| UPI | Intel UPI Link Level Retry | Provides a retry mechanism upon errors to improve UPI reliability. |
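
As an illustration of the Rank Sparing entry in Table A-3, the following Python sketch models the general idea of holding a rank in reserve and switching to it before errors on an active rank become uncorrectable. It is a conceptual toy model only, not the server's firmware or BIOS logic. The class name SparingModel, the method report_corrected_error, and the corrected-error threshold of 3 are assumptions made for the example; the real trigger conditions are determined by the platform and its BIOS settings.

```python
from dataclasses import dataclass, field


@dataclass
class SparingModel:
    """Toy model of rank sparing (hypothetical; not the server's firmware logic)."""
    active_ranks: list          # ranks currently holding data
    spare_ranks: list           # ranks held in reserve
    ce_threshold: int = 3       # assumed corrected-error limit per rank
    ce_counts: dict = field(default_factory=dict)

    def report_corrected_error(self, rank):
        """Record one corrected error; return the spare used on failover, else None."""
        if rank not in self.active_ranks:
            raise ValueError(f"rank {rank} is not active")
        self.ce_counts[rank] = self.ce_counts.get(rank, 0) + 1
        if self.ce_counts[rank] < self.ce_threshold or not self.spare_ranks:
            return None
        # Threshold reached: a spare takes over and the failing rank is retired.
        spare = self.spare_ranks.pop(0)
        self.active_ranks[self.active_ranks.index(rank)] = spare
        del self.ce_counts[rank]
        return spare


model = SparingModel(active_ranks=[0, 1, 2], spare_ranks=[3])
results = [model.report_corrected_error(1) for _ in range(3)]
print(results)             # [None, None, 3]
print(model.active_ranks)  # [0, 3, 2]
```

In this sketch, the third corrected error on rank 1 reaches the assumed threshold, so the spare rank 3 replaces it. Once no spare remains, further errors are only counted, which mirrors the fact that sparing capacity is limited by the number of ranks set aside.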
Huawei FusionServer Pro 1288H V5 Server
Technical White Paper
A Appendix
Issue 14 (2020-09-03)
Copyright © Huawei Technologies Co., Ltd.