November 2012
Altera Corporation
Arria V GZ Hard IP for PCI Express
13. Error Handling
Each PCI Express compliant device must implement a basic level of error
management and can optionally implement advanced error management. The Altera
Arria V GZ Hard IP for PCI Express implements both basic and advanced error
reporting. Given its position and role within the fabric, error handling for a Root Port
is more complex than that of an Endpoint.
The
PCI Express Base Specification 2.1 and 3.0
define three types of errors, outlined in
The following sections describe the errors detected by the three layers of the PCI
Express protocol and error logging. It includes the following sections:
■
■
■
■
Error Reporting and Data Poisoning
■
Uncorrectable and Correctable Error Status Bits
Table 13–1. Error Classification
Type
Responsible
Agent
Description
Correctable
Hardware
While correctable errors may affect system performance, data integrity is
maintained.
Uncorrectable, non-fatal
Device software
Uncorrectable, non-fatal errors are defined as errors in which data is lost,
but system integrity is maintained. For example, the fabric may lose a
particular TLP, but it still works without problems.
Uncorrectable, fatal
System software
Errors generated by a loss of data and system failure are considered
uncorrectable and fatal. Software must determine how to handle such
errors: whether to reset the link or implement other means to minimize
the problem.
November 2012
UG-01097-1.4