Chapter 1
Overview
Server Errors
To support high availability (HA), the new chipset includes functionality for error detection, correction,
and recovery. Errors in the new chipset are divided into the following categories:
- Protection domain access
- Hardware correctable
- Global shared memory
- Hardware uncorrectable
- Fatal
- Blocking timeout
- Deadlock recovery errors
These categories are listed in order of increasing severity. They range from protection domain (PD) access
errors, which are caused by software or hardware running in another PD, to deadlock recovery errors, which
indicate a serious hardware failure that requires a cell reset to recover. The term "software" refers to
privileged code, such as PDC or the OS, but not to user code. The sx2000 chipset supports the PD concept, in
which user and software errors in one PD cannot affect another PD.
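The severity ordering above can be sketched as an ordered enumeration. This is a minimal illustration only: the names `ErrorCategory`, `most_severe`, and `requires_cell_reset` are hypothetical, and the numeric values are assumptions chosen to preserve the listed order, not hardware encodings.

```python
from enum import IntEnum


class ErrorCategory(IntEnum):
    """Error categories in the order the text lists them (least to most severe).

    The numeric values are illustrative assumptions, not hardware encodings.
    """
    PD_ACCESS = 1
    HW_CORRECTABLE = 2
    GLOBAL_SHARED_MEMORY = 3
    HW_UNCORRECTABLE = 4
    FATAL = 5
    BLOCKING_TIMEOUT = 6
    DEADLOCK_RECOVERY = 7


def most_severe(categories):
    """Return the most severe category from a non-empty iterable."""
    return max(categories)


def requires_cell_reset(category):
    """True for deadlock recovery errors, the only category the text
    explicitly says needs a cell reset. Other categories are not covered."""
    return category == ErrorCategory.DEADLOCK_RECOVERY
```

Because `IntEnum` members compare as integers, a set of logged categories can be reduced to the worst one with `most_severe`.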
Protection Domain Access Errors
PD access errors are caused by disallowed transactions that originate outside the PD. Packets from outside the
coherency set should not impact the interface, and some packets from within the coherency set but outside
the PD are handled as PD access errors. These errors typically result from a software error or from bad
hardware in another PD. They do not indicate a hardware failure in the reporting cell.
An example of a PD access error is an interrupt from a cell that is outside the PD and not part of the interrupt
protection set. For these errors, the sx2000 chipset typically drops the transaction or converts it to a harmless
transaction, and logs the error. No error is signaled. PD access errors by themselves do not cause the
block to enter No_shared mode or fatal error mode.
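The interrupt example above can be modeled roughly as follows. This is a software sketch under stated assumptions, not the chipset's logic: `Transaction`, `CellInterface`, and all field names are hypothetical, and only the "drop and log" path is shown (the "convert to a harmless transaction" path is omitted).

```python
from dataclasses import dataclass, field


@dataclass
class Transaction:
    kind: str          # hypothetical label, e.g. "interrupt"
    source_cell: int   # cell that issued the transaction


@dataclass
class CellInterface:
    pd_cells: set                  # cells inside this PD (assumed model)
    interrupt_protection_set: set  # outside cells allowed to interrupt
    error_log: list = field(default_factory=list)

    def receive(self, txn):
        """Return the transaction to process, or None if it was dropped."""
        allowed = (
            txn.source_cell in self.pd_cells
            or (txn.kind == "interrupt"
                and txn.source_cell in self.interrupt_protection_set)
        )
        if allowed:
            return txn
        # PD access error: log it and drop the transaction.
        # No error is signaled, so nothing is raised here.
        self.error_log.append(("PD_ACCESS", txn.kind, txn.source_cell))
        return None
```

The caller sees only a dropped transaction; the log entry is the sole record that anything happened, which mirrors the "No error is signaled" behavior described in the text.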
Hardware Correctable Errors
Hardware correctable errors are errors that the hardware can correct. A typical example of a hardware
correctable error is a single-bit ECC error. For these errors, the sx2000 chipset corrects and logs the error. No
direct notification is given to software that an error has occurred (no LPMC is generated). To detect that an
error has occurred, firmware or software must read the error logs.
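The correct-and-log behavior can be illustrated with a textbook Hamming(7,4) code. The source does not describe the ECC scheme the sx2000 actually uses, so this code is a stand-in, and `encode`, `read_and_correct`, and the log tuple format are hypothetical.

```python
DATA_POSITIONS = (3, 5, 6, 7)    # Hamming(7,4) data bit positions (1-indexed)
PARITY_POSITIONS = (1, 2, 4)     # parity bit positions


def encode(nibble):
    """Encode 4 data bits into a 7-bit codeword; index 0 of the list is unused."""
    bits = [0] * 8
    for shift, pos in enumerate(DATA_POSITIONS):
        bits[pos] = (nibble >> shift) & 1
    syndrome = 0
    for pos in DATA_POSITIONS:
        if bits[pos]:
            syndrome ^= pos
    # Set parity bits so the XOR of all set positions becomes zero.
    for pos in PARITY_POSITIONS:
        bits[pos] = 1 if syndrome & pos else 0
    return bits


def read_and_correct(bits, error_log):
    """Return the data nibble, silently fixing a single flipped bit.

    A correction is recorded in error_log but not reported to the caller,
    so software must read the log to learn that an error occurred.
    """
    syndrome = 0
    for pos in range(1, 8):
        if bits[pos]:
            syndrome ^= pos
    if syndrome:
        bits[syndrome] ^= 1  # the syndrome is the position of the bad bit
        error_log.append(("HW_CORRECTABLE", syndrome))
    nibble = 0
    for shift, pos in enumerate(DATA_POSITIONS):
        nibble |= bits[pos] << shift
    return nibble
```

A caller that flips one bit of `encode(0b1011)` and then calls `read_and_correct` still gets `0b1011` back; only the length of `error_log` reveals that a correction took place.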
Global Shared Memory Errors
Global shared memory (GSM) is a high-performance mechanism for communication between separate PDs
using GNI memory without exposing a PD to hardware or software failures in the other PD. Each PD
supports eight sharing ranges. Each of these ranges is readable and writable within the PD, and
programmable to be read-only or readable and writable to other PDs. Ranges of memory, called sharing windows,