Volume 2, Part 2: MP Coherence and Synchronization
2:511
In the Itanium architecture, dependencies between operations by a processor have
implications for the ordering of those operations at that processor. The discussion in
and
on
explores this issue in
greater depth.
The following sections examine the Itanium ordering model in detail.
presents several memory ordering executions to illustrate important behaviors of the
model.
discusses how memory attributes and the ordering model
interact. Finally,
describes how the Itanium memory ordering model
compares with other memory ordering models.
2.2.1
Memory Ordering Executions
Multiprocessor software that uses shared memory to communicate between processes
often makes assumptions about the order in which other agents in the system will
observe memory accesses. As
describes, the Itanium
architecture provides a rich set of ordering semantics that allows software to express
different ordering constraints on a memory operation, such as a load. Writing correct
multiprocessor software requires that the programmer (or compiler) select the ordering
semantic appropriate to enforce the expected behavior.
For example, an algorithm that requires two store operations A and B become visible to
other processors in the order {A, B} will use stores with different ordering semantics
than an algorithm that does not require any particular ordering of A and B. Although it
is always safe to enforce stricter ordering constraints than an algorithm requires, doing
so may lead to lower performance. If the ordering of memory operations is not
important, software should use unordered ordering semantics whenever possible for
best possible performance.
This section presents multiprocessor executions to demonstrate the ordering behaviors
that the Itanium architecture allows and to contrast the Itanium ordering model with
other ordering models. The executions consist of sequences of memory accesses that
execute on two or more processors and highlight outcomes that the Itanium memory
ordering model either allows or disallows once all accesses on all processors complete.
A programmer can use these executions as a guide to determine which Itanium
memory ordering semantics are appropriate to ensure a particular visibility order of
memory accesses.
presents the assumptions and notational conventions that the
upcoming discussions use to examine the executions. The remaining eleven sections
each explore a different facet of the Itanium ordering model:
• Relaxed ordering of unordered memory operations (
).
• Using acquire and release semantics to order operations (
).
• Loads may pass stores (
) and how to prevent this behavior
• When dependencies do or do not establish memory ordering (
and
• Satisfying loads from store buffers (
) and how to prevent this
).
• Semaphore operations and local bypass (
).
Summary of Contents for ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Page 1: ......
Page 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Page 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Page 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Page 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Page 230: ......
Page 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Page 380: ...2 132 Volume 2 Part 1 Interruptions ...
Page 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Page 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Page 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Page 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Page 808: ...2 560 Volume 2 Part 2 Context Management ...
Page 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Page 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Page 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Page 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Page 891: ......
Page 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Page 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Page 1296: ......
Page 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Page 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Page 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Page 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Page 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...