Volume 2, Part 2: MP Coherence and Synchronization
2:515
This contradicts the postulated outcome r1 = 0 and r2 = 0 and thus the Itanium
memory ordering model disallows the r1 = 1 and r2 = 0 outcome. Specifically, if M3
reads 0, then M4, M5, and M6 may not yet be visible but M1 and M2 must be visible.
Thus, when M6 becomes visible it must observe x = 1 because M1 is already visible.
2.2.1.6
Data Dependency Does Not Establish MP Ordering
The dependency rules define the relationship between memory operations that access
the same address. Specifically, the Itanium architecture resolves read-after-write
(RAW), write-after-read (WAR), and write-after-write (WAW) dependencies through
memory in program order on the local processor. As
discusses,
dependencies are fundamentally different from the ordering semantics even though
both affect ordering relationships between groups of memory accesses.
The execution shown in
illustrates this difference.
The following discussion focuses on the outcome r1 = 1, r2 = 1, and r3 = 0. This
outcome is allowed only because the Itanium architecture treats data dependencies and
the ordering semantics differently.
The ordering semantics require
, but do not place any constraints on the
relative order of operations M1, M2, or M3. Due to the register and memory
dependencies between the instructions on Processor #0, these operations complete
in
program order
on Processor #0 and also become
locally
visible in this order. However,
the operations need
not
be made visible to remote processors in program order. In this
outcome it appears to Processor #0 as if
while to Processor #1 it appears that
There are two things to note here. First, the behavior is another example of
the local bypass behavior that
presents on
. Second, there
are no dependencies
directly
between M1 and M3 that requires them to become
globally visible in program order.
Note:
All processors will observe the order established by a particular processor in
case of a WAW memory dependency to the same location. For example, all pro-
cessors in the coherence domain eventually see a value of 1 in location x in the
following code:
st
[x] = 0
// M1: set [x] to 0
st
[x] = 1
// M2: set [x] to 1,
// cannot move above M1 due to WAW
because there is a WAW memory dependency between from M2 to M1
and the Itanium architecture requires that the local processor resolves
RAW, WAR, and WAW dependencies between its memory accesses in
program order. Thus,
even though the ordering semantics do
not place any constraints on the relative ordering of M1 and M2.
Table 2-5.
Dependencies Do Not Establish MP Ordering (1)
Processor #0
Processor #1
st
[x] = 1 ;;
// M1
ld
r1 = [x] ;;
// M2
st
[y] = r1 ;;
// M3
ld.acq
r2 = [y]
// M4
ld
r3 = [x]
// M5
Outcomes:
r1 = 1, r2 = 1, and r3 = 0 is allowed
M4
M5
M1
M3
M3
M1.
M1
M2
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...