Volume 2, Part 2: MP Coherence and Synchronization
2:519
to account for both the memory ordering semantics and dependencies. It is important
to keep in mind that the observance of a dependency between two operations does not
imply an ordering relationship (from the standpoint of the memory ordering model)
between the operations as
describes.
Assuming that a processor can bypass locally-written values before they are made
globally-visible implies that there is a local and a global visibility points for a memory
operation where a value always becomes locally visible before it becomes globally
visible. Since M1 and M4 can have local visibility with respect to M2 and M5 as well as
global visibility,
where m1 and M1 represent local and global visibility of memory operation 1,
respectively. There are two things to note. First, the ordering of the local visibilities of
operations M1 and M4 (m1 and m4, respectively) allow each processor to honor its data
dependencies. That is, Processor #2 honors the RAW dependency through memory
between M1 and M2 by requiring m1 to become visible before M2. Second, that these
requirements do not place any constraints on the relative ordering perceived by a
remote
observer of operation M1 with M2 and M3 or of operation M4 with M5 and M6
(as the local visibilities meet the
local
ordering constraints that the dependencies
impose).
The code in
and these constraints together imply that
Thus, the outcome r1 = 1, r3 = 1, r2 = 0, and r4 = 0 is allowed because these
statements are consistent with our definition of local and global visibility. Specifically, a
value becomes locally visible before it becomes globally visible. Similar reasoning can
show that the constraints also imply that
2.2.1.9
Preventing Store Buffers from Satisfying Local Loads
In the code shown in
, there are no ordering constraints
between the store and acquire load from the standpoint of memory ordering semantics
(however, there is a RAW dependency through memory that forces the acquire load to
follow the store). Bypassing may not occur if doing so violates the memory ordering
constraints of memory operations between the store and the bypassing read.
presents a variation on the execution in
from
that illustrates this behavior.
Table 2-11.
Preventing Store Buffers from Satisfying Local Loads
Processor #0
Processor #1
st
[x] = 1
// M1
mf
// M2
ld.acq r1 = [x]
// M3
ld
r2 = [y]
// M4
st
[y] = 1
// M5
mf
// M6
ld.acq r3 = [y]
// M7
ld
r4 = [x]
// M8
Outcome:
r1 = 1, r3 = 1, r2 = 0, and r4 = 0 is not allowed
m1
M2
M3; m1
M1
m4
M5
M6; m4
M4
r1 = 1
m1
M2
r3 = 1
m4
M5
r2 = 0
M3
M4
m1
M6 because m1
M3 and M3
M4 and M4
M6
r4 = 0
M6
M1
m1
M6 and M6
M1
m1
M1
m4
M4.
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...