2:532
Volume 2, Part 2: MP Coherence and Synchronization
• Programmed I/O for paging of code pages.
• DMA for paging of code pages.
The next four sections discuss these techniques in greater depth.
To illustrate the code sequences for self- and cross-modifying code, the examples in
this section use the syntax “
st
[foo]
=
new
” to represent a group of aligned stores that
change the instruction at address
foo
to the instruction “
new
”. The Itanium architecture
requires that the instruction stream see aligned stores atomically. In addition, the
syntax “
fc.i
foo
” represents a group of flush cache instructions that ensures the cache
line addressed by
foo
is coherent with all the instruction caches. Updating more than
one instruction simply requires the appropriate store/flush “pair” for each updated
instruction
1
.
2.5.1
Self-modifying Code
presents the Itanium instruction sequence necessary to update a code
image location on the local processor only.
This code fragment changes the instruction at the address
code
to the new instruction
new_inst
. After executing this code, the change is visible to both the local processor’s
caches and its pipeline.
The
st
instruction updates the code image and the
fc.i
instruction ensures the value
stored is coherent with the instruction cache. The
fc.i
is necessary because the
Itanium architecture does not require instruction caches to be coherent with data stores
for Itanium architecture-based code. Next, the
sync.i
ensures that the code update is
visible to the instruction stream of the local processor and orders the cache flush with
respect to subsequent operations by waiting for the prior
fc.i
instructions to be made
visible. Finally, the
srlz.i
instruction forces the pipeline to re-initiate any instruction
group fetches it performed after the
srlz.i
and also waits for the
sync.i
to complete;
effectively making the pipeline coherent with the updated code image.
The serialization instruction is not necessary if software can
guarantee
that the
processor encounters an event that re-initiates code fetches performed after the
sync.i
, such as an interruption or an
rfi
, before executing the new code. Events such
as an interrupt or
rfi
both perform an instruction serialization which in this example
waits for the
sync.i
to complete and then re-initiates code fetches.
1.
This description hides some of the complexity involved. Specifically, the flush and store operations
have different sizes. Whereas multiple store instructions are necessary to update a 16 byte instruc-
tion, a single cache line flush invalidates at least two 16 byte instructions.
Figure 2-8.
Updating a Code Image on the Local Processor
patch_local:
st
[code] = new_inst
// write new instruction
fc.i
code ;;
// flush new instruction
sync.i ;;
// sync i stream with store
srlz.i ;;
// serialize
// Local caches and pipeline are now coherent with new_inst...
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...