Volume 2, Part 2: MP Coherence and Synchronization
2:533
2.5.2
Cross-modifying Code
Consider a multi-threaded program for a multiprocessor system that dynamically
updates some procedure that any processor in the system may execute. The program
maintains several disjoint buffers to hold the new code and requires a processor to
execute an IP-relative branch instruction at some address
x
to reach the code. In this
scenario, the program updates the procedure by emitting the new code into a different
buffer and then patching the branch at address
x
to target this new buffer. By carefully
writing the update code, software can ensure that any processor in the system sees
either:
• The original branch at address
x
that targets the original code in the old buffer
along with the original code, or
• The new branch at address
x
that targets the new code in the new buffer along with
the new code.
The code in
illustrates an optimized Itanium architecture-based code
sequence that implements the cross-modifying code for this example.
To reach the new code at
new_code
, the processor executes the branch instruction at
x
.
Initially, this branch jumps to an address other than
new_code
.
Note:
The programmer needs to ensure that the branch to
new_code
is updated atom-
ically. If an 8-byte store is used to update the branch, then the programmer
needs to ensure that the branch to
new_code
is either in the first or last slot of
the bundle.
The release store ensures a processor cannot see the new branch at address
x
and the
original code at address
new_code
. That is, if a processor encounters “
branch
<new_code>
” at address
x
, then the processor’s instruction cache must be coherent
with the code image updates applied before the release store that updates the branch.
If remote processors may see either the old or new code sequence, the final three
instructions in
are not necessary. In this case, the remote processors see the
code image updates at some point in the future. In the meantime, they continue to
execute the old code.
Figure 2-9.
Supporting Cross-modifying Code without Explicit Serialization
patch:
st
[new_code] = new_inst // write new instruction
fc.i
new_code ;;
// flush new instruction
sync.i ;;
// sync i stream with store
// Update the target of the branch that jumps to the updated code.
// This branch MUST be ip-relative. Before executing the following
// store, the branch jumps to somewhere other than “new_code”.
//
st.rel [x] = “branch <new_code>”
// If it is desired to propagate “branch <new_code>” to both
// the local processor and remote processor now, the following
// code is also necessary:
//
fc.i
x ;;
// flush branch
sync.i ;;
// sync i stream with store
mf ;;
// fence
Summary of Contents for ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Page 1: ......
Page 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Page 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Page 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Page 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Page 230: ......
Page 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Page 380: ...2 132 Volume 2 Part 1 Interruptions ...
Page 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Page 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Page 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Page 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Page 808: ...2 560 Volume 2 Part 2 Context Management ...
Page 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Page 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Page 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Page 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Page 891: ......
Page 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Page 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Page 1296: ......
Page 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Page 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Page 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Page 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Page 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...