
1:18
Volume 1, Part 1: Introduction to the Intel
®
Itanium
®
Architecture
if (p5) r1 = r2 + r3
In this example
p5
is the controlling predicate that decides whether or not the
instruction executes and updates state. If the predicate value is true, then the
instruction updates state. Otherwise it generally behaves like a
nop
. Predicates are
assigned values by compare instructions.
Predicated execution avoids branches, and simplifies compiler optimizations by
converting a control dependency to a data dependency. Consider the original code:
if (a>b) c = c + 1
else d = d * e + f
The branch at
(a>b)
can be avoided by converting the code above to the predicated
code:
pT, pF = compare(a>b)
if (pT) c = c + 1
if (pF) d = d * e + f
The predicate
pT
is set to 1 if the condition evaluates to true, and to 0 if the condition
evaluates to false. The predicate
pF
is the complement of
pT
. The control dependency of
the instructions
c = c + 1
and
d = d * e + f
on the branch with the condition
(a>b)
is now converted into a data dependency on
compare(a>b)
through predicates
pT
and
pF
(the branch is eliminated). An added benefit is that the compiler can schedule the
instructions under
pT
and
pF
to execute in parallel. It is also worth noting that there are
several different types of compare instructions that write predicates in different
manners including unconditional compares and parallel compares.
2.7
Register Stack
The Itanium architecture avoids the unnecessary spilling and filling of registers at
procedure call and return interfaces through compiler-controlled renaming. At a call
site, a new frame of registers is available to the called procedure without the need for
register spill and fill (either by the caller or by the callee). Register access occurs by
renaming the virtual register identifiers in the instructions through a base register into
the physical registers. The callee can freely use available registers without having to
spill and eventually restore the caller’s registers. The callee executes an
alloc
instruction specifying the number of registers it expects to use in order to ensure that
enough registers are available. If sufficient registers are not available (stack overflow),
the
alloc
stalls the processor and spills the caller’s registers until the requested
number of registers are available.
At the return site, the base register is restored to the value that the caller was using to
access registers prior to the call. Some of the caller’s registers may have been spilled
by the hardware and not yet restored. In this case (stack underflow), the return stalls
the processor until the processor has restored an appropriate number of the caller’s
registers. The hardware can exploit the explicit register stack frame information to spill
and fill registers from the register stack to memory at the best opportunity
(independent of the calling and called procedures).
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...