![Intel ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3 Manual Download Page 172](http://html.mh-extra.com/html/intel/itanium-architecture-software-developers-volume-3-rev-2-3/itanium-architecture-software-developers-volume-3-rev-2-3_manual_2073404172.webp)
Volume 1, Part 2: Memory Reference
1:161
3.5.6
Minimizing Check Code
Checks of speculative loads can sometimes be combined to reduce code size. The
propagation of NaT bits and NaTVals via speculative instructions can permit a single
check of a speculative result to replace multiple intermediate checks. The code below
demonstrates this optimization potential:
ld4.s
r1=[r10]
// Speculatively load to r1
ld4.s
r2=[r20]
// Speculatively load to r2
add
r3=r1,r2;; // Add two speculative values
// Other instructions
chk.s
r3,
imm
21
// Check for NaT bit in r3
st4
[r30]=r1
// Store r1
st4
[r40]=r2
// Store r2
st4
[r50]=r3
// Store r3
Only the result register,
r3
, needs to be checked before stores of any of
r1
,
r2
, or
r3
.
If a NaT bit were set at the time of the control speculative loads of
r1
or
r2
, the NaT bit
would have been propagated to
r3
from
r1
or
r2
via the
add
instruction.
Another way to reduce the amount of check code is to use control flow analysis to avoid
issuing extra
ld.c
or
ld.a
instructions. For example, the compiler can schedule a
single check where it is known to be reached by all copies of the advanced load. The
portion of a flow graph shown in
demonstrates where this technique might
be applied.
A single check in the lowermost block shown for all of the advanced loads is correct if
both of these conditions are met:
• The lowermost block post-dominates all of the blocks with advanced loads from
location
addr.
• The lowermost block precedes any uses of the advanced loads from
addr.
Figure 3-4.
Using a Single Check for Three Advanced Loads
ld.a
ld.a
ld.a
*p1 =
*p2 =
*p3 =
ld.c
Advanced loads from
addr
to the same register,
R
Stores
Single load check of
register
R
Summary of Contents for ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Page 1: ......
Page 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Page 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Page 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Page 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Page 230: ......
Page 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Page 380: ...2 132 Volume 2 Part 1 Interruptions ...
Page 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Page 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Page 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Page 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Page 808: ...2 560 Volume 2 Part 2 Context Management ...
Page 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Page 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Page 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Page 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Page 891: ......
Page 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Page 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Page 1296: ......
Page 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Page 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Page 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Page 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Page 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...