4:468
Volume 4: IA-32 SSE Instruction Reference
4.6.1.3
Compare Instructions
The CMPPS (Compare packed single-precision floating-point) instruction compares four
pairs of packed single-precision floating-point numbers using the immediate operand as
a predicate, returning per SP field an all “1” 32-bit mask or an all “0” 32-bit mask as a
result. The instruction supports a full set of 12 conditions: equal, less than, less than
equal, greater than, greater than or equal, unordered, not equal, not less than, not less
than or equal, not greater than, not greater than or equal, ordered.
The CMPSS (Compare scalar single-precision floating-point) instruction compares the
least significant pairs of packed single-precision floating-point numbers using the
immediate operand as a predicate (same as CMPPS), returning per SP field an all “1”
32-bit mask or an all “0” 32-bit mask as a result.
The COMISS (Compare scalar single-precision floating-point ordered and set EFLAGS)
instruction compares the least significant pairs of packed single-precision floating-point
numbers and sets the ZF,PF,CF bits in the EFLAGS register (the OF, SF and AF bits are
cleared).
The UCOMISS (Unordered compare scalar single-precision floating-point ordered and
set EFLAGS) instruction compares the least significant pairs of packed single-precision
floating-point numbers and sets the ZF,PF,CF bits in the EFLAGS register as described
above (the OF, SF and AF bits are cleared).
4.6.1.4
Shuffle Instructions
The SHUFPS (Shuffle packed single-precision floating-point) instruction is able to
shuffle any of the packed four single-precision floating-point numbers from one source
operand to the lower two destination fields; the upper two destination fields are
generated from a shuffle of any of the four SP FP numbers from the second source
operand (
). By using the same register for both sources, SHUFPS can return
any combination of the four SP FP numbers from this register.
The UNPCKHPS (Unpacked high packed single-precision floating-point) instruction
performs an interleaved unpack of the high-order data elements of first and second
packed single-precision floating-point operands. It ignores the lower half part of the
Figure 4-5.
Packed Shuffle Operation
X4
X3
X2
X1
Y4
Y3
Y2
Y1
{Y4 ... Y1}
{Y4 ... Y1}
{X4 ... X1}
{X4 ... X1}
Summary of Contents for ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Page 1: ......
Page 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Page 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Page 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Page 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Page 230: ......
Page 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Page 380: ...2 132 Volume 2 Part 1 Interruptions ...
Page 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Page 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Page 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Page 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Page 808: ...2 560 Volume 2 Part 2 Context Management ...
Page 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Page 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Page 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Page 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Page 891: ......
Page 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Page 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Page 1296: ......
Page 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Page 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Page 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Page 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Page 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Page 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...