IA-32 Instruction Latency and Throughput
C
C-9
PSUBB/PSUBW/PSUBD
xmm, xmm
2
2
1
2
2
1
MMX_ALU
PSUBSB/PSUBSW/PSUBU
SB/PSUBUSW xmm, xmm
2
2
1
2
2
1
MMX_ALU
PUNPCKHBW/PUNPCKH
WD/PUNPCKHDQ xmm,
xmm
4
4
1+1
2
2
2
MMX_SHFT
PUNPCKHQDQ xmm, xmm
4
4
1_1
2
2
2
MMX_SHFT
PUNPCKLBW/PUNPCKLW
D/PUNPCKLDQ xmm, xmm
2
2
2
2
2
2
MMX_SHFT
PUNPCKLQDQ
3
xmm,
xmm
4
4
1
1
1
1
FP_MISC
PXOR xmm, xmm
2
2
1
2
2
1
MMX_ALU
Table C-3
Streaming SIMD Extension 2 Double-precision Floating-point
Instructions
Instruction
Latency
1
Throughput
Execution
Unit
2
CPUID
0F3n
0F2n
0x69n
0F3n
0F2n
0x69n 0F2n
ADDPD xmm, xmm
5
4
4
2
2
2
FP_ADD
ADDSD xmm, xmm
5
4
3
2
2
1
FP_ADD
ANDNPD
3
xmm, xmm
4
4
1
2
2
1
MMX_ALU
ANDPD
3
xmm, xmm
4
4
1
2
2
1
MMX_ALU
CMPPD xmm, xmm,
imm8
5
4
4
2
2
2
FP_ADD
CMPSD xmm, xmm,
imm8
5
4
3
2
2
1
FP_ADD
continued
Table C-2
Streaming SIMD Extension 2 128-bit Integer Instructions
(continued)
Instruction
Latency
1
Throughput
Execution
Unit
2
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...