General Optimization Guidelines
* Streaming SIMD Extensions (SSE)
** Streaming SIMD Extensions 2 (SSE2)
General Practices and Coding Guidelines
This section discusses guidelines derived from the performance factors
listed in the “Tuning to Achieve Optimum Performance” section. It also
highlights practices that use performance tools.
The majority of these guidelines benefit processors based on the Intel
NetBurst microarchitecture and the Pentium M processor
microarchitecture. Some guidelines benefit one microarchitecture more
than the other. As a whole, these coding rules enable software to be
optimized for the common performance features of the Intel NetBurst
microarchitecture and the Pentium M processor microarchitecture.
Under each heading, the recommended coding practices and the bullets that
accompany them are listed in order of importance.
Table 2-1 Coding Pitfalls Affecting Performance (continued)

Factors Affecting Performance    Symptom                                  Example (if applicable)    Section Reference
Cache line splits                Access across cache line boundary
Denormal inputs and outputs      Slows x87, SSE*, SSE2** floating-point
                                 operations
Cycling more than 2 values of    fldcw not optimized
Floating-point Control Word
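Two of the pitfalls in Table 2-1 can be checked for in software. The C sketch below is illustrative only and is not taken from the manual: the helper names crosses_cache_line and is_denormal are hypothetical, and the 64-byte line size is an assumption that should be adjusted for the target processor. The first helper reports whether a memory access touches more than one cache line; the second reports whether a double-precision value is denormal, using only comparisons against DBL_MIN.

```c
#include <float.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption: 64-byte cache lines. Table 2-1 does not state a line size;
   change this constant to match the target processor. */
#define CACHE_LINE_SIZE 64u

/* Returns 1 if an access of `size` bytes starting at address `addr`
   touches more than one cache line, 0 otherwise.
   A zero-length access never splits. */
int crosses_cache_line(uintptr_t addr, size_t size)
{
    if (size == 0) {
        return 0;
    }
    /* Index of the line holding the first byte and the last byte. */
    uintptr_t first_line = addr / CACHE_LINE_SIZE;
    uintptr_t last_line = (addr + (size - 1)) / CACHE_LINE_SIZE;
    return first_line != last_line;
}

/* Returns 1 if `x` is a denormal (subnormal) double: nonzero, but smaller
   in magnitude than DBL_MIN, the smallest normalized double.
   NaN and infinities return 0 because every comparison below fails. */
int is_denormal(double x)
{
    return (x > 0.0 && x < DBL_MIN) || (x < 0.0 && x > -DBL_MIN);
}
```

Under the 64-byte assumption, an 8-byte access at address 60 covers bytes 60 through 67, so crosses_cache_line reports a split, while the same access at address 56 stays within one line. A check such as is_denormal is best suited to debugging or validating input data, since testing every operand in a hot loop would itself cost time.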
IA-32 Intel Architecture Optimization Reference Manual, Order Number 248966-013US, April 2006