xii
Use 64-Bit Registers Instead of Two 32-Bit Registers for 64-Bit Arithmetic ..................... 8-4
Use 32-Bit Versions of CVTSI2SS and CVTSI2SD When Possible ................................. 8-6
Using Software Prefetch................................................................................................... 8-6
Power Optimization for Mobile Usages
Overview ................................................................................................................................. 9-1
Mobile Usage Scenarios ......................................................................................................... 9-2
ACPI C-States ......................................................................................................................... 9-4
Adjust Performance to Meet Quality of Features ............................................................. 9-8
Reducing Amount of Work................................................................................................ 9-9
Platform-Level Optimizations.......................................................................................... 9-10
Handling Sleep State Transitions ................................................................................... 9-11
Using Enhanced Intel SpeedStep
®
Technology ............................................................. 9-12
Enhanced Deeper Sleep ....................................................................... 9-14
Technology .................................................................. 9-15
Thread Migration Considerations.............................................................................. 9-16
Multi-core Considerations for C-States ..................................................................... 9-17
Appendix A Application Performance Tools
Compilers ..................................................................................................................... A-2
Targeting a Processor (-Gn) ...................................................................................... A-3
Automatic Processor Dispatch Support (-Qx[extensions] and -Qax[extensions])...... A-4
Loop Unrolling............................................................................................................ A-5
Multithreading with OpenMP* .................................................................................... A-6
Rounding Control Option (-Qrcd) .................................................................................... A-6
Interprocedural and Profile-Guided Optimizations .......................................................... A-7
VTune™ Performance Analyzer................................................................................... A-8
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...