data:image/s3,"s3://crabby-images/198cf/198cf1ff856de9f6e91c5c656b580662f6e05c01" alt="AMD Athlon Processor x86 Скачать руководство пользователя страница 118"
102
Minimize Floating-Point-to-Integer Conversions
AMD Athlon™ Processor x86 Code Optimization
22007E/0—November 1999
Example 3 (Potentially faster):
MOV ECX, DWORD PTR[X+4]
;get upper 32 bits of double
XOR EDX, EDX
;i = 0
MOV EAX, ECX
;save sign bit
AND ECX, 07FF00000h
;isolate exponent field
CMP ECX, 03FF00000h
;if abs(x) < 1.0
JB $DONE2
; then i = 0
MOV EDX, DWORD PTR[X]
;get lower 32 bits of double
SHR ECX, 20
;extract exponent
SHRD EDX, EAX, 21
;extract mantissa
NEG ECX
;compute shift factor for extracting
ADD ECX, 1054
;non-fractional mantissa bits
OR EDX, 080000000h
;set integer bit of mantissa
SAR EAX, 31
;x < 0 ? 0xffffffff : 0
SHR EDX, CL
;i = trunc(abs(x))
XOR EDX, EAX
;i = x < 0 ? ~i : i
SUB EDX, EAX
;i = x < 0 ? -i : i
$DONE2:
MOV [I], EDX
;store result
For applications which can tolerate a floating-point-to-integer
conversion that is not compliant with existing programming
language standards (but is IEEE-754 compliant), perform the
conversion using the rounding mode that is currently in effect
(usually round-to-nearest-even).
Example 4 (Fastest):
FLD QWORD PTR [X] ; get double to be converted
FISTP DWORD PTR [I] ; store integer result
Some compilers offer an option to use the code from example 4
for floating-point-to-integer conversion, using the default
rounding mode.
Lastly, consider setting the rounding mode throughout an
application to truncate and using the code from example 4 to
perform extremely fast conversions that are compliant with
language standards and IEEE-754. This mode is also provided
as an option by some compilers. Note that use of this technique
also changes the rounding mode for all other FPU operations
inside the application, which can lead to significant changes in
numerical results and even program failure (for example, due to
lack of convergence in iterative algorithms).
Содержание Athlon Processor x86
Страница 1: ...AMD Athlon Processor x86 Code Optimization Guide TM...
Страница 12: ...xii List of Figures AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 16: ...xvi Revision History AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 60: ...44 Code Padding Using Neutral Code Fillers AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 92: ...76 Push Memory Data Carefully AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 122: ...106 Take Advantage of the FSINCOS Instruction AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 156: ...140 AMD Athlon Processor Microarchitecture AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 176: ...160 Write Combining Operations AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 202: ...186 Page Attribute Table PAT AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 252: ...236 VectorPath Instructions AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...
Страница 256: ...240 Index AMD Athlon Processor x86 Code Optimization 22007E 0 November 1999...