Overview of ’C64x Architectural Enhancements
8-3
’C64x Programming Considerations
erage. New packed multiply instructions provide support for both standard
multiplies, as well as rounded multiplies and dot products. With packed data
types, a single instruction can operate on two 16-bit quantities or four 8-bit
quantities at once.
8.1.4
Non-aligned Memory Accesses
In order to capitalize on its memory and processing bandwidth, the ’C64x pro-
vides support for non-aligned memory accesses. Non-aligned memory ac-
cesses provide a method for accessing packed data types without the restric-
tions imposed by 32-bit or 64-bit alignment boundaries. The ’C64x can access
up to 64 bits per cycle at any byte boundary with non-aligned load and store
instructions (LDNW, LDNDW, STNW, and STNDW).
8.1.5
Additional Specialized Instructions
The ’C64x also provides a number of new bit-manipulation and other special-
ized instructions for improving performance on bit-oriented algorithms. These
instructions are designed to improve performance on error correction, encryp-
tion, and other bit-level algorithms. Instructions in this category include BITC4,
BITR, ROTL, SHFL, and DEAL. See the
TMS320C6000 CPU and Instruction
Set User’s Guide for more details on these and related instructions.