Revision 1.0
Performance Tips
133
code which decides which attributes are necessary, we always compute
them all and only output the ones we are interested in.
This approach also saves precious IMEM space.
Profiling RSP Code
The RSP simulator can help profile your code, it can show pipeline stalls,
load delays, and DMA wait states. The RSP clock (CLK) of the simulator is
always available as a register.
Note:
Although it is accurate within a few percent, the RSP simulator is
not cycle accurate with the actual hardware. The differences are mainly
in VU loads and moves.
It is also useful to use the RDP Command Counter to profile code on the
actual hardware. This value can be sampled, saved to DMEM or DMA’d to
DRAM for later analysis. A sample code fragment to read and store the RDP
Command Counter is shown in Figure 6-1, “Real-time Clock Watching on
the RSP,” on page 134.
Содержание Ultra64
Страница 2: ...2 ...
Страница 10: ...10 ...
Страница 12: ...12 Figure 6 2 buildtask Operation 137 ...
Страница 14: ...14 ...
Страница 80: ...80 Vector Unit Instructions vmadm dres_int dres_int vconst 3 vmadn dres_frac vconst vconst 0 ...
Страница 104: ...104 RSP Coprocessor 0 ...
Страница 150: ...150 Advanced Information ...
Страница 155: ...Revision 1 0 155 ...
Страница 248: ...248 Exceptions None ...
Страница 251: ...Revision 1 0 251 Exceptions None ...
Страница 254: ...254 Exceptions None ...
Страница 257: ...Revision 1 0 257 Exceptions None ...
Страница 293: ...Revision 1 0 293 Exceptions None ...
Страница 316: ...316 Exceptions None ...