2–14
Internal Architecture
29 September 1997 – Subject To Change
Pipeline Organization
Figure 2–2 Instruction Pipeline Stages
HLO019B
IC
IC
0
0
IB
IB
SL
SL
1
1
2
2
AC
AC
3
3
4
4
5
5
6
6
7
8
7
IC
0
IB
SL
1
2
AC
3
4
5
6
Integer
Operate
Pipeline
Floating-
Point
Pipeline
Memory
Reference
Pipeline
Instruction Cache Read
Instruction Buffer, Branch Decode,
Determine Next PC
Slot by Function Unit
Register File Access Checks,
Integer Register File Access
First Integer
Operate Stage
If Needed, Second Integer
Operate Stage
Write Integer Register File
Arithmetic, logical, shift, and compare
instructions complete in pipeline stage 4
(1-cycle latency). CMOV completes in
stage 5 (2-cycle latency). IMULL has
an 8-cycle or 9-cycle latency. CMOV
or BR can issue in parallel (0-cycle
latency) with a dependent CMP
instruction.
Floating-Point Register
File Access
First Floating-Point
Operate Stage
Write Floating-Point Register File,
Last Floating-Point Operate Stage
Dcache Read Begins
Dcache Read Ends
Use Dcache Data, Store Writes Dcache
Bcache Tag/Data Access Begins
Bcache Tag Access Ends, 1st Datum Returned
Fill Dcache/Icache (1st OW)
Use Bcache Data
11
9
10
Bcache Read Latency
(5-20 CPU cycles)
10
9
Bcache Cycle Time
(2-10 CPU cycles)
2nd Datum Returned
Fill Dcache/Icache (2nd OW)
. . .