Volume 2, Part 2: MP Coherence and Synchronization
2:527
2.4.1
Spin Lock
Software commonly uses spin locks to guard access to a critical region of code. In these
locks, the software “spins” while waiting for a shared lock variable to indicate that the
critical region can be safely accessed. Typically, the lock code uses atomic operations
such as compare and exchange or fetch and add to update the shared lock variable.
shows a spin lock based on the
cmpxchg
instruction.
The spin lock code first initializes
ar.ccv
and a register with the values that indicate
that the lock is available and held, respectively. A compare and exchange obtains the
lock by exchanging
lock
with
1
if it currently holds
0
. Next, the first loop ensures that
the code spins in cache while the lock is held by someone else. Once this loop finds that
the lock is available, a compare and exchange instruction attempts to obtain the lock. If
this instruction fails (e.g. because someone else obtained the lock in the meantime),
the code resumes spinning in the first loop.
Spinning using only the
cmpxchg
/
cmp
/
br
loop may generate excessive coherency traffic.
For example, if the
cmpxchg
always stores to memory (even if the comparison fails) and
the lock is highly-contested, the platform may have to generate a number of read for
ownership transactions causing
lock
to move around the system. Using the first
ld8
/
cmp
/
br
loop avoids this problem by obtaining
lock
in a shared state. In the worst
case, when
lock
is not contested, this loop adds only the overhead of the additional
compare and branch.
The initial
ld8
need not be an acquire load because of the control-flow in the spin loop:
this load must become visible before the
cmpxchg8
because the load must return data
in order for the compare and branch to resolve. Further, the store that relinquishes the
lock after the critical section uses release semantics to prevent memory references
from the critical from moving after the reference that releases the lock. Finally, the
branches use “static predict not taken” hints to optimize for the case where the lock is
not highly contested.
Figure 2-4.
Spin Lock Code
// available. If it is 1, another process is in the critical section.
//
spin_lock:
mov
ar.ccv = 0
// cmpxchg looks for avail (0)
mov
r2 = 1
// cmpxchg sets to held (1)
spin:
ld8
r1 = [lock] ;;
// get lock in shared state
cmp.ne
p1, p0 = r1, r2
// is lock held (ie, lock == 1)?
(p1)
br.cond.spnt
spin ;;
// yes, continue spinning
cmpxchg8.acq
r1 = [lock], r2, ar.ccv ;;// attempt to grab lock
cmp.ne
p1, p0 = r1, r2
// was lock empty?
(p1)
br.cond.spnt
spin ;;
// bummer, continue spinning
cs_begin:
// critical section code goes here...
cs_end:
st8.rel
[lock] = r0 ;;
// release the lock
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...