Volume 2, Part 2: Performance Monitoring Support
2:619
Performance Monitoring Support
12
Processors based on the Itanium architecture include a minimum of four performance
counters which can be programmed to count processor events. These event counts can
be used to analyze both hardware and software performance. Performance counters
can be configured to generate a counter overflow interrupt. This interrupt can be used
for event- or time-based profiling. For hot-spot analysis of running code, performance
monitor interrupts can be used to create a profile of frequently occurring instruction
pointers (IP). Another common use of event counts is to compute processor
performance metrics such as cycles per instructions (CPI), the current branch, cache or
TLB miss rates, etc.
The Itanium architecture provides architected support for context switching of
performance monitors by an Itanium architecture-based operating system. If supported
by the operating system, this allows performance counter events to be broken down
per thread or per process which is important for effective performance tuning of
Itanium architecture-based applications.
The remainder of this chapter reviews the architected performance monitoring
mechanisms. It also discusses the Itanium architecture-based operating system
support needed for two monitoring usage models: per process/thread and system-wide
event monitoring.
12.1
Architected Performance Monitoring Mechanisms
As defined in
Section 7.2, “Performance Monitoring” on page 2:155
, processors based
on the Itanium architecture provide a minimum of four generic performance counter
pairs (PMC/PMD[4..7]). The performance monitor control (PMC) registers are used to
select the event to be counted, and to define under what conditions the event should
qualify for being counted (for details refer to
Section 7.2.1, “Generic Performance
Counter Registers” on page 2:156
). The performance monitor data (PMD) registers
contain the event count or data.
The PMC/PMD registers can only be written by privileged software (PSR.cpl must be
zero). A counter can be configured as a “privileged” counter or a “user-level” counter by
setting of the PMC[i].pm bit. Privileged counters can only read at privilege level 0, while
user-level counters can by read by user mode code (unless the operating system has
explicitly disabled the user-level monitor reads using PSR.sp).
Once the PMC/PMD registers have been configured, counting is enabled and disabled by
setting bits in the PSR. User-level counters can be controlled at user-level using the
rum and sum instructions to toggle PSR.up. Privileged counters are controlled by
privileged software using the rsm, ssm, mov from/to PSR instructions to toggle PSR.pp.
Counting for all counters is further controlled by the PMC[0] freeze bit. When PMC[0].fr
is 0, all counters are disabled. When PMC[0].fr is 1, counting is enabled based on
PMC[i].pm, PSR.pp and PSR.up. For more details on controlling of the performance
monitors please refer to
Section 7.2.1, “Generic Performance Counter Registers” on
.
Содержание ITANIUM ARCHITECTURE - SOFTWARE DEVELOPERS VOLUME 3 REV 2.3
Страница 1: ......
Страница 11: ...x Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 12: ...1 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I Application Architecture Guide ...
Страница 13: ...1 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 33: ...1 22 Volume 1 Part 1 Introduction to the Intel Itanium Architecture ...
Страница 57: ...1 46 Volume 1 Part 1 Execution Environment ...
Страница 147: ...1 136 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 149: ...1 138 Volume 1 Part 2 About the Optimization Guide ...
Страница 191: ...1 180 Volume 1 Part 2 Predication Control Flow and Instruction Stream ...
Страница 230: ......
Страница 248: ...236 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 249: ...2 1 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part I System Architecture Guide ...
Страница 250: ...2 2 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 264: ...2 16 Volume 2 Part 1 Intel Itanium System Environment ...
Страница 380: ...2 132 Volume 2 Part 1 Interruptions ...
Страница 398: ...2 150 Volume 2 Part 1 Register Stack Engine ...
Страница 486: ...2 238 Volume 2 Part 1 IA 32 Interruption Vector Descriptions ...
Страница 749: ...2 501 Intel Itanium Architecture Software Developer s Manual Rev 2 3 Part II System Programmer s Guide ...
Страница 750: ...2 502 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 754: ...2 506 Volume 2 Part 2 About the System Programmer s Guide ...
Страница 796: ...2 548 Volume 2 Part 2 Interruptions and Serialization ...
Страница 808: ...2 560 Volume 2 Part 2 Context Management ...
Страница 842: ...2 594 Volume 2 Part 2 Floating point System Software ...
Страница 850: ...2 602 Volume 2 Part 2 IA 32 Application Support ...
Страница 862: ...2 614 Volume 2 Part 2 External Interrupt Architecture ...
Страница 870: ...2 622 Volume 2 Part 2 Performance Monitoring Support ...
Страница 891: ......
Страница 941: ...3 42 Volume 3 Instruction Reference cmp illegal_operation_fault PR p1 0 PR p2 0 Interruptions Illegal Operation fault ...
Страница 1099: ...3 200 Volume 3 Instruction Reference padd Interruptions Illegal Operation fault ...
Страница 1191: ...3 292 Volume 3 Pseudo Code Functions Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1295: ...3 396 Volume 3 Resource and Dependency Semantics ...
Страница 1296: ......
Страница 1302: ...402 Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1494: ...4 192 Volume 4 Base IA 32 Instruction Reference FWAIT Wait See entry for WAIT ...
Страница 1564: ...4 262 Volume 4 Base IA 32 Instruction Reference LES Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1565: ...Volume 4 Base IA 32 Instruction Reference 4 263 LFS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1568: ...4 266 Volume 4 Base IA 32 Instruction Reference LGS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1583: ...Volume 4 Base IA 32 Instruction Reference 4 281 LSS Load Full Pointer See entry for LDS LES LFS LGS LSS ...
Страница 1647: ...Volume 4 Base IA 32 Instruction Reference 4 345 ROL ROR Rotate See entry for RCL RCR ROL ROR ...
Страница 1663: ...Volume 4 Base IA 32 Instruction Reference 4 361 SHL SHR Shift Instructions See entry for SAL SAR SHL SHR ...
Страница 1668: ...4 366 Volume 4 Base IA 32 Instruction Reference SIDT Store Interrupt Descriptor Table Register See entry for SGDT SIDT ...
Страница 1884: ...4 582 Volume 4 IA 32 SSE Instruction Reference ...
Страница 1885: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 Index ...
Страница 1886: ...Index Intel Itanium Architecture Software Developer s Manual Rev 2 3 ...
Страница 1898: ...INDEX Index 12 Index for Volumes 1 2 3 and 4 ...