background image

Intel® Xeon Phi™ Coprocessor D

EVELOPER

Q

UICK 

S

TART 

G

UIDE

 

 

16 

 

/opt/intel/vtune_amplifier_xe_2013/documentation/en/tutorials/fin
d_lw_hotspots/C++/index.htm

 

• 

Useful documentation on the Web:  

o

 

On the website 

http://software.intel.com/mic-developer 

 you will find a wide range of 

documentation that can be downloaded, most notably the 

Intel® Xeon Phi™ Coprocessor 

Software Developers Guide  

under 

“PROGRAMMING” 

tab., as well as including the 

System V 

Application Binary Interface K1OM Architecture Processor Supplement,

  

Intel® Xeon Phi™ 

Performance Monitor Units, and 

the 

Intel® Xeon Phi™ Coprocessor Instruction Set Reference 

Manual 

(under 

“OVERVIEW”

 tab) . From this site you will also be able to find a community 

forum to ask questions, links to other available tools, code samples, and case studies. 

o

 

http://software.intel.com/en-us/articles/programming-and-compiling-for-intel-many-
integrated-core-architecture

  contains a wealth of information on compilers.  

• 

Some sample offload code using the explicit memory copy model can be found in: 

o

 

C++:  

/

opt/intel/composerxe/Samples/en_US/C++/mic_samples/intro_sampleC/ 

o

 

Fortran:  

/opt/intel/composerxe/Samples/en_US/Fortran/mic_samples/

 

o

 

Intel® MKL:  

/opt/intel/composerxe/mkl/examples/mic*

 

o

 

For examples of Intel® MKL automated offload: 

/opt/intel/composerxe/mkl/examples/mic_ao/blasc 

and

 …/mic_ao/blasf

 

o

 

The rest of the samples demonstrate use of MKL via compiler-assisted offload 
(

/opt/intel/composerxe/mkl/examples/mic_offload).

 

• 

Some sample offload code using the implicit memory copy model can be found in: 

o

 

C:   

/opt/intel/composerxe/Samples/en_US/C++/mic_samples/shrd_sampleC 

and

 …/LEO_tutorial

 

o

 

C++:   

/opt/intel/composerxe/Samples/en_US/C++/mic_samples/shrd_sampleCP
P

 

Build-Related Information 

• 

The  offload  compiler  produces  “fat”  binaries  and 

.so

  files  that  contain  code  for  both host  and  the 

Intel® Xeon Phi™ Coprocessor. 

• 

The  offload  compiler  produces  code  that  examines  the  runtime  execution  environment  for  the 
presence of an Intel® Xeon Phi™ Coprocessor.  The offload compiler will create both host and Intel® MIC 
Architecture versions of all code marked for offload.

 

• 

A number of workarounds and hints can be found in 

releaseNotes-linux.txt

.

 

 

Compiler Switches and Makefiles 

When building applications that offload some of their code to the Intel® Xeon Phi™ Coprocessor, it is possible to 
cause the offloaded code to be built with different compiler options from the host code.  The method of 
passing these options to the compiler is documented in the compiler documentation under the “Compiler 
Reference/Compiler Options/Compiler Option Categories and Descriptions” section.  Look for the 

–offload-

option

 compiler switch.   In that same section, also look up the 

–offload-attribute-target

 compiler 

Содержание Xeon Phi

Страница 1: ...White Paper Intel Xeon Phi Coprocessor DEVELOPER S QUICK START GUIDE Version 1 7...

Страница 2: ...Intel Xeon Phi Coprocessor If It Hangs 11 Monitoring the Intel Xeon Phi Coprocessor 12 Running an Intel Xeon Phi Coprocessor program from the host system 12 Working directly with the uOS Environment I...

Страница 3: ...Programming on the Intel Xeon Phi Coprocessor OpenMP Intel Cilk Plus Extended Array Notation 23 Parallel Programming on the Intel Xeon Phi Coprocessor Intel Cilk Plus 24 Parallel Programming on Intel...

Страница 4: ...known methods BKMs developed by users at Intel This document does not 1 Cover each tool in detail Please refer to the user guides for the individual tools 2 Provide in depth training Terminology Host...

Страница 5: ...e with the Intel Xeon Phi Coprocessor SCI Symmetric Communications Interface the mechanism for inter node communication within a single platform where an node is a Intel Xeon Phi Coprocessor or an Int...

Страница 6: ...ty such as loading and unloading executables onto the Intel Xeon Phi Coprocessor invoking functions from the executables on the card and providing a two way notification mechanism between host and car...

Страница 7: ...U is a key feature of the Intel MIC Architecture based cores Fully utilizing the vector unit is critical for best Intel Xeon Phi Coprocessor performance It is important to note that Intel MIC Architec...

Страница 8: ...rnel 2 6 32 431 SUSE Linux Enterprise Server SLES 11 SP2 kernel 3 0 13 0 27 default or SUSE Linux Enterprise Server SLES 11 SP3 kernel 3 0 76 0 11 default Section 2 1 in readme txt Be sure to install...

Страница 9: ...If you acquired a serial number for Intel tools go to the Intel Registration Center IRC at http registrationcenter intel com to register and download the products Click the button Register Product wil...

Страница 10: ...MPSS gets started it loads the data collection driver automatically But for some reason if it fails to load the data collection driver you can manually load the driver by going to opt intel vtune_amp...

Страница 11: ...Access to the Intel Xeon Phi Coprocessor after Reboot The Intel Xeon Phi Coprocessor will not start when the host system reboots You will need to manually start the Intel Xeon Phi Coprocessor and then...

Страница 12: ...o Monitoring the Intel Xeon Phi Coprocessor If you want to monitor the load on your coprocessor its temperature etc run the System Management and Configuration SMC utility See section 8 3 of the MPSS...

Страница 13: ...is directory to their default path micinfo provides information about host and coprocessor system configuration micflash updates the flash on the coprocessor saves and retrieves the version and other...

Страница 14: ...l MIC Architecture o Intel Threading Building Blocks Intel TBB o Intel Integrated Performance Primitive Intel IPP Libraries packaged separately include o Intel MPI for Linux OS including Intel Many In...

Страница 15: ...cture Information on Intel MIC Architecture intrinsics can be found in the Compiler Reference Intrinsics section under Intrinsics for Intel MIC Architecture o Release_notes_ _2013SP1_l_en pdf please r...

Страница 16: ...ad opt intel composerxe mkl examples mic_ao blasc and mic_ao blasf o The rest of the samples demonstrate use of MKL via compiler assisted offload opt intel composerxe mkl examples mic_offload Some sam...

Страница 17: ...You can visit the Forum on the Intel Xeon Phi Coprocessor to post questions It can be found at the http software intel com en us forums intel many integrated core Using the Offload Compiler Explicit M...

Страница 18: ...e programmer uses pragma offload target mic as shown in the example below to mark statements offload constructs that should execute on the Intel Xeon Phi Coprocessor The offloaded region is defined as...

Страница 19: ...ode Example 3 Vector Reduction with Offload in C C Asynchronous Offload and Data Transfer Asynchronous offload and data transfer between the host and the Intel Xeon Phi Coprocessor is available For de...

Страница 20: ...hared memory there is no hardware that maps some portion of the memory on the Intel Xeon Phi Coprocessor to the host system The memory subsystems on the coprocessor and host are completely independent...

Страница 21: ...d Reference Guides The section Restrictions on Offload Using Shared Virtual Memory in the document Intel C Compiler User and Reference Guide shows some restrictions of using this programming model Nat...

Страница 22: ...no unusual complications beyond the larger number of threads Parallel Programming on the Intel Xeon Phi Coprocessor OpenMP There is no correspondence between OpenMP threads on the host CPU and on the...

Страница 23: ...et mic in size in data length size omp parallel do reduction ret do i 1 size ret ret data i enddo omp end parallel do FTNReductionOMP ret return end function FTNReductionOMP Code Example 6 Fortran Usi...

Страница 24: ...available to an application built for the Intel MIC Architecture using Intel Cilk Plus wrap the header files with pragma offload_attribute push target mic and pragma offload_attribute pop as follows p...

Страница 25: ...d global data required on the Intel Xeon Phi Coprocessor should be appended by the special function attribute __attribute__ target mic As an example parallel_reduce recursively splits an array into su...

Страница 26: ...CReductionTBB data size return ret Code Example 13 Offloading Intel TBB Code to the coprocessor in C C NOTE Codes using Intel TBB with an offload should be compiled with tbb flag instead of ltbb Using...

Страница 27: ...offload target mic PHI_DEV in A length matrix_elements free_if 0 in B length matrix_elements free_if 0 in C length matrix_elements free_if 0 Code Example 14 Sending the Data to the Intel Xeon Phi Cop...

Страница 28: ...a A N B N beta C N Code Example 17 Controlling Threads on the Intel Xeon Phi Coprocessor Using omp_set_num_threads Intel MKL Automatic Offload Model A few of the host Intel MKL functions are Automatic...

Страница 29: ...ugging Intel MIC Architecture applications under the Debugging with the Intel Debugger on Eclipse and Debugging on the Command Line sections of opt intel composerxe Documentation en_US debugger debugg...

Страница 30: ...efore joining Intel Charles was a consulting software engineer for Oracle Corporation where he concentrated on parallelism and 64 bit support in Windows NT and OpenVMS versions of the Oracle RDBMS on...

Страница 31: ...SUCH MISSION CRITICAL APPLICATION WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN MANUFACTURE OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS Intel may make changes to specif...

Страница 32: ...ions include SSE2 SSE3 and SSE3 instruction sets and other optimizations Intel does not guarantee the availability functionality or effectiveness of any optimization on microprocessors not manufacture...

Отзывы: