Teledyne SP Devices ADQ7 User Manual Download Page 10

Classification

Revision

Public

PA1

Document ID

Print date

19-2241

2019-05-02

buffer. After a set of buffers are completed user application has to reset data valid buffers by writing an

32-bit variable with value 1 in each position.

4.7

User Logic 2 Considerations

Metadata insertion and sample interleaving for two-channel mode is done in UL2. If UL2 is bypassed

GPU streaming will not work, writing to UL2 registers may alter functionality. If UL2 is modified with the

development kit, you have to make sure that metadata insertion still behaves correctly.

5

Example Code

The ADQ7 GPU P2P function is delivered with two examples, one for Nvidia GPU’s under Linux and

another for AMD GPU’s under Windows.

1

5.1

Signal Connections

The example code can collect data from one or two analog inputs using the A-trigger and optional B-

trigger as described in Section

3.1

Table

2

shows how the different input signals are labeled on the

ADQ7 backplate.

Table 2: Signal connections for ADQ7 device

Signal

ADQ7 input

Analog channel A

A

Analog channel B

B (optional)

A-trigger

Trig

B-trigger

Sync (optional)

5.2

Nvidia Example

The example for Nvidia GPUs is written using GPUDirect with CUDA and OpenGL.

GPUDirect works by setting up a bus writable buffers in GPU memory. Additionally,

marker

buffers

are setup in host memory used to synchronize writes to the GPU buffers with host program execution.

When a GPU buffer has been completely written by the digitizer it writes an iterator to the associated

marker buffer in host and sends an interrupt. The host program waits for the marker write and enqueues

a CUDA kernel at that time. This sequence is repeated in an alternating pattern for two buffers.

The example uses a CUDA kernel which computes an FFT for each record in the buffer, and draws

a point to an OpenGL buffer indicating the peak frequency of the FFT. Fig.

5

shows a screen capture of

the CUDA example.

1

Nvidia and AMD are protected trademarks of their respective owners.

ADQ7 GPU Peer-To-Peer – User Guide

www.teledyne-spdevices.com

Page 9 of 11

Summary of Contents for ADQ7

Page 1: ...ADQ7 GPU Peer To Peer User Guide Author s Teledyne SP Devices Document ID 19 2241 Classification Public Revision PA1 Print date 2019 05 02...

Page 2: ...a Valid 7 4 3 Set up P2P GPU with SetupDMAP2p2D 7 4 3 1 Nvidia 7 4 3 2 AMD 8 4 4 Wait for a Completed Buffer 8 4 4 1 Nvidia 8 4 4 2 AMD 8 4 5 Detect and Handle Overflows 8 4 6 Process Received Data an...

Page 3: ...sing unit Graphics card OCT Optical coherence tomography P2P Peer to peer UL2 User logic 2 open FPGA area in the ADQ7 firmware 2 Prerequisites Hardware ADQ7 digitizer Peer to peer capable GPU Windows...

Page 4: ...he right hand side of the data buffer the data valid buffer is shown containing the number one for each valid line A A A A A A A A B B A A A A A A A A B B A A A A A A A A B B 1 1 1 1 1 1 Figure 1 Reco...

Page 5: ...e attached example code 3 3 Backplane Peer To Peer Transfer Data is written from the digitizer directly to the GPU without going through the host CPU or host memory This reduces requirement on the hos...

Page 6: ...05 02 a switch it is not a mandatory function of the PCIe standard Therefore make sure that there is a PCIe switch between the two endpoints or that the root complex supports peer to peer transfer AD...

Page 7: ...be used Initialize GPU driver Allocate and pin buffers in GPU Initialize ADQ Set up triggers Set up P2P GPU with SetupDMAP2p2D Start streaming Wait for a completed buffer Detect and handle overflows P...

Page 8: ...with data processing active 32 128 512 2048 8192 32768 131072 524288 2097152 8388608 5000 5200 5400 5600 5800 6000 6200 6400 6600 6800 7000 7200 1ch 2ch Record length samples MiB s Figure 4 Maximum t...

Page 9: ...AMD Completed buffers are detected by GPU The function clEnqueueWaitSignalAMD is used to make GPU wait until the marker associated with the specified buffer is equal to or greater than the specified v...

Page 10: ...d on the ADQ7 backplate Table 2 Signal connections for ADQ7 device Signal ADQ7 input Analog channel A A Analog channel B B optional A trigger Trig B trigger Sync optional 5 2 Nvidia Example The exampl...

Page 11: ...el module for GPUDirect make 6 Load kernel module sudo insmod sh 7 Go to directory ADQ7_GPUDirect_example source 8 Build example make 9 Run example cuda_example Once the example is running press h for...

Page 12: ...OpenCL queue which blocks the queue until a marker write is detected At that time a kernel enqueued after the marker wait can run This sequence is repeated in an alternating pattern for two buffers T...

Page 13: ...ngen 6 SE 583 30 Link ping Sweden Phone 46 0 13 645 0600 Fax 46 0 13 991 3044 Email spd_info teledyne com Copyright 2019 Teledyne Signal Processing Devices Sweden AB All rights reserved including thos...

Reviews: