Teledyne SP Devices ADQ7 User Manual Download Page 4

Classification: Public

Revision: PA1

Document ID: 19-2241

Print date: 2019-05-02

3.1 Trigger and Data Alignment

A data collection is specified by a record length M and a line length, where each line consists of N records. The A-scan trigger starts the collection of a record. Records are written consecutively, directly to GPU memory. In the normal case a line therefore takes up M × N samples of memory. When a line is full, records continue on the next line, immediately after the last record.
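To make the layout concrete, the following Python sketch computes where a record starts in the data buffer. The function name `record_offset` and the zero-based line and record indices are assumptions made for illustration and are not part of the ADQ7 API; offsets are counted in samples, not bytes.

```python
def record_offset(line: int, record: int, M: int, N: int) -> int:
    """Return the sample offset of a record in the data buffer.

    line   -- zero-based line index (assumed convention)
    record -- zero-based record index within the line, 0 <= record < N
    M      -- record length in samples
    N      -- line length in records
    """
    if line < 0:
        raise ValueError("line must be non-negative")
    if not 0 <= record < N:
        raise ValueError("record must be in the range [0, N)")
    # Each line occupies M * N samples; records are packed back to back.
    return (line * N + record) * M


# Example: M = 1024 samples per record, N = 4 records per line.
print(record_offset(0, 0, M=1024, N=4))  # first record of the first line
print(record_offset(1, 0, M=1024, N=4))  # first record of the second line
```

Because a full line is followed directly by the next one, the first record of line 1 starts exactly M × N samples into the buffer.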

A B-scan trigger indicates the start of a new line. When a B-scan trigger is detected, the next record is written at the start of the next line. If a B-scan trigger arrives before N records have been written, one or more records are missing. In this case the line is marked as invalid, but the next line is still properly aligned in memory, as if the previous line had been fully written.
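The alignment rule can be illustrated with a small host-side simulation. This is a simplified model of the behavior described above, not the firmware's implementation. The function `place_records`, the representation of the trigger stream as a string of `"A"` and `"B"` characters, and the assumption that collection starts at line 0 are all hypothetical.

```python
def place_records(triggers: str, N: int):
    """Simulate record placement for a stream of A and B triggers.

    Returns (placements, invalid_lines), where placements holds one
    (line, slot) pair per A-trigger and invalid_lines is the set of
    lines that received a B-trigger before N records were written.
    """
    line, slot = 0, 0  # assumed starting position: line 0, first slot
    placements = []
    invalid_lines = set()
    for trig in triggers:
        if trig == "A":
            if slot == N:
                # Line is full and no B-trigger arrived: continue on the next line.
                line, slot = line + 1, 0
            placements.append((line, slot))
            slot += 1
        elif trig == "B":
            if slot < N:
                # B-trigger arrived early: records are missing in this line.
                invalid_lines.add(line)
            # The next record goes to the start of the next line.
            line, slot = line + 1, 0
        else:
            raise ValueError(f"unknown trigger {trig!r}")
    return placements, invalid_lines


# Second line receives only two of four records before its B-trigger.
placements, invalid = place_records("AAAAB" + "AAB" + "AAAAB", N=4)
print(placements[6], invalid)
```

In this run the record after the early B-trigger lands at slot 0 of line 2, so the third line is aligned exactly as it would have been if line 1 had been complete.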

Fig. 1 shows an example of the memory layout after a successful data collection. Each record is represented by a dash, and the corresponding A- and B-triggers are labeled. In this example the line length N is four, meaning that four A-triggers are followed by one B-trigger. To the right of the data buffer, the data valid buffer is shown, containing the number one for each valid line.

Figure 1: Records in GPU memory with labels indicating triggers. The data valid buffer indicates that all lines are valid.

Information about invalid lines is written as metadata to a separate part of the GPU memory. Fig. 2 shows the GPU memory with several A-scans missing: the B-scan trigger has moved the following record to the next line, and the data valid buffer indicates that the second line is not valid. The digitizer writes a zero to the buffer only when an invalid line is encountered; the buffer is expected to be filled with ones initially.
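A consumer of the data can use the data valid buffer to skip invalid lines. The sketch below uses plain Python lists to stand in for the data buffer and the data valid buffer; the function `valid_lines` and the buffer names are hypothetical, and a real application would perform this check on buffers in GPU memory.

```python
def valid_lines(data, data_valid, M, N):
    """Yield (line_index, samples) for every line flagged as valid.

    data       -- flat sequence of samples, M * N samples per line
    data_valid -- one flag per line: 1 = valid, 0 = invalid
    """
    line_size = M * N
    for index, flag in enumerate(data_valid):
        if flag == 1:
            start = index * line_size
            yield index, data[start:start + line_size]


M, N, num_lines = 2, 2, 3
data = list(range(M * N * num_lines))  # placeholder sample values
data_valid = [1] * num_lines           # buffer is pre-filled with ones
data_valid[1] = 0                      # stands in for the digitizer flagging line 1

for index, samples in valid_lines(data, data_valid, M, N):
    print(index, samples)
```

Pre-filling the flags with ones matters in this model: since only zeros are ever written for invalid lines, a buffer that started with arbitrary content could not be interpreted.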

The resulting data buffer in GPU memory will always contain consistent data aligned in records and lines according to the trigger information received by the ADQ7, ready to be processed by the GPU.

3.2 Double Buffering and Kernel Scheduling

The example program employs a double buffering scheme when transferring data to the GPU. The digitizer writes to one buffer and signals the host when that buffer is completely filled. At that signal, the host may schedule processing of the data in the first buffer. Simultaneously, the digitizer can start writing data to a

ADQ7 GPU Peer-To-Peer – User Guide

www.teledyne-spdevices.com

