Demo for Drafting: On-GPU Data Access (CUDA, CuPy, PyTorch, DLPack) #2429
This is scratch work showcasing concepts for GPU data access coming together. It's a quick, half-working prototype.
#1986 #1985 #210 #2391 #120 (note: #120 includes opencl work!) #2426 #57
Demo (watch the text output: it attaches a vertex buffer to a program, then exports it to PyTorch and CuPy in-place):
I copied code from the DLPack repository to write a basic DLPack wrapper for vispy's GL buffers. I spent maybe six hours on this, and am sharing it now that it actually runs at all. I'm not sure whether I'll return to it, but I expect others would find it helpful (as well as frustrating and undocumented).
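For reference, the core of such a wrapper is populating DLPack's `DLTensor` struct around the mapped GL pointer. Here is a minimal ctypes sketch of the struct layout from `dlpack.h` (field names follow the DLPack spec; the buffer described at the bottom is purely illustrative):

```python
import ctypes

class DLDevice(ctypes.Structure):
    # Mirrors DLDevice from dlpack.h
    _fields_ = [("device_type", ctypes.c_int32),  # e.g. kDLCUDA == 2
                ("device_id", ctypes.c_int32)]

class DLDataType(ctypes.Structure):
    # Mirrors DLDataType: code 0 = int, 1 = uint, 2 = float
    _fields_ = [("code", ctypes.c_uint8),
                ("bits", ctypes.c_uint8),
                ("lanes", ctypes.c_uint16)]

class DLTensor(ctypes.Structure):
    _fields_ = [("data", ctypes.c_void_p),
                ("device", DLDevice),
                ("ndim", ctypes.c_int32),
                ("dtype", DLDataType),
                ("shape", ctypes.POINTER(ctypes.c_int64)),
                ("strides", ctypes.POINTER(ctypes.c_int64)),
                ("byte_offset", ctypes.c_uint64)]

kDLCUDA = 2  # keep this a plain Python int when handing it to consumers

# Illustrative: describe a (4, 3) float32 buffer on CUDA device 0.
shape_arr = (ctypes.c_int64 * 2)(4, 3)  # must stay alive with the tensor
t = DLTensor(data=None,                 # would be the mapped GL device pointer
             device=DLDevice(kDLCUDA, 0),
             ndim=2,
             dtype=DLDataType(2, 32, 1),  # float32
             shape=ctypes.cast(shape_arr, ctypes.POINTER(ctypes.c_int64)),
             strides=None,                # NULL => compact row-major
             byte_offset=0)
```

The real exchange additionally wraps this in a `DLManagedTensor` with a deleter and ships it inside a PyCapsule named `"dltensor"`, which is what `torch.from_dlpack` and `cupy.from_dlpack` consume.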
EDIT: I first posted this without accounting for the striding of the merged program data. It is now manually unstrided, and the output data is correct.
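For concreteness: vispy packs a program's merged vertex data into one interleaved (structured) array, so each attribute is a strided view rather than a contiguous block. A small numpy sketch of the kind of unstriding involved (the attribute names and sizes here are just examples):

```python
import numpy as np

# Hypothetical interleaved vertex buffer: each record packs a vec3
# position and a vec4 color, so one vertex occupies 28 bytes.
vbo = np.zeros(4, dtype=[("a_position", np.float32, 3),
                         ("a_color",    np.float32, 4)])
vbo["a_position"] = np.arange(12, dtype=np.float32).reshape(4, 3)

# The attribute view is strided: rows are 28 bytes apart, not 12.
pos = vbo["a_position"]
print(pos.strides)                # (28, 4)
print(pos.flags["C_CONTIGUOUS"])  # False

# "Unstriding" = copying into a compact array before handing the
# pointer to a consumer that assumes contiguous data.
compact = np.ascontiguousarray(pos)
print(compact.strides)            # (12, 4)
```

Alternatively, the strides can be passed through in the `DLTensor` itself, since DLPack supports non-contiguous layouts; copying is just the simpler first step.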
EDIT: I first posted this with an outstanding CuPy crash. I've now addressed byte-offset quirks for both torch and CuPy, and passed the device type as a plain Python int rather than a ctypes value. CuPy now loads the data correctly.
EDIT: The next remaining issue is integrating this code with vispy's GL pipeline. I'm not sure how to flush GL commands from the client without breaking the pipeline (which I haven't studied). A similar but separate issue may be synchronizing the CUDA stream with the GLIR queue.
But once I got this far, it was pretty exciting to see the same data come out of the torch tensor as I had passed into vispy's program.
EDIT: Current output is: