WIP: Texture as data source #2351

brisvag · 2022-07-05T15:44:12Z

Note that this PR includes #2350 to correctly cast dtypes.

This PR is an attempt at implementing a form of "data source" in vispy. This stems from previous discussions both on napari channels and on the vispy gitter.

This particular implementation does not introduce a new object; instead, it expands the gloo.Texture object to be able to keep track of the data it represents on the CPU side as well. This is a WIP implementation which currently works well with the ImageVisual, but not all tests pass and probably breaks all sorts of other things. To see how it works, try the following:

from vispy import scene
from vispy.io import load_data_file, read_png

canvas = scene.SceneCanvas(keys='interactive')
canvas.size = 800, 600
canvas.show()

view = canvas.central_widget.add_view()
img_data = read_png(load_data_file('mona_lisa/mona_lisa_sm.png'))

image = scene.visuals.Image(img_data, parent=view.scene)

view.camera = scene.PanZoomCamera(aspect=1)
view.camera.flip = (0, 1, 0)
view.camera.set_range()

Then, you can alter the texture data like this:

image._texture[:, 250:300] = [127, 0, 0]
image.update()

Or access it from the texture + chnage it from the Image level:

image.set_data(image._texture._data * 2)
image.update()

The ultimate goal is to then be able to easily do things like sharing textures between images:

import numpy as np

rand_data = np.random.randint(0, 127, (100, 100), dtype=np.int8)
image2 = scene.visuals.Image(rand_data, parent=view.scene)
image2.transform = scene.STTransform(translate=(-100, 0))
image.set_texture(image2._texture)

without ending up with broken states. The above code works here, but I haven't tested it properly yet, and I'm sure it needs some checks and protections (and maybe disallowing changing texture class).

@djhoese does this resemble what you had in mind? I found it was a lot easier to work on the Texture object, rather than creating a wrapper around it.

Might be interested and/or have good insight: @almarklein, @rossant

cc @melonora

…texture

djhoese · 2022-07-05T16:59:29Z

I expressed some of my fears about the casting changes in my review of #2350. For this PR specifically, I'm concerned about the lack of separation of responsibilities. The low-level Texture classes are meant to be low-level. If this can't be done as a subclass or a new wrapper object that uses a low-level Texture class then fine, but I'm worried about the maintainability. Note the similar-in-purpose PR for buffers: #1607

As far as how I expected this to work, it is close, but I was hoping for more separation. I'd like the average user wanting to do something like to not have to know the term "texture" at all, hence the wrapper class idea. I'm also not sure a user should be getting the data source (texture) from another Visual to pass it to another. There is the more obvious ._texture private attribute usage that should be fixed long term, but I was hoping for something closer to this:

rand_data = np.random...
shared_data = DataSource(rand_data)
image2 = Image(shared_data, ...)
image.set_data(shared_data)

If we really want users to know about textures then we need to think hard on where the line is drawn between vispy high-level and GL. Knowing about vertices and meshes is one thing, but knowing about textures in your actual usage of a Visual is a much lower level concept.

brisvag · 2022-07-06T11:16:27Z

The low-level Texture classes are meant to be low-level. If this can't be done as a subclass or a new wrapper object that uses a low-level Texture class then fine, but I'm worried about the maintainability. Note the similar-in-purpose PR for buffers: #1607

I think this is definitely doable either way. I went in this direction because It seemed less disruptive to how visuals interact with the underlying data. If we instead make a wrapper or subclass (such as in #1607), we need to effectively replace Textures and Buffers all over the codebase with the new wrappers (so at that point, why keep the Texture objects separate in the first place, if they're never used), or do like in #1607 and add everywhere the logic to choose if one wants to "use a cpu buffer". Note that this is not trivial, because every single visual uses and manages its buffers and textures very differently.

Personally I think we could go with a wrapper if that's better for maintainability, but in that case I would then make Texture basically a private thing that's there just for code separation.

I'm also not sure a user should be getting the data source (texture) from another Visual to pass it to another. There is the more obvious ._texture private attribute usage that should be fixed long term, but I was hoping for something closer to this:
rand_data = np.random...
shared_data = DataSource(rand_data)
image2 = Image(shared_data, ...)
image.set_data(shared_data)

Oh yeah I 100% agree on this; my example was like that just because that's a quick way to create a Texture with the correct setup, but your example is definitely how it should work!

As far as how I expected this to work, it is close, but I was hoping for more separation. I'd like the average user wanting to do something like to not have to know the term "texture" at all, hence the wrapper class idea.

If we really want users to know about textures then we need to think hard on where the line is drawn between vispy high-level and GL. Knowing about vertices and meshes is one thing, but knowing about textures in your actual usage of a Visual is a much lower level concept.

I see what you mean, but this is just a semantics problem. I mean, learning that Texture is the object that holds a reference to your data is not really harder than learning that it's called DataSource... And if you do know what a texture is, I don't think it's too confusing to learn that vispy keeps track of the data on CPU as well.

…urce

djhoese · 2022-07-06T14:56:43Z

Personally I think we could go with a wrapper if that's better for maintainability, but in that case I would then make Texture basically a private thing that's there just for code separation.

Due to gloo being a public API we can't actually make Texture private. I think I'm mostly thinking code separation. The Texture class is a "dumb" wrapper around the GL concept with only a few conveniences implemented inside. If what you plan on adding are just more conveniences or the argument can be made that they are, then I'm OK with that. The other reason I was hoping for a separate class was that it would be easier to define it as a Protocol that can be re-implemented for many different data sources. I fear I might be combining two separate ideas again though. Or maybe not...

If we think about what we want the user to be able to do, there are two main concepts:

Allow the user to tell the Visual to share low-level data storage to improve performance and reduce waste.
Provide data to the Visual in an abstract way so the Visual doesn't care what it is getting, just that it knows how to get the data it wants from this object. This idea though is a black hole of possibilities (automatically updating GPU textures with dask data or tiles as they are made available, etc). This could also be thought of as a TextureWrapper or TextureProxy class that doesn't need to be implemented now and could provide the same interface as the Texture class for high-level stuff. If we assume that the Visuals will only modify the data from the Texture properties and not through .set_data then this may be easy to implement in the future.

I see what you mean, but this is just a semantics problem. I mean, learning that Texture is the object that holds a reference to your data is not really harder than learning that it's called DataSource... And if you do know what a texture is, I don't think it's too confusing to learn that vispy keeps track of the data on CPU as well.

Good point.

I guess if none of my ramblings above make you think of something else then stick with the Texture class modifications. Once the tests pass then we can have more discussions.

brisvag · 2022-07-06T15:50:43Z

If what you plan on adding are just more conveniences or the argument can be made that they are, then I'm OK with that.

I think we're already doing quite complicated convenience stuff with the Scaled mixins. They are very cool and make working with Textures more intuitive, except they force you to do what the Image and Volume visuals are doing now and keep track of the "original" data yourself. In my mind moving this logic to the Texture not only makes working with the normal Texture itself easier (you can intuitively do texture[:2] = 2), but makes the Scaled mixins really work as intended.

This idea though is a black hole of possibilities (automatically updating GPU textures with dask data or tiles as they are made available, etc). This could also be thought of as a TextureWrapper or TextureProxy class that doesn't need to be implemented now and could provide the same interface as the Texture class for high-level stuff. If we assume that the Visuals will only modify the data from the Texture properties and not through .set_data then this may be easy to implement in the future.

Yeah this would be awesome, and it's along the idea of what we tried to do in napari with the TiledImageVisual, just more low-level and transparent. I agree that it's a bit out of scope for this; I would probably use a wrapper to Texture for that.

I guess if none of my ramblings above make you think of something else then stick with the Texture class modifications. Once the tests pass then we can have more discussions.

So far, I still think this is the right path :) I'll slowly chip away at all the weird issues and come back to you with a proper PR!

djhoese · 2022-07-06T16:04:59Z

I think we're already doing quite complicated convenience stuff with the Scaled mixins. They are very cool and make working with Textures more intuitive, except they force you to do what the Image and Volume visuals are doing now and keep track of the "original" data yourself. In my mind moving this logic to the Texture not only makes working with the normal Texture itself easier (you can intuitively do texture[:2] = 2), but makes the Scaled mixins really work as intended.

Hm holding onto the input data array is not great when it comes to holding on to memory on the CPU when it doesn't need to be. Like uploading a texture that goes on a Mesh. Right now aren't we allowed to delete the numpy array that was given to the texture since the data now lives on the GPU?

brisvag · 2022-07-07T12:07:15Z

I see what you mean... I guess in that case it's convenient to have a way to flush the CPU buffer, and disable part of the functionality. Or like in #1607, make it opt-in, and otherwise automatically "disable" all this (unless it's a CPU scaled mixin, which requires holding onto the original data).

djhoese · 2022-07-07T14:41:43Z

Even if it CPU scaled it isn't required. I mean, only if the limits are changed and it needs to be re-scaled.

The other thing about doing complicated convenience stuff in the scaled mixins is that none of that was supposed to be seen by the user. It was just combining two shared functionalities between the ImageVisual and VolumeVisual.

brisvag · 2022-07-07T15:32:14Z

Even if it CPU scaled it isn't required. I mean, only if the limits are changed and it needs to be re-scaled.

Fair enough :P

Co-authored-by: melonora <[email protected]>

…urce

Co-authored-by: melonora <[email protected]>

…texture

…urce

brisvag added 9 commits June 28, 2022 09:49

getting some stuff fixed

867065b

Merge branch 'main' of github.com:vispy/vispy into fix/dtype-casting-…

8cac383

…texture

do not use private attr

3cd6665

Merge branch 'main' of github.com:vispy/vispy into fix/dtype-casting-…

2f5e35e

…texture

fix test

aaa0f20

remove remnants of other code

87896dd

happy flake

80acec5

texture as data source

e06cea4

fix broken call

e77d2f1

brisvag mentioned this pull request Jul 6, 2022

Fix dtype casting for Texture objects #2350

Merged

brisvag added 2 commits July 6, 2022 12:10

reduce copies

60a6704

move to base class

9b9d7a9

Merge branch 'fix/dtype-casting-texture' into feature/texture-data-so…

506a16c

…urce

brisvag and others added 9 commits July 21, 2022 16:42

bigger canvas for testing

d41b72c

remove numpy import

32b0749

require app

a990c56

fix test error with dtype coercion if copy=False

d9b31d0

Co-authored-by: melonora <[email protected]>

Merge branch 'fix/dtype-casting-texture' into feature/texture-data-so…

695e21e

…urce

fix wrong dtype init for colormap

71de09f

Co-authored-by: melonora <[email protected]>

remove (maybe) useless coercion

c3dbb8c

Co-authored-by: melonora <[email protected]>

some fixes

bcc067e

Fix comment typo in vispy/visuals/_scalable_textures.py

0074a28

brisvag added 7 commits July 26, 2022 11:37

rename casting func and add volume test

3753e98

add requires_pyopengl to test

29aee53

try to not cast dtype in image and volume visual

7ef71d8

fix wrong method call

5a57ed7

fix example

df075e8

Merge branch 'main' of github.com:vispy/vispy into fix/dtype-casting-…

676f152

…texture

Merge branch 'fix/dtype-casting-texture' into feature/texture-data-so…

c84a815

…urce

brisvag mentioned this pull request Aug 2, 2022

NAP 4: asynchronous slicing napari/napari#4892

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Texture as data source #2351

WIP: Texture as data source #2351

brisvag commented Jul 5, 2022

djhoese commented Jul 5, 2022

brisvag commented Jul 6, 2022

djhoese commented Jul 6, 2022

brisvag commented Jul 6, 2022

djhoese commented Jul 6, 2022

brisvag commented Jul 7, 2022

djhoese commented Jul 7, 2022

brisvag commented Jul 7, 2022

WIP: Texture as data source #2351

Are you sure you want to change the base?

WIP: Texture as data source #2351

Conversation

brisvag commented Jul 5, 2022

djhoese commented Jul 5, 2022

brisvag commented Jul 6, 2022

djhoese commented Jul 6, 2022

brisvag commented Jul 6, 2022

djhoese commented Jul 6, 2022

brisvag commented Jul 7, 2022

djhoese commented Jul 7, 2022

brisvag commented Jul 7, 2022