Enhance Tensor Flexibility with Structs #327

BrianPetkovsek · 2023-06-29T20:55:04Z

Replaces kp::Tensor::TensorDataTypes with std::type_info. This allows you to use any struct as a tensor element type rather than being limited to the base datatypes (int, uint, float, etc.).

The easiest way I found to use structs in shaders is to use the "GL_GOOGLE_include_directive" extension to include an hpp file that holds the struct. This way you can use the same struct in your shader code and in your C++ code (see the new example).
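A minimal sketch of the shared-header idea, under stated assumptions: the struct `Particle`, the file name `shared_types.hpp` and the helper `particleAt` are hypothetical and not taken from the PR. The struct uses only syntax that both GLSL and C++ accept; the shader side appears as a comment because it is GLSL.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <type_traits>

// Contents of a hypothetical shared_types.hpp, included by both sides.
// Shader side (GLSL, shown as a comment):
//   #version 450
//   #extension GL_GOOGLE_include_directive : require
//   #include "shared_types.hpp"
//   layout(std430, set = 0, binding = 0) buffer Particles { Particle items[]; };
struct Particle
{
    float x;
    float y;
    float mass;
    float charge;
};

// Host-side checks that the C++ layout is the tightly packed one assumed here.
static_assert(std::is_standard_layout<Particle>::value, "Particle must be standard layout");
static_assert(std::is_trivially_copyable<Particle>::value, "Particle must be trivially copyable");
static_assert(sizeof(Particle) == 4 * sizeof(float), "unexpected padding in Particle");

// Copy element i out of raw buffer memory (for example, mapped tensor data).
inline Particle particleAt(const void* raw, std::size_t count, std::size_t i)
{
    assert(i < count);
    Particle p;
    std::memcpy(&p,
                static_cast<const unsigned char*>(raw) + i * sizeof(Particle),
                sizeof(Particle));
    return p;
}
```

Structs that mix vector and scalar members need more care, because GLSL block layout rules can insert padding that a plain C++ struct does not have.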

axsaucedo · 2023-07-02T12:23:18Z

@BrianPetkovsek thank you for the contribution, this seems like a reasonable addition. I also wasn't aware of GL_GOOGLE_include_directive, which is quite interesting. It seems roughly analogous to the way the Constants are implemented, as these also have the flexibility to be other structures.

Would this make any difference to performance?

As a heads up, there still seem to be build issues in the Python package. The rest of the examples should also be tested to ensure they work correctly: although this is a small design change, we need to make sure everything else still works. If you can validate the basic examples, I can validate the heavier ones like Android / Godot.

Signed-off-by: Brian Petkovsek <[email protected]>

Implement TypeContainers

BrianPetkovsek · 2023-07-07T06:05:10Z

Hi @axsaucedo,

I’ve decided to change my approach and use an abstract class called ABCTypeContainer instead of typeid and std::type_info. This allows for the implementation of both a C++ and Python version, which I have already included.

I’ve made some significant changes to main.cpp for the pybind11 bindings, but I’m not entirely sure how push_consts, specconstsvec, and pushconstsvec are implemented. If they’re just buffers, it might be easier to make them a std::vector<byte>.

The C++ type container could use some cleanup. Currently, it uses a counter system with the struct IdCounter. If there’s a way to consolidate this without using an external struct, it should be done.

The C++ type container works as follows: the classId() method returns a unique identifier for each template instantiation of the TypeContainer template. It does this with a static variable id that is initialized from the counter in the IdCounter struct, which is then incremented, so each instantiation gets its own id.
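A rough sketch of the counter scheme described above. The names IdCounter, TypeContainer and classId come from the description, but the bodies are assumptions and not the code in the PR.

```cpp
#include <cstddef>

// Shared counter; wrapped in a function so it is initialised on first use.
struct IdCounter
{
    static std::size_t& counter()
    {
        static std::size_t value = 0;
        return value;
    }
};

template<typename T>
struct TypeContainer
{
    // The function-local static is initialised once per T, on the first call,
    // so every instantiation takes the next free counter value and keeps it.
    static std::size_t classId()
    {
        static const std::size_t id = IdCounter::counter()++;
        return id;
    }
};
```

The id each type receives depends on which instantiation calls classId() first, so the numbers are only stable while the call order is stable.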

The PyTypeContainer works by comparing the dtypes of numpy arrays. A dtype can act as either a structure or just a datatype, allowing for struct-like capabilities (see https://numpy.org/doc/stable/user/basics.rec.html#structured-arrays).

This also resolves #99, as you can put multi-dimensional arrays in structs.

Signed-off-by: Brian Petkovsek <[email protected]>

Update Tests

BrianPetkovsek · 2023-07-07T19:10:29Z

Awaiting review/comments.

axsaucedo

Thank you for your further iteration @BrianPetkovsek - I have now added a few comments throughout the document.

The main feedback, consistent throughout, is to explore how we can 1) keep it as simple as possible, even if we have to reduce functionality, 2) ensure determinism, and 3) prefer explicit over implicit.

Going back to my previous question, I would be keen to understand whether this affects performance. Let me know if you can find at least some heuristic-based insights.

In regards to your question:

I’ve made some significant changes to main.cpp for the pybind11 bindings, but I’m not entirely sure how push_consts, specconstsvec, and pushconstsvec are implemented. If they’re just buffers, it might be easier to make them a std::vector<byte>.

They are indeed just bytes, similar to the rest of the data. You can see how these are set in the algorithm and in the opalgo, as well as in the respective Push/SpecConstTest. This means they could benefit from a similar approach, since structs could be used. However, the current implementation still seems to be failing the tests; you should be able to run the tests locally with act.

As mentioned in the comments, however, I would like to avoid, where possible, large changes in the C++ API made just to get the Python API working. I know this wasn't the main driver, but given the increase in complexity from the first iteration to now, I would be keen to explore ways to reduce it.
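To illustrate the point that push and specialization constants are just bytes, here is a small sketch of packing a struct into a byte vector and reading it back. The helpers `toBytes` and `fromBytes` and the struct `PushParams` are hypothetical and are not part of the Kompute API.

```cpp
#include <cstdint>
#include <cstring>
#include <stdexcept>
#include <type_traits>
#include <vector>

// Copy any trivially copyable value into a byte vector.
template<typename T>
std::vector<std::uint8_t> toBytes(const T& value)
{
    static_assert(std::is_trivially_copyable<T>::value, "T must be trivially copyable");
    std::vector<std::uint8_t> bytes(sizeof(T));
    std::memcpy(bytes.data(), &value, sizeof(T));
    return bytes;
}

// Rebuild a value from bytes; the size must match exactly.
template<typename T>
T fromBytes(const std::vector<std::uint8_t>& bytes)
{
    static_assert(std::is_trivially_copyable<T>::value, "T must be trivially copyable");
    if (bytes.size() != sizeof(T)) {
        throw std::invalid_argument("byte count does not match sizeof(T)");
    }
    T value;
    std::memcpy(&value, bytes.data(), sizeof(T));
    return value;
}

// Hypothetical push-constant block.
struct PushParams
{
    float scale;
    std::uint32_t offset;
};
```

A struct passed this way still has to match the layout the shader expects for its push constant block.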

axsaucedo · 2023-07-09T10:14:49Z

setup.py

@@ -83,7 +83,11 @@ def build_extension(self, ext):
 description='Kompute: Blazing fast, mobile-enabled, asynchronous, and optimized for advanced GPU processing usecases.',
 long_description=long_description,
 long_description_content_type='text/markdown',
- ext_modules=[CMakeExtension('kp')],
+ ext_modules=[CMakeExtension('kp/kp')],
+ packages = find_packages(where="python/src"),


Why is this being removed?

It just puts the module into a folder (kp), and then __init__.py imports kp.

I was using it to test whether I needed to create a pure Python class. I implemented PyTypeContainer, then moved it to C++.
It can be reverted, but it simply allows pure Python code to be packaged with the project if needed.

axsaucedo · 2023-07-09T10:19:00Z

src/include/kompute/TypeContainer.hpp

+ private:
+ size_t classId()
+ {
+ static size_t id = counter++;


This introduces non-deterministic behaviour: the id a TypeContainer gets in one run can differ from the id it gets in another, which we should avoid. The typeid + std::type_info approach seemed more robust; what's the reason for not using it here?

Changed to typeid.
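A minimal sketch of what a typeid-based container could look like. The class names ABCTypeContainer and TypeContainer come from the thread; the member functions `compare` and `elementSize` and the struct `Vec2` are assumptions made for illustration only.

```cpp
#include <cstddef>
#include <typeinfo>

// Abstract base so C++ and Python containers can share one interface.
class ABCTypeContainer
{
  public:
    virtual ~ABCTypeContainer() = default;
    virtual bool compare(const ABCTypeContainer& other) const = 0;
    virtual std::size_t elementSize() const = 0;
};

template<typename T>
class TypeContainer : public ABCTypeContainer
{
  public:
    // typeid on a polymorphic reference yields the dynamic type, so this is
    // true only when 'other' is the same instantiation of TypeContainer.
    bool compare(const ABCTypeContainer& other) const override
    {
        return typeid(other) == typeid(TypeContainer<T>);
    }

    std::size_t elementSize() const override { return sizeof(T); }
};

// Hypothetical user struct used as a tensor element type.
struct Vec2
{
    float x;
    float y;
};
```

Unlike the counter scheme, the comparison does not depend on the order in which types are first used.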

axsaucedo · 2023-07-09T10:34:56Z

python/src/main.cpp

 }

+class PyTypeContainer : public ABCTypeContainer


Reading this, it seems that this new iteration was added to explore the implementation on the Python side, but it also adds quite a lot of complexity on the C++ side (there are also quite a few Python-internal references here that I'm not sure about). I would be keen to explore how the complexity of this implementation can be reduced.

axsaucedo · 2023-07-09T10:35:27Z

python/src/main.cpp

+{
+ public:
+ PyTypeContainer(py::object clazz)
+ : clazz_(clazz)


I don't follow what's going on here, what's clazz_(...) in this case?

clazz_ should be renamed to dtype. It stores the dtype so that we can cast the data back to the correct format in the data() method (line 139). Likewise, we return the dtype in data_type() (line 161).

Renamed for better understanding.

axsaucedo · 2023-07-09T10:36:41Z

python/src/main.cpp

+ auto frombuffer = np.attr("frombuffer");
+
+ auto dtype =
+ dynamic_cast<PyTypeContainer*>(&(*self.dataType()))->clazz_;


Not sure I follow - what is clazz_ in this case (same Q as above)?

clazz_ should be renamed to dtype.

We need the dtype to cast the array from bytes back to the original dtype.

Fixed an issue that was causing the tests to fail: the base object was being overwritten by frombuffer. The implementation is now fixed.

Signed-off-by: Brian Petkovsek <[email protected]>

Pull

Signed-off-by: Brian Petkovsek <[email protected]>

Pullrq

Update tests

BrianPetkovsek · 2023-07-10T13:03:47Z

Fixed the issues with the tests.

I created a new class called Buffer. This is a basic buffer class that holds buffer data such as the pointer location, size, length, and end position. The class gets optimized away by the compiler.

I have also updated the classes that use push/spec constants to accept a Buffer without affecting the external API (you can still use vectors for push/spec constants).
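A rough sketch of such a lightweight view, assuming it does not own its memory. It is named `ByteView` here, which is a hypothetical name, and the members shown are assumptions based on the description above rather than the PR's actual class.

```cpp
#include <cstddef>
#include <vector>

// View over existing memory: start pointer, element count and element size.
class ByteView
{
  public:
    ByteView(const void* data, std::size_t count, std::size_t elementSize)
      : mData(static_cast<const unsigned char*>(data))
      , mCount(count)
      , mElementSize(elementSize)
    {
    }

    // Convenience constructor so callers can keep passing vectors.
    template<typename T>
    explicit ByteView(const std::vector<T>& values)
      : ByteView(values.data(), values.size(), sizeof(T))
    {
    }

    const unsigned char* begin() const { return mData; }
    const unsigned char* end() const { return mData + sizeBytes(); }
    std::size_t count() const { return mCount; }
    std::size_t elementSize() const { return mElementSize; }
    std::size_t sizeBytes() const { return mCount * mElementSize; }

  private:
    const unsigned char* mData;
    std::size_t mCount;
    std::size_t mElementSize;
};
```

Because every accessor is a small inline function, a compiler will usually inline them away in optimized builds. The view must not outlive the vector or memory it points at.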

BrianPetkovsek · 2023-07-10T13:04:16Z

Awaiting review/comments.

Signed-off-by: Brian Petkovsek <[email protected]>

Compile Fixes

BrianPetkovsek · 2023-07-10T18:32:00Z

I'm compiling my build with MSVC; it seems gcc is pickier when compiling. Fixed the build issues.

BrianPetkovsek · 2023-07-10T19:25:38Z

I'll download gcc and fix it.

axsaucedo · 2023-07-10T19:45:57Z

Thank you for having a deeper look into this, I was able to get an initial review. I have a few followups:

I have also updated the classes that use push/spec constants to allow for Buffer without affecting the external API (you can still use vectors as push/spec).

Would you be able to provide further context on the reasoning for having a Buffer class? Is this to simplify the way the Python wrapper implements it? I feel this buffer adds extra complexity. I do agree that it is only used internally, so it may not be as bad, but I am wondering whether it is necessary at all or only needed on the Python side.

On a side note, in regards to naming conventions, we should not use names that already exist in Vulkan, to avoid confusion (e.g. Buffer vs VkBuffer).

In regards to the dataType, it seems it's passed everywhere as a pointer; is there a reason why it is not passed as a reference instead? Given its deterministic behaviour, I would assume this could now be the case.

BrianPetkovsek added 3 commits June 29, 2023 16:50

replace TensorDataTypes with std::type_info

f91f6d4

replaces kp::Tensor::TensorDataTypes with std::type_info. This allows you to use any struct as a datatype Signed-off-by: Brian Petkovsek <[email protected]>

Update tests for typeid implementation

7da4c1c

Signed-off-by: Brian Petkovsek <[email protected]>

Add structure array multiplication example

87aa8a9

Signed-off-by: Brian Petkovsek <[email protected]>

BrianPetkovsek added 3 commits July 7, 2023 01:00

Implement ABCTypeContainer

b5c9078

Signed-off-by: Brian Petkovsek <[email protected]>

Implement C++ TypeContainer

909ad4b

Signed-off-by: Brian Petkovsek <[email protected]>

Implement PyTypeContainer

83ee84a

Signed-off-by: Brian Petkovsek <[email protected]>

BrianPetkovsek force-pushed the pullrq branch from f55ac4d to 87aa8a9 Compare July 7, 2023 05:02

Merge pull request #3 from BrianPetkovsek/master

1eecb55

Implement TypeContainers

BrianPetkovsek added 3 commits July 7, 2023 14:39

Update tests

1132742

Signed-off-by: Brian Petkovsek <[email protected]>

Update tests

2c3a4ef

Signed-off-by: Brian Petkovsek <[email protected]>

Merge pull request #4 from BrianPetkovsek/master

67deb25

Update Tests

axsaucedo requested changes Jul 9, 2023

View reviewed changes

BrianPetkovsek added 8 commits July 10, 2023 07:55

Make TypeContainer use typeid

50f9952

Signed-off-by: Brian Petkovsek <[email protected]>

Change dataType to pointer from shared_ptr

6b9569e

Signed-off-by: Brian Petkovsek <[email protected]>

Implement simple buffer

9df3785

Signed-off-by: Brian Petkovsek <[email protected]>

Fix pushconstsvec, specconstsvec and tensor data return

34d0596

Signed-off-by: Brian Petkovsek <[email protected]>

Merge pull request #5 from BrianPetkovsek/master

3250768

Pull

Updated tests

711b559

Signed-off-by: Brian Petkovsek <[email protected]>

Merge pull request #6 from BrianPetkovsek/pullrq

8657265

Pullrq

Merge pull request #7 from BrianPetkovsek/master

a712058

Update tests

BrianPetkovsek added 4 commits July 10, 2023 14:04

simplify itemsize

47afbdf

Signed-off-by: Brian Petkovsek <[email protected]>

fix build errors

8e3bcf2

Signed-off-by: Brian Petkovsek <[email protected]>

Revert change

9da945b

Signed-off-by: Brian Petkovsek <[email protected]>

Merge pull request #8 from BrianPetkovsek/master

33b4b19

Compile Fixes
