
feat : Adds ggml_upscale_ext #814

Merged
merged 10 commits into ggerganov:master May 15, 2024

Conversation

@balisujohn (Contributor) commented May 6, 2024

Closes issue #812

Adds ggml_upscale_ext, an alternate version of ggml_upscale that lets the user upscale a tensor to a specified shape using nearest-neighbor interpolation.
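
For illustration, a minimal usage sketch (the signature follows the example given later in this thread; the context ctx and input tensor a are assumed to be set up already):

// Upscale tensor a (e.g. shape [43, 1024, 1, 1]) to the exact target
// shape [187, 1024, 1, 1] using nearest-neighbor interpolation:
struct ggml_tensor * upscaled = ggml_upscale_ext(ctx, a, 187, 1024, 1, 1);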

Adds CPU and CUDA implementations and a test, though it's not clear to me whether either this or ggml_upscale has adequate tests.

Also, the reason I added this is that it's needed to convert tortoise-tts to ggml.

This is unfinished: I still need to add the CUDA implementation and possibly expand the tests, since I haven't verified it actually works yet. I will remove the draft annotation when it's ready for review.

@balisujohn balisujohn marked this pull request as draft May 6, 2024 07:23
@balisujohn (Contributor, Author)

(I also realize there are some unnecessary params in some of the functions, so I will clean that up before review)

@balisujohn balisujohn changed the title to Draft PR adding ggml_upscale_to_shape May 6, 2024
@balisujohn balisujohn changed the title Draft PR adding ggml_upscale_to_shape → feat : Draft PR adding ggml_upscale_to_shape May 6, 2024
@balisujohn balisujohn changed the title feat : Draft PR adding ggml_upscale_to_shape → feat : Adds ggml_upscale_to_shape May 8, 2024
@balisujohn (Contributor, Author)

Ready for review! :^)

@balisujohn balisujohn marked this pull request as ready for review May 8, 2024 01:03
@@ -468,6 +468,7 @@ extern "C" {
     GGML_OP_POOL_1D,
     GGML_OP_POOL_2D,
     GGML_OP_UPSCALE, // nearest interpolate
+    GGML_OP_UPSCALE_TO_SHAPE, // nearest interpolate to specified tensor shape
@ggerganov (Owner)

There is no need to add an extra op; we can reuse the existing GGML_OP_UPSCALE.

@balisujohn (Contributor, Author)

fixed

@ggerganov (Owner)

We should generalize the existing upscale op and kernels, since the existing ggml_upscale functionality is a special case of the more general ggml_upscale_to_shape.

@balisujohn (Contributor, Author)

> We should generalize the existing upscale op and kernels, since the existing ggml_upscale functionality is a special case of the more general ggml_upscale_to_shape.

I can do that. My thinking was to avoid breaking backwards compatibility, but I can just put the new behavior in the ggml_upscale op, since it is a strict superset of ggml_upscale's original behavior.

@ggerganov (Owner)

ggml_upscale can still do what it currently does in order to keep backwards compatibility. Take a look at ggml_soft_max and ggml_soft_max_ext as an example of what I have in mind.
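
For context, a minimal sketch of that pattern (illustrative only, not the actual ggml source; the ggml_upscale signature shown is the pre-existing one with an integer scale factor):

// General entry point: the caller specifies the exact target shape.
struct ggml_tensor * ggml_upscale_ext(
        struct ggml_context * ctx, struct ggml_tensor * a,
        int ne0, int ne1, int ne2, int ne3);

// Backwards-compatible wrapper: the old integer scale factor becomes
// a special case of the general target-shape version.
struct ggml_tensor * ggml_upscale(
        struct ggml_context * ctx, struct ggml_tensor * a, int scale_factor) {
    return ggml_upscale_ext(ctx, a,
                            (int)(a->ne[0] * scale_factor),
                            (int)(a->ne[1] * scale_factor),
                            (int) a->ne[2], (int) a->ne[3]);
}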

@balisujohn balisujohn changed the title feat : Adds ggml_upscale_to_shape → feat : Adds ggml_upscale_ext May 12, 2024
@balisujohn (Contributor, Author)

Ready for review again :^)

@balisujohn (Contributor, Author)

Very impressive response time!

@ggerganov (Owner) left a comment

This implementation ignores half-pixel effects. I'm not sure how the reference PyTorch operators behave, but it's something that needs a deeper look.

I also wonder if we should take the time to make the upscale operator more general and support downscaling too, i.e. GGML_OP_RESCALE.
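
For reference, a sketch of the two index mappings at issue. To the best of my knowledge, PyTorch's mode='nearest' uses the first and mode='nearest-exact' adds the half-pixel offset, but this is exactly the kind of thing that needs verifying:

#include <math.h>

// scale is the per-dimension scale factor (dst extent / src extent)

// Without half-pixel correction (what this PR implements):
static inline long src_index_nearest(long i_dst, float scale) {
    return (long) floorf(i_dst / scale);
}

// With the half-pixel offset; a real implementation would also clamp
// the result to the valid source range:
static inline long src_index_nearest_exact(long i_dst, float scale) {
    return (long) floorf((i_dst + 0.5f) / scale);
}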

Comment on lines 3 to 6
static __global__ void upscale_f32(const float * x, float * dst,
const int ne00, const int ne01, const int ne02, const int ne03,
const int ne10, const int ne11, const int ne12, const int ne13,
const float sf0, const float sf1, const float sf2, const float sf3) {
@ggerganov (Owner)

Please update the kernel to use the tensor strides; see the CPU and Metal implementations as an example. Otherwise, this would work only for contiguous data.

When ready, try to see if you can add tests in test-backend-ops that exercise non-contiguous data.
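
For illustration, stride-based addressing in the style of the CPU implementation (a sketch, not the actual ggml source; ne10..ne13 are the destination extents, nb_src/nb_dst the per-dimension byte strides, and sf0..sf3 the float scale factors, dst extent / src extent):

#include <stddef.h>
#include <stdint.h>

// Nearest-neighbor upscale using byte strides for addressing, so
// non-contiguous source tensors are handled correctly.
static void upscale_f32_strided(
        const char * src, char * dst,
        const int64_t ne10, const int64_t ne11, const int64_t ne12, const int64_t ne13,
        const size_t nb_src[4], const size_t nb_dst[4],
        const float sf0, const float sf1, const float sf2, const float sf3) {
    for (int64_t i3 = 0; i3 < ne13; i3++) {
        const int64_t i03 = (int64_t)(i3 / sf3);
        for (int64_t i2 = 0; i2 < ne12; i2++) {
            const int64_t i02 = (int64_t)(i2 / sf2);
            for (int64_t i1 = 0; i1 < ne11; i1++) {
                const int64_t i01 = (int64_t)(i1 / sf1);
                for (int64_t i0 = 0; i0 < ne10; i0++) {
                    const int64_t i00 = (int64_t)(i0 / sf0);
                    // byte-stride addressing instead of assuming contiguity
                    const float * sp = (const float *)(src
                            + i03*nb_src[3] + i02*nb_src[2] + i01*nb_src[1] + i00*nb_src[0]);
                    float * dp = (float *)(dst
                            + i3*nb_dst[3] + i2*nb_dst[2] + i1*nb_dst[1] + i0*nb_dst[0]);
                    *dp = *sp;
                }
            }
        }
    }
}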

@balisujohn (Contributor, Author)

OK done.

@balisujohn (Contributor, Author)

> This implementation ignores half-pixel effects. I'm not sure how the reference PyTorch operators behave, but it's something that needs a deeper look.
>
> I also wonder if we should take the time to make the upscale operator more general and support downscaling too, i.e. GGML_OP_RESCALE.

Alright, so with regard to behaving similarly to PyTorch nearest-neighbor interpolation, I can confirm that

cur = ggml_upscale_ext(ctx0, cur, 187, 1024, 1, 1);

behaves the same way (only tested with the CUDA backend) as

expanded_code_emb = F.interpolate(code_emb, size=187, mode='nearest')

for my internal test-case tensor of shape [43, 1024] (dimensions read left to right).

I think it would be nice to support downscaling, but I also think it might be good to leave that to a future PR.

Ready for review again :^)

@balisujohn (Contributor, Author) commented May 12, 2024

Also, is there somewhere that input and expected-output literals are defined for the test cases? I found it a bit strange that I didn't have to specify the actual element-wise values of the input and expected-output tensors when defining the tests.

@balisujohn balisujohn requested a review from ggerganov May 14, 2024 05:43
@ggerganov (Owner)

The test-backend-ops tool does not provide a mechanism for specifying input/output values. You can write a standalone test, similar to test-conv1d.cpp for example. It would be useful to demonstrate that the numerical results match the results from PyTorch.

Also, does PyTorch support floating-point scaling factors? If so, we should consider extending the functionality.
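
By way of illustration, the kind of hand-checkable case such a standalone test could assert (a sketch exercising only the index mapping, not the ggml graph machinery; for this input the expected values match PyTorch's mode='nearest'):

#include <assert.h>
#include <math.h>

// Upscale a 2x2 tensor to 4x4 with nearest-neighbor interpolation and
// compare against values computed by hand.
int main(void) {
    const float src[2][2] = { {1, 2}, {3, 4} };
    const float expected[4][4] = {
        {1, 1, 2, 2},
        {1, 1, 2, 2},
        {3, 3, 4, 4},
        {3, 3, 4, 4},
    };
    for (int i1 = 0; i1 < 4; i1++) {
        for (int i0 = 0; i0 < 4; i0++) {
            // nearest-neighbor: source index = destination index / scale
            const float got = src[i1 / 2][i0 / 2];
            assert(fabsf(got - expected[i1][i0]) < 1e-6f);
        }
    }
    return 0;
}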

@ggerganov ggerganov merged commit 126d349 into ggerganov:master May 15, 2024
4 checks passed