Add pre and post processing steps to allow non float dtypes #2882

Open
wants to merge 9 commits into base: main

Conversation

ashnair1 (Contributor)

Changes

When using cross entropy loss with torchmetrics, masks need to be of type int, but kornia only considers float a valid data type.
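
For context, a minimal sketch of the constraint (the shapes are made up for the example; the actual pipeline goes through torchmetrics, but torch.nn.functional.cross_entropy shows the same dtype requirement):

import torch
import torch.nn.functional as F

logits = torch.randn(2, 5, 8, 8)         # (batch, classes, H, W) predictions
mask = torch.randint(0, 5, (2, 8, 8))    # segmentation mask holding class indices (int64)

# Cross entropy with class-index targets requires an integer (long) mask;
# a float mask, as kornia augmentations currently require, would instead be
# interpreted as class probabilities (and would need a different shape).
loss = F.cross_entropy(logits, mask)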

Type of change

  • 🔬 New feature (non-breaking change which adds functionality)

Checklist

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • Did you update CHANGELOG in case of a major change?

@johnnv1 (Member) left a comment

Add some tests to ensure this too... since the random generators use float values, maybe the output will be float anyway. I'm not sure, but that's what I'm waiting for xD

@shijianjian (Member) left a comment

I do not think this is reasonable, as most methods consider only the 0-1 value range. If you want to support other formats like uint8 or uint16, I think preprocessing and postprocessing to 0-1 are needed.

@ashnair1 (Contributor, Author)

I do not think this is reasonable, as most methods consider only the 0-1 value range. If you want to support other formats like uint8 or uint16, I think preprocessing and postprocessing to 0-1 are needed.

Agreed. I have made it so inputs are converted to float before the operations and reverted to the original dtype once they're complete.
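
Roughly the pattern described, as a sketch (the helper name is made up and is not part of this PR's diff):

import torch

def _apply_with_float_cast(op, mask: torch.Tensor) -> torch.Tensor:
    # Run an op that only accepts floating tensors on an integer mask:
    # cast to float for the op, then cast the result back to the original dtype.
    orig_dtype = None
    if not torch.is_floating_point(mask):
        orig_dtype = mask.dtype
        mask = mask.float()
    out = op(mask)
    if orig_dtype is not None:
        out = out.to(orig_dtype)
    return out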

@ashnair1 ashnair1 marked this pull request as ready for review April 17, 2024 16:50
@ashnair1 ashnair1 changed the title Add int32 & int64 as valid data types Add pre and post processing steps to allow non float dtypes Apr 17, 2024
@ashnair1 (Contributor, Author)

ashnair1 commented May 1, 2024

@shijianjian @johnnv1 @edgarriba Any thoughts on this?

@shijianjian (Member) left a comment

I think this should not apply to the input type.

def test_forward_and_inverse(self, random_apply, device, dtype):
    inp = torch.randn(1, 3, 1000, 500, device=device, dtype=dtype)
    if dtype not in [torch.float32, torch.float64]:

We do not run int tests in our CI. You may restrict the int handling to masks only.

dtype = None
if not torch.is_floating_point(input):
    dtype = input.dtype
    input = input.float()

This does not make sense for the input. Our functions do not support the 0-255 range.

@ashnair1 (Contributor, Author)

ashnair1 commented May 6, 2024

The current issue is which dtype to cast to. If a mask with a non-float dtype is detected, it is converted to a float type and later reverted. The issue is that if it's converted to float, the slow float64 tests complain (Expected float, got double), and when it's converted to double, the slow float32 tests complain. Does it need to be aware of what dtype the input is and map accordingly?

Would appreciate some pointers here as I'm not sure how to go about this.
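
For reference, the kind of mismatch being described, using grid_sample as a stand-in for the internal warp (names and shapes here are illustrative only): hardcoding .float() yields a float32 mask that clashes with float64 parameters, while following the input's floating dtype avoids it.

import torch
import torch.nn.functional as F

inp = torch.randn(1, 3, 4, 4, dtype=torch.float64)              # e.g. the slow float64 tests
theta = torch.eye(2, 3, dtype=torch.float64).unsqueeze(0)       # identity transform
grid = F.affine_grid(theta, (1, 3, 4, 4), align_corners=False)  # float64 sampling grid

mask = torch.randint(0, 2, (1, 1, 4, 4))                        # int64 mask

# Hardcoded cast: float32 mask vs float64 grid -> "expected input and grid to have same dtype"
# F.grid_sample(mask.float(), grid, align_corners=False)

# dtype-aware cast: follow the input's floating dtype
warped = F.grid_sample(mask.to(inp.dtype), grid, align_corners=False)
print(warped.dtype)  # torch.float64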

@johnnv1 (Member)

johnnv1 commented May 18, 2024

Would appreciate some pointers here as I'm not sure how to go about this.

hey @ashnair1, sorry for the delay on this one...

I think the issue here is: the random generator creates the augmentation parameters on some device and with some dtype, then casts them to the input dtype (probably the same dtype as the image, since we always have the image)... here you are "hardcoding" the dtype for the other cases when you want to support non-floating dtypes; instead, you should cast it to the input dtype (or maybe the params dtype? not sure, cc @shijianjian)

@ashnair1 (Contributor, Author)

That does not make sense to me. Just so we're all on the same page regarding what we're trying to achieve here:

In a kornia augmentation, masks by default have to be float. This is because certain operations don't work on int tensors. But as mentioned above, masks are typically expected to be int tensors, and the fact that they are float here is a design convenience.

So in order to work with mask tensors of type int, we should pre-process them to float before the op (which only accepts floats) and post-process them back to int after the op.

However, the tests require the mask tensors to be the same dtype as the input. This means that if you want to test passing in a mask tensor of type int, you would (as you suggested) need to have the input be of type int as well, which in turn means we need to add pre- and post-processing steps for the input too, which @shijianjian had advised against (#2882 (review)) and which I agree with.

Ultimately, what I would like to see is that kornia ensures the input is float while allowing masks to be int or float, relying on the processing steps to ensure the mask tensor works with the ops.

@johnnv1 (Member)

johnnv1 commented May 20, 2024

That does not make sense to me. Just so we're all on the same page regarding what we're trying to achieve here:

In a kornia augmentation, masks by default have to be float. This is because certain operations don't work on int tensors. But as mentioned above, masks are typically expected to be int tensors, and the fact that they are float here is a design convenience.

So in order to work with mask tensors of type int, we should pre-process them to float before the op (which only accepts floats) and post-process them back to int after the op.

Yep, but we need to cast to the right floating type, which is the same floating type as the input data; otherwise it will mismatch and crash in some ops...

What I mean is to retrieve the data type from the input itself, using something like:

inp = in_args[self.transform_op.data_keys.index(DataKey.INPUT)]
(or some other accessor from the API itself)
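
i.e., something along these lines, I assume (a sketch only; in_args, data_keys and the helper name are hypothetical wiring around this suggestion, while DataKey comes from kornia.constants):

from kornia.constants import DataKey

def _cast_mask_like_input(in_args, data_keys, mask):
    # Cast the mask to the same floating dtype as the input tensor travelling
    # through the same call, instead of hardcoding float32 or float64.
    inp = in_args[data_keys.index(DataKey.INPUT)]   # the image tensor
    return mask.to(inp.dtype) if inp.is_floating_point() else mask.float()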
