Add squeeze / unsqueeze operations to quant invariant functions in torch_handler.py
#891
Comments
Hi @nickfraser, you wanted to add squeeze/unsqueeze operations to the quant_invariant_handler function, right?
When dealing with squeeze/unsqueeze, we also have to handle the shapes of the scale factors and zero points. Related to #728.
For per-channel quantization, the squeeze/unsqueeze op is more like the permute op, in that there is an easy way to modify the QuantTensor to keep the op affine-quantization invariant: all we need to do is squeeze/unsqueeze the scale and zero-point tensors accordingly. However, the ops mentioned in #728 (reshape, flatten) are non-trivial. There is no trivial way to modify the QuantTensor to keep those ops affine-quantization invariant, and recalculating the scale and zero point is inevitable. We may need to dequantize --> reshape/flatten --> requantize to get around this, at the price of some precision loss. It looks like PyTorch doesn't solve this problem either: it doesn't offer a quantized version of flatten(), and simply uses torch.flatten() instead (see the PyTorch Quantization API Reference).
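To make that concrete, here is a minimal sketch of the idea, not the actual Brevitas `torch_handler.py` implementation; the function names and the `(x, scale, zero_point)` argument layout are hypothetical and stand in for the fields of a QuantTensor:

```python
import torch

# Sketch only: for per-channel quantization, squeeze/unsqueeze stays
# affine-quantization invariant as long as the same shape change is
# applied to the scale and zero-point tensors, so that
# q = round(x / scale) + zero_point still broadcasts correctly.

def unsqueeze_quant_invariant(x, scale, zero_point, dim):
    # Apply the identical unsqueeze to value, scale, and zero point.
    return x.unsqueeze(dim), scale.unsqueeze(dim), zero_point.unsqueeze(dim)

def squeeze_quant_invariant(x, scale, zero_point, dim):
    # Same idea in the other direction.
    return x.squeeze(dim), scale.squeeze(dim), zero_point.squeeze(dim)

# By contrast, reshape/flatten can merge the per-channel axis with other
# axes, so no shape change of scale/zero_point preserves the mapping.
# A fallback is to dequantize, apply the shape op in float, and then
# requantize (recomputing scale/zero_point), accepting some precision loss:
def flatten_via_dequant(x_int, scale, zero_point, start_dim=1):
    x_fp = (x_int - zero_point) * scale   # dequantize
    return torch.flatten(x_fp, start_dim) # shape op in float; requantize after
```

The key design point is that squeeze/unsqueeze only inserts or removes size-1 dimensions, so the per-channel axis is never merged with another axis and the scale/zero-point tensors can track the shape change exactly.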
A PR has been submitted to solve this issue. Your comments are highly appreciated, many thanks.