
[InferenceSlicer] - allow batch size inference #781

Open
inakierregueab opened this issue Jan 25, 2024 · 17 comments
Labels
enhancement (New feature or request), Q2.2024 (Tasks planned for execution in Q2 2024)

Comments

@inakierregueab

inakierregueab commented Jan 25, 2024

Description

Currently, sv.InferenceSlicer processes each slice in a separate callback call, which prevents inference with a batch size larger than 1. We can change this (a sketch follows the list below) by:

  • Batching Slices: Instead of submitting individual tasks for each slice, group slices into batches. batch_size can be a new parameter for the InferenceSlicer class.
  • Modifying the Callback: Ensure the callback function can handle a batch of slices instead of a single slice. This means changing the callback signature from callback: Callable[[np.ndarray], Detections] to callback: Callable[[List[np.ndarray]], List[Detections]].
  • Collecting and Merging Results: After processing, you must appropriately collect and merge the results from the batches.
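
A minimal sketch of these three steps, assuming a precomputed integer offsets array in xyxy format; run_slicer_batched and its parameters are hypothetical names for illustration, while sv.Detections and sv.Detections.merge are existing supervision APIs:

```python
# Hypothetical sketch, not supervision internals: `run_slicer_batched` and the
# precomputed `offsets` array are illustrative names only.
from typing import Callable, List

import numpy as np
import supervision as sv


def run_slicer_batched(
    image: np.ndarray,
    callback: Callable[[List[np.ndarray]], List[sv.Detections]],
    offsets: np.ndarray,  # (N, 4) integer array of xyxy slice coordinates
    batch_size: int = 4,
) -> sv.Detections:
    detections_list: List[sv.Detections] = []
    for start in range(0, len(offsets), batch_size):
        batch_offsets = offsets[start:start + batch_size]
        # Batching slices: crop every slice in the batch before a single callback call.
        slices = [
            image[y_min:y_max, x_min:x_max]
            for x_min, y_min, x_max, y_max in batch_offsets
        ]
        # Modified callback: a list of slices in, a list of Detections out.
        batch_detections = callback(slices)
        # Collecting and merging: shift each result back into full-image
        # coordinates, then gather it.
        for detections, (x_min, y_min, _, _) in zip(batch_detections, batch_offsets):
            detections.xyxy = detections.xyxy + np.array([x_min, y_min, x_min, y_min])
            detections_list.append(detections)
    # The real slicer would also apply NMS to the merged result.
    return sv.Detections.merge(detections_list)
```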

Additional

  • Note: Please share a Google Colab with minimal code to test the new feature. We know it's additional work, but it will definitely speed up the review process. Each change must be tested by the reviewer. Setting up a local environment to do this is time-consuming. Please ensure that Google Colab can be accessed without any issues (make it public). Thank you! 🙏🏻
inakierregueab added the enhancement (New feature or request) label on Jan 25, 2024
@SkalskiP
Collaborator

Hi, @inakierregueab 👋🏻 That is something we were considering but didn't implement due to time restrictions. Let me add some details to this issue. Maybe someone will pick it up.

SkalskiP changed the title from "Batch size inference slicer" to "[InferenceSlicer] - allow batch size inference" on Jan 25, 2024
SkalskiP added the Q1.2024 (Tasks planned for execution in Q1 2024) label on Jan 25, 2024
@Bhavay-2001

Hi @SkalskiP, can I work on this issue if it's suitable for beginners?
Thanks

@SkalskiP
Collaborator

Hi, @Bhavay-2001 👋🏻 Do you already have experience with running model inference at different batch sizes?

@Bhavay-2001

Hi @SkalskiP, yes I think I can manage that. Can you please let me know how to proceed with this? Thanks

@SkalskiP
Collaborator

Great! Do you have any specific questions?

@Bhavay-2001

Hi @SkalskiP, how should I add the batch_size feature to the InferenceSlicer class? How can I test it in Google Colab? Any starting point that helps me get on track would be appreciated.

@SkalskiP
Collaborator

I outlined the key steps needed to add batch_size support in the task description. I think you should just try to implement it, get a first working version, and submit a PR so we can review it.

@Bhavay-2001

Hi @SkalskiP, could you please point me to a code sample that has already been implemented and provides the batch_size functionality?

@SkalskiP
Collaborator

@Bhavay-2001, I'm afraid we do not have a code sample. Implementing batch inference is exactly what this task is meant to deliver. :/

@Bhavay-2001

@SkalskiP, what I am thinking of doing is implementing a for loop over the batch of images. Each image is then passed to the model, the detections are collected, and at the end the detections for the whole batch are returned. A rough sketch of this idea is below.
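
A minimal sketch of that loop, assuming an Ultralytics model purely as an example detector; note that calling the model once per image satisfies a batched callback signature but does not give the model a true batched forward pass:

```python
# Sketch of the per-image loop described above (illustrative only).
from typing import List

import numpy as np
import supervision as sv
from ultralytics import YOLO

model = YOLO("yolov8n.pt")  # example detector; any model could stand in here


def batched_callback(images: List[np.ndarray]) -> List[sv.Detections]:
    detections: List[sv.Detections] = []
    for image in images:
        # One forward pass per image; results are collected for the whole batch.
        result = model(image)[0]
        detections.append(sv.Detections.from_ultralytics(result))
    return detections
```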

@Bhavay-2001

Hi @SkalskiP, can you please review this PR?

@Bhavay-2001

Hi @SkalskiP, could you please review it and let me know? Thanks

SkalskiP added the Q2.2024 (Tasks planned for execution in Q2 2024) label and removed the Q1.2024 (Tasks planned for execution in Q1 2024) label on Apr 8, 2024
@LinasKo
Collaborator

LinasKo commented Apr 10, 2024

SkalskiP and I had a conversation about this; I'll take over for now.

@LinasKo
Collaborator

LinasKo commented Apr 10, 2024

Intermediate results:

  1. I've confirmed that threads help, especially when the model is run on the CPU. I see a 5-10x performance improvement.
  2. I've implemented the batched inference slicer, allowing callbacks that accept either a single image or a list of images.
  3. The threading implementation is kept, and the docs point to either batch=N; threads=1 or batch=1; threads=N, depending on GPU / CPU needs (see the sketch below).
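
To illustrate point 3, a hedged usage sketch; the parameter names batch_size and thread_workers are assumptions based on this thread, and the final API in the PR is the source of truth:

```python
# Illustrative only: assumes `batch_size` and `thread_workers` parameters as
# discussed in this thread; check the merged PR / docs for the final API.
from typing import List

import numpy as np
import supervision as sv
from ultralytics import YOLO

model = YOLO("yolov8n.pt")


def callback(images: List[np.ndarray]) -> List[sv.Detections]:
    # Batched callback: list of slices in, list of Detections out.
    return [sv.Detections.from_ultralytics(r) for r in model(images)]


# GPU-leaning setup: larger batches, a single worker thread.
gpu_slicer = sv.InferenceSlicer(callback=callback, batch_size=8, thread_workers=1)

# CPU-leaning setup: single-slice batches, several worker threads.
cpu_slicer = sv.InferenceSlicer(callback=callback, batch_size=1, thread_workers=8)

image = np.zeros((1080, 1920, 3), dtype=np.uint8)  # placeholder image
detections = gpu_slicer(image)
```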

Testing more broadly, however, provides mixed results.

  1. On my machine, batching provides a speed boost for ultralytics, but does nothing for transformers (GPU) or inference (CPU, I believe).
  2. Using threads=8 slows down the ultralytics batch=1 case compared to threads=1, but only on my machine; in Colab it's faster.

Still checking transformers: there's an obvious speedup on GPU, but I ran out of memory when trying batching.

Colab coming soon.

@LinasKo
Collaborator

LinasKo commented Apr 10, 2024

https://colab.research.google.com/drive/1j85QErM74VCSLADoGliM296q4GFUdnGM?usp=sharing

As you can see, in these tests it only helped the Ultralytics case.

Known insufficiencies:

  • The first Inference model is fit for vehicle detection but is tested on an image with people.
  • No image to check how well it performed.
  • No tests for auto-batch case (when max_batch_size=-1).
  • Missing examples in the docstring: normal vs. batch callback (a sketch follows this list).
  • No improvements to NMS efficiency.
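
For the docstring gap noted above, a hedged sketch of what the two callback shapes could look like (Ultralytics is used only as an example detector):

```python
# Sketch of the two callback shapes (illustrative only).
from typing import List

import numpy as np
import supervision as sv
from ultralytics import YOLO

model = YOLO("yolov8n.pt")


def single_callback(image: np.ndarray) -> sv.Detections:
    # Normal callback: one slice in, one Detections out.
    return sv.Detections.from_ultralytics(model(image)[0])


def batch_callback(images: List[np.ndarray]) -> List[sv.Detections]:
    # Batch callback: a list of slices in, one Detections per slice out,
    # letting the model run a single batched forward pass over all slices.
    return [sv.Detections.from_ultralytics(r) for r in model(images)]
```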

@LinasKo
Collaborator

LinasKo commented Apr 10, 2024

PR: #1108

@LinasKo
Collaborator

LinasKo commented Apr 11, 2024

@SkalskiP,
Ready for review, details in #1108.
