Add support slice function, padding the mask to the full images and t… #881

Open · wants to merge 10 commits into base: develop
Conversation

AdonaiVera · Contributor

Description

This pull request introduces segmentation support for InferenceSlicer. It is the second part of the discussion in issue #678.

First, I added a conditional statement to pad slices that are smaller than the slice size, such as the ones at the image corners. Next, I created the _apply_padding_to_slice function, which pads a slice using the letterbox resizing method.
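To make the padding step concrete, here is a minimal sketch of letterbox-style padding for undersized edge slices. The function name, signature, and the fill value 114 are assumptions for illustration; the actual _apply_padding_to_slice in this PR may differ.

```python
import numpy as np


def pad_slice_to_size(
    slice_image: np.ndarray, target_wh: tuple, pad_value: int = 114
) -> np.ndarray:
    """Pad a slice on its right/bottom edges so it reaches (target_w, target_h).

    Hypothetical sketch of letterbox padding for slices near the image border;
    the original pixels stay at the top-left, padding fills the remainder.
    """
    target_w, target_h = target_wh
    h, w = slice_image.shape[:2]
    pad_bottom = max(target_h - h, 0)
    pad_right = max(target_w - w, 0)
    return np.pad(
        slice_image,
        ((0, pad_bottom), (0, pad_right), (0, 0)),
        mode="constant",
        constant_values=pad_value,
    )
```

Padding only on the right and bottom keeps the slice's top-left corner anchored, so detection coordinates within the slice stay valid without an extra shift.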

We adjust the masks so they align with the coordinate system of the full image. Most of each mask is empty except for the specific area it covers. This simplifies merging all the masks, but it also requires more memory, since each mask becomes as large as the original image. In addition, the merge function currently uses a concatenate call to join all the masks.
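The relocation step described above can be sketched as follows. This is a hedged illustration of what a move_masks-style helper could look like, not the PR's actual implementation; the name and signature are assumptions.

```python
import numpy as np


def move_masks(
    masks: np.ndarray, offset: np.ndarray, image_wh: tuple
) -> np.ndarray:
    """Place slice-local masks onto full-image canvases at the given offset.

    masks: (n, mask_h, mask_w) boolean array in slice coordinates.
    offset: (x, y) top-left position of the slice in the full image.
    Returns an (n, image_h, image_w) array, mostly empty, which is why
    memory usage grows with the number of detections.
    """
    image_w, image_h = image_wh
    n, mask_h, mask_w = masks.shape
    full = np.zeros((n, image_h, image_w), dtype=masks.dtype)
    x, y = int(offset[0]), int(offset[1])
    # Clip in case a slice extends past the image border.
    h = min(mask_h, image_h - y)
    w = min(mask_w, image_w - x)
    full[:, y : y + h, x : x + w] = masks[:, :h, :w]
    return full
```

The zero-filled canvas per detection is exactly the memory cost discussed in this PR: n full-image masks instead of n slice-sized ones.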

The main difficulty is the high computational cost and runtime of this method. The algorithm has to fit every mask detection onto the entire image, which significantly inflates the size of the data.

In my opinion, this PR is not ready for production yet, as we still need to address the computational cost of the masks. However, I would like to hear your perspective, @SkalskiP.

Currently, the method is consuming a lot of RAM.

- load image ✅ 
- slice the image into NxN tiles ✅ 
- surround smaller slices (the ones close to the edges) with a letterbox so that all tiles are NxN
- loop over slices ✅ 
   - run inference ✅ 
- update box coordinate values to match the image coordinate system, not the slice coordinate system ✅ 
- pad masks to match the image coordinate system, not the slice coordinate system ✅ 
- merge detections ✅ 
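The tiling step in the checklist above amounts to computing top-left offsets for each slice. A minimal sketch, assuming a simple stride-based scheme with optional overlap (names and parameters are illustrative, not the library's API):

```python
def generate_offsets(image_wh, slice_wh, overlap_wh=(0, 0)):
    """Return the (x, y) top-left offsets of NxN tiles covering the image.

    Edge tiles may be smaller than slice_wh; those are the ones that
    need letterbox padding before inference.
    """
    image_w, image_h = image_wh
    slice_w, slice_h = slice_wh
    stride_x = slice_w - overlap_wh[0]
    stride_y = slice_h - overlap_wh[1]
    offsets = []
    for y in range(0, image_h, stride_y):
        for x in range(0, image_w, stride_x):
            offsets.append((x, y))
    return offsets
```

Each offset is later added to box coordinates (and used to place masks) to map slice-local detections back into the full-image coordinate system.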

In addition, this PR includes comprehensive unit tests for the new move_masks functionality and a demo that shows the algorithm's application and effectiveness in real-world scenarios.

Type of change

New feature (non-breaking change which adds functionality)

How has this change been tested

I created a demo to showcase this functionality here.

@AdonaiVera · Contributor Author

Hi @SkalskiP 👋

I was discussing with Tal Yagev, a Roboflow user, the issue of large images consuming too much RAM. I tried several approaches to reduce RAM consumption, such as merging the masks in batches, incremental stacking with lists, and generator expressions. However, none of these approaches eliminated the RAM peak that crashed the code.

After further investigation, I found that the issue was the NMS (non-maximum suppression) step in the segmentation path, which consumes a lot of RAM when the image is sliced. I therefore suggest two changes and would like to hear your perspective on them.

The first change is to modify the code from this:

```python
with ThreadPoolExecutor(max_workers=self.thread_workers) as executor:
    futures = [
        executor.submit(self._run_callback, image, offset) for offset in offsets
    ]
    for future in as_completed(futures):
        detections_list.append(future.result())

return Detections.merge(detections_list=detections_list).with_nms(
    threshold=self.iou_threshold
)
```

to this:

```python
with ThreadPoolExecutor(max_workers=self.thread_workers) as executor:
    futures = [
        executor.submit(self._run_callback, image, offset) for offset in offsets
    ]
    for future in as_completed(futures):
        detections_list.append(
            future.result().with_nms(threshold=self.iou_threshold)
        )

return Detections.merge(detections_list=detections_list)
```

The reason for this change is that we don't need to run NMS over the merged full-image detections if we have already applied it to each slice.

The second change is optional: add a conditional in the with_nms function so the user can decide whether to run NMS on masks or on bounding boxes.

```python
if self.mask is not None and nms_mask:
    indices = mask_non_max_suppression(
        predictions=predictions, masks=self.mask, iou_threshold=threshold
    )
else:
    indices = box_non_max_suppression(
        predictions=predictions, iou_threshold=threshold
    )
```
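The mask branch above suppresses overlapping detections by mask overlap rather than box overlap. As a minimal sketch of the underlying idea (this is not supervision's mask_non_max_suppression, just an illustration of mask IoU), the pairwise score could be computed like this:

```python
import numpy as np


def mask_iou(mask_a: np.ndarray, mask_b: np.ndarray) -> float:
    """Intersection-over-union of two boolean masks of the same shape."""
    inter = np.logical_and(mask_a, mask_b).sum()
    union = np.logical_or(mask_a, mask_b).sum()
    return float(inter) / float(union) if union else 0.0
```

Mask IoU is tighter than box IoU for elongated or curved objects, which is why exposing the choice to the user can change which detections survive suppression.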

With these changes, the results improved significantly. I integrated a memory_profiler check in the demo to measure the improvements.
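The demo reportedly uses memory_profiler; for reference, a rough stdlib-only analogue of the same measurement can be built with tracemalloc (this is an illustrative sketch, not the demo's actual code, and tracemalloc only sees Python-level allocations, so its numbers differ from process-level RSS figures like those below):

```python
import tracemalloc


def peak_memory_mib(fn, *args):
    """Run fn(*args) and return the peak traced allocation in MiB."""
    tracemalloc.start()
    try:
        fn(*args)
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return peak / (1024 ** 2)


# Example: a 4 MiB allocation should register a peak of roughly 4 MiB.
mib = peak_memory_mib(lambda: bytearray(4 * 1024 * 1024))
```

Wrapping each slicer configuration in such a helper is one way to reproduce before/after comparisons like the table below without external dependencies.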

## RAM CONSUMPTION
### BEFORE
1. mask 1024 x 1024, peak memory: 7415.32 MiB, increment: 6418.33 MiB
2. mask 512 x 512 - Crash.
3. mask 256 x 256 - Crash.
4. mask 128 x 128 - Crash.

### AFTER
1. mask 1024 x 1024, peak memory: 2775.74 MiB, increment: 1775.71 MiB
2. mask 512 x 512, peak memory: 4574.13 MiB, increment: 2378.86 MiB
3. mask 256 x 256, peak memory: 5558.58 MiB, increment: 4558.59 MiB
4. mask 128 x 128, peak memory: 9688.52 MiB, increment: 4861.30 MiB

What do you think, Piotr?

On the other hand, I would like to create a cookbook for this functionality, because I believe it will be of great help to people who work with large images and need to detect small objects.
