Add a mapping function in image_reader.py and image_writer.py #7769

staydelight · 2024-05-14T09:04:22Z

Add a function to create a JSON file that maps input and output paths.

Fixes #7557.

Description

A few sentences describing the changes proposed in this pull request.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
In-line docstrings updated.
Documentation updated, tested make html command in the docs/ folder.

KumoLiu · 2024-05-15T03:51:14Z

monai/data/image_reader.py

@@ -148,6 +149,25 @@ def _stack_images(image_list: list, meta_dict: dict):
 return np.stack(image_list, axis=0)


+def update_json(input_file=None, output_file=None):
+ record_path = "img-label.json"


Hi @staydelight, thank you for the PR. I have a few concerns:

Do we really need to make any changes to read images? It seems that we can already support reading paired data using LoadImaged.

I suggest adding a flag to SaveImage to allow users to choose whether or not to write a JSON file.

Can you clarify the purpose of the record_path? We can directly obtain the label path from the input of SaveImage, and for the image path, we can retrieve it from the metadata of the data (since we have introduced MetaTensor).

Let me know if you need further clarification on any of these points.

ericspod · 2024-05-15

Hi @staydelight, thanks as well, but I also have concerns similar to @KumoLiu's. In the general case users are not going to want these JSON files generated, so I don't think we should add this to the readers and writers at all. This is very use-case specific, so writing a custom transform or doing things in a different way would be the solution.

One other thing to mention is that it's not thread-safe since multiple parallel transforms may be reading/writing the file at the same time. In this case you also cannot rely on the ordering of LoadImage/SaveImage operations to ensure you match the right input with output.

As mentioned you can access original paths with the metadata present in the MetaTensor objects:

    trans = monai.transforms.LoadImaged(keys="image")
    d = trans({"image": "/path/to/file.nii.gz"})
    print(d["image"].meta["filename_or_obj"])

It should be possible to access this value in the postprocessing transform sequence where SaveImage is used, since the network output should be a MetaTensor with these values included. You should be able to define a transform after SaveImage which logs these values to a file. This would be a much more modular approach than adding specific code to the loader/saver classes, so I'd strongly suggest investigating how best to go about that.
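A minimal sketch of the kind of post-SaveImage transform suggested above. It is duck-typed so that it runs without MONAI: any object exposing a `.meta` dict stands in for a MetaTensor. The class name `CollectPathMapping`, its `records` attribute, and the use of a `"saved_to"` metadata key for the output path are assumptions made for illustration; only `"filename_or_obj"` comes from the discussion.

```python
import json
from types import SimpleNamespace


class CollectPathMapping:
    """Hypothetical transform: record input/output paths found in metadata."""

    def __init__(self, input_key="filename_or_obj", output_key="saved_to"):
        # "saved_to" is an assumed key name for the path written by the saver.
        self.input_key = input_key
        self.output_key = output_key
        self.records = []

    def __call__(self, img):
        # Anything with a `.meta` dict works; fall back to an empty dict.
        meta = getattr(img, "meta", None) or {}
        self.records.append(
            {
                "input": str(meta.get(self.input_key, "")),
                "output": str(meta.get(self.output_key, "")),
            }
        )
        return img  # pass the data through unchanged

    def dump(self, path):
        # Write the whole mapping once, after the pipeline has finished.
        with open(path, "w") as f:
            json.dump(self.records, f, indent=4)


# Stand-in for a network output carrying metadata (not a real MetaTensor).
fake_output = SimpleNamespace(
    meta={"filename_or_obj": "/path/to/file.nii.gz", "saved_to": "/out/file_seg.nii.gz"}
)
collector = CollectPathMapping()
result = collector(fake_output)
```

Because the records are held in memory and written once by `dump`, this sketch only suits a single-process pipeline; it does not address the parallel case raised above.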

staydelight · 2024-05-31

Hi @ericspod @KumoLiu, thank you for your advice. According to what you said, since the network output should be a MetaTensor, would it be possible to add a save_log function to SaveImage like this:

    self.log_data.append({
        "input": meta_data.get("filename_or_obj", "(unknown)"),
        "output": filename
    })

    def save_log(self):
        try:
            with open(self.log_json_path, 'r') as f:
                existing_log_data = json.load(f)
        except FileNotFoundError:
            existing_log_data = []
        with open(self.log_json_path, 'w') as f:
            existing_log_data.extend(self.log_data)
            json.dump(existing_log_data, f, indent=4)

KumoLiu · 2024-05-31

It seems like a workable solution, but it involves repeatedly reading and writing JSON files. I haven't thought of a better way yet. We could add a utility function, but it might not be very effective. @ericspod, do you have any better suggestions for saving a mapping?

BTW, the save path can also be recorded in the metadata:

MONAI/monai/transforms/io/array.py

Line 507 in 4029c42

if self.savepath_in_metadict and meta_data is not None:
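One possible way to avoid re-reading and rewriting the whole JSON file on every save is an append-only JSON Lines file, with one record per line. The following is a sketch only: the helper names `append_record` and `read_records` and the file name `mapping.jsonl` are hypothetical, and appending alone does not make concurrent writers safe.

```python
import json
import os
import tempfile


def append_record(path, input_path, output_path):
    """Append one mapping record as a single JSON line (hypothetical helper)."""
    with open(path, "a") as f:
        f.write(json.dumps({"input": input_path, "output": output_path}) + "\n")


def read_records(path):
    """Read all records back; each non-empty line is one JSON object."""
    with open(path) as f:
        return [json.loads(line) for line in f if line.strip()]


# Example: two saves produce two lines, with no read-modify-write cycle.
log_path = os.path.join(tempfile.mkdtemp(), "mapping.jsonl")
append_record(log_path, "/path/to/a.nii.gz", "/out/a_seg.nii.gz")
append_record(log_path, "/path/to/b.nii.gz", "/out/b_seg.nii.gz")
records = read_records(log_path)
```

Each save touches only the end of the file, so the cost per record stays constant instead of growing with the size of the log.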

ericspod · 2024-06-03

This is a similar sort of change to the one we discussed for the input-output mapping file generation in another PR. I think this would be better implemented in a transform that appears after SaveImage(d) in your pipeline and handles saving to file(s). The code here can be put into that transform, keeping SaveImage(d) focused on saving the image data only. You would also have to take multiprocessing into account and write to a separate file for each subprocess; this implementation as-is contains a race condition if multiple processes attempt to save to the same file.
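The per-subprocess idea can be sketched as follows, assuming each worker is a separate process. The helper names (`process_log_path`, `log_mapping`, `merge_logs`) and the `mapping_<pid>.jsonl` naming scheme are hypothetical and not part of MONAI.

```python
import glob
import json
import os
import tempfile


def process_log_path(log_dir, prefix="mapping"):
    """Return a log file path unique to the current process (hypothetical naming)."""
    return os.path.join(log_dir, f"{prefix}_{os.getpid()}.jsonl")


def log_mapping(log_dir, input_path, output_path):
    """Append one record to this process's own file, so no file is shared."""
    with open(process_log_path(log_dir), "a") as f:
        f.write(json.dumps({"input": input_path, "output": output_path}) + "\n")


def merge_logs(log_dir, prefix="mapping"):
    """Combine all per-process files into one list after the workers finish."""
    merged = []
    for path in sorted(glob.glob(os.path.join(log_dir, f"{prefix}_*.jsonl"))):
        with open(path) as f:
            merged.extend(json.loads(line) for line in f if line.strip())
    return merged


log_dir = tempfile.mkdtemp()
log_mapping(log_dir, "/path/to/a.nii.gz", "/out/a_seg.nii.gz")
merged = merge_logs(log_dir)
```

Threads within one process share a process ID, so this naming scheme separates processes only; a threaded pipeline would still need a lock or a per-thread suffix.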

staydelight closed this May 14, 2024

staydelight reopened this May 14, 2024

staydelight force-pushed the fix-issue-7557 branch from cedd53e to 93ccbdb Compare May 14, 2024 09:36

KumoLiu reviewed May 15, 2024

staydelight and others added 21 commits June 13, 2024 23:00

Fixes Project-MONAI#7557

afae503

Add a function to create a JSON file that maps input and output paths. Signed-off-by: staydelight <[email protected]>

Fixes Project-MONAI#7557

542a77d

Remove changes unrelated to this issue. Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

e9f7565

for more information, see https://pre-commit.ci

Fixes Project-MONAI#7557

7969d21

Remove changes unrelated to this issue. Signed-off-by: staydelight <[email protected]>

Fixes Project-MONAI#7557

3ce5f30

Remove changes unrelated to this issue. Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

0699eeb

for more information, see https://pre-commit.ci

fix-issue-7557

274cd04

Signed-off-by: staydelight <[email protected]>

fix-issue-7557

d4fb0b7

Signed-off-by: staydelight <[email protected]>

Fixes Project-MONAI#7557

bfb6d58

Add code for generating a mapping json file. Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

5ab2521

for more information, see https://pre-commit.ci

Fixes Project-MONAI#7557

894854d

Change mapping_json_path init way. Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

c372225

for more information, see https://pre-commit.ci

Fixes Project-MONAI#7557

682379b

Fixing unsuccessful checks. Signed-off-by: staydelight <[email protected]>

Fixes Project-MONAI#7557

56d8df5

Fixes unseccessful ckecks. (if mapping_json_path is not None) Signed-off-by: staydelight <[email protected]>

Fixes Project-MONAI#7557

8bab11b

Signed-off-by: staydelight <[email protected]>

fix-issue-7557

ca48fec

Signed-off-by: staydelight <[email protected]>

fix-issue-7557

117dd78

Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

3908cdd

for more information, see https://pre-commit.ci

fix-issue-7557

36e5af0

Signed-off-by: staydelight <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a3da38

for more information, see https://pre-commit.ci

fix-issue-7557

37d19ed

Signed-off-by: staydelight <[email protected]>

staydelight force-pushed the fix-issue-7557 branch from f4520d1 to 37d19ed Compare June 13, 2024 15:02

KumoLiu May 15, 2024

ericspod May 15, 2024

staydelight May 31, 2024

KumoLiu May 31, 2024

ericspod Jun 3, 2024
