Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feat]: Utilize multiple GPUs and increase thread count during image captioning #293

Open
jferments opened this issue May 8, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@jferments
Copy link

Describe your use-case.

I recently was using OneTrainer UI to auto-caption ~750k images using WD14 tagger. I have a dual GPU machine, but it was only utilizing one of them (and was running at very low VRAM/power usage). Additionally, it was barely utilizing CPU at all.

I was wondering if there is any way to make it more fully utilize CPU/GPU resources to speed up batch processing of images. I have a 3.7 million image dataset I need to do next, and it could cut many hours off my processing time to be able to use multiple GPUs.

What would you like to see as a solution?

An option to select multiple GPUs to use, as well as increasing thread count for captioning.

Have you considered alternatives? List them here.

No response

@jferments jferments added the enhancement New feature or request label May 8, 2024
@O-J1
Copy link

O-J1 commented May 8, 2024

No support. Most developers for OT do no think its worth it and ruled it out multiple times. PR's are welcome.

@jferments
Copy link
Author

Do you have links to tickets where they've rejected this idea? I didn't see any for it when I searched

@yggdrasil75
Copy link

#69 though I fully agree, using my 3080 and 3090 together would definitely be beneficial. especially when I want to train at 2048x2048 with better parameters.

@Disty0
Copy link

Disty0 commented May 9, 2024

You can run separate taggers at the same time. You will start to get CPU bottlenecked to 60~ images per second per tagger.

Run one part of the dataset on GPU0 and other on GPU1.
And do not run anything on the CPU since your CPU will get hit pretty hard with all of those image operations.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants