High Initial RAM Usage Leads to Crashes #338

Open
Sypherd opened this issue Aug 9, 2023 · 2 comments

Comments

@Sypherd

Sypherd commented Aug 9, 2023

I've been downloading select URLs from LAION-400M, LAION-5B, and SBU, and I've noticed a significant spike in RAM usage on startup that crashes instances with <=32GB of RAM, such as AWS's c6i.4xlarge. Once img2dataset is running, however, RAM usage stays very low. I'd love it if we could mitigate that initial spike so that lower-RAM instances can be used throughout. Here's a screenshot from wandb.ai showing the initial spike on a 64GB instance:
image
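
For anyone who wants to reproduce the spike without wandb, here is a minimal sketch of how peak memory around a startup step could be recorded using only the Python standard library. It is not part of img2dataset: `peak_rss_mib` and `run_and_report` are hypothetical helper names, the `resource` module is Unix-only, and `RUSAGE_SELF` covers only the calling process, so memory used by worker subprocesses would not show up here.

```python
import resource
import sys


def peak_rss_mib():
    """Return the peak resident set size of this process so far, in MiB."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in KiB on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def run_and_report(workload):
    """Run workload() and return (result, peak_before_mib, peak_after_mib).

    `workload` is any zero-argument callable (hypothetical stand-in for
    whatever step is suspected of causing the startup spike).
    """
    before = peak_rss_mib()
    result = workload()
    after = peak_rss_mib()
    return result, before, after


if __name__ == "__main__":
    # Stand-in workload: build a large in-memory buffer, then report the peak.
    _, before, after = run_and_report(lambda: len(b"\x01" * (64 * 1024 * 1024)))
    print(f"peak RSS before: {before:.1f} MiB, after: {after:.1f} MiB")
```

Because the peak value never decreases, a large jump between `before` and `after` suggests the wrapped step is where the main process allocates the most memory. If the numbers stay flat, the spike is more likely in the worker processes.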

@Sypherd
Author

Sypherd commented Aug 9, 2023

Here's another sample from a crashed c6i.4xlarge instance where we can see available process memory approach 0 before crashing:
image
The crashes may have a different cause, but I have not yet been able to run img2dataset on a c6i.4xlarge instance.

@rom1504
Owner

rom1504 commented Aug 9, 2023 via email
