Skip to content

Why changing logger arguments affects checkpoint directory? #259

Answered by ashleve
Yevgnen asked this question in Q&A
Discussion options

You must be logged in to vote

As you can see from many issues, multirun + DDP currently doesn't work correctly. This is due to the way hydra creates a new work dir for every run, which is not compatible with DDP which runs new process for each GPU.

A while ago I made an issue about in lightning repo, read it if you want to learn more: Lightning-AI/pytorch-lightning#11300

I don't see anything we can do about it for now. It's being worked on, so the future release might fix it.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Yevgnen
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants