New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tensorflow Saved model not portable with latest tf.keras.optimizers #4028
Labels
Comments
supercharleszhu
changed the title
Saved model not portable with HorovodAllReduceOps
Tensorflow Saved model not portable with HorovodAllReduceOps
Mar 11, 2024
supercharleszhu
changed the title
Tensorflow Saved model not portable with HorovodAllReduceOps
Tensorflow Saved model not portable with HorovodAllReduce Ops
Mar 11, 2024
supercharleszhu
changed the title
Tensorflow Saved model not portable with HorovodAllReduce Ops
Tensorflow Saved model not portable with latest tf.keras.optimizers
Mar 15, 2024
4 tasks
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Environment:
Checklist:
Bug report:
Please describe erroneous behavior you're observing and steps to reproduce it.
We met an issue after running TF Training w/ horovod in both CPU and GPU execution. The tf saved model is not loadable outside Horovod environment because HorovodAllReduce seems to be saved unexpected.
Ways to reproduce: running the following script for a simple keras model in the test case and saving it
and run
python test.py
Then loading the model without horovd being imported
and run
python test_2.py
it will return
Note: Reverting to Horovod 0.26 or tf.keras.optimizer.legacy will resolve this issue. But we want to use latest horovod instead.
The text was updated successfully, but these errors were encountered: