You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is it expected that elasticity that is provided via horovod would be such that if one of the worker is killed, and then it is restarted again(say restartPolicy of MPIJob being OnFailure then that would be considered by the launcher?
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Is it expected that elasticity that is provided via horovod would be such that if one of the worker is killed, and then it is restarted again(say restartPolicy of MPIJob being
OnFailure
then that would be considered by the launcher?Beta Was this translation helpful? Give feedback.
All reactions