You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Interestingly someone just dropped a suggestion to help us implement alpha zero #1844
If you want to collaborate or follow the progress, i'd suggest to join our discord challenge here, I just created an MCTS channel!
Motivation
It would be great to have an MCTS and Alphazero implementation, including other model-based RL for benchmarking and comparison.
Solution
I can write a loss function of this policy.
Alternatives
There are limited RL libraries that have a base implementation of Muzero.
Additional context
None.
Checklist
The text was updated successfully, but these errors were encountered: