[Feature Request] Muzero and MCTS implementations #1845

Open
1 task done
Prakyathkantharaju opened this issue Jan 29, 2024 · 1 comment

@Prakyathkantharaju

Motivation

It would be great to have MCTS and AlphaZero implementations, along with other model-based RL algorithms, for benchmarking and comparison.

Solution

I can write a loss function for this policy.

Alternatives

Few RL libraries provide a baseline implementation of MuZero.

Additional context

None.

Checklist

  • I have checked that there is no similar issue in the repo (required)
Prakyathkantharaju added the enhancement (New feature or request) label Jan 29, 2024
@vmoens
Contributor

vmoens commented Jan 29, 2024

Interestingly, someone just dropped a suggestion to help us implement AlphaZero: #1844.
If you want to collaborate or follow the progress, I'd suggest joining our Discord here; I just created an MCTS channel!
