-
Notifications
You must be signed in to change notification settings - Fork 331
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About reward and action in pursuit #72
Comments
u can see the reward about the predator. i remembered the predator should get the positive reward when they attacked. Meanwhile, when they surrounded the preys they can attack the prey and get positive reward. i dont know if i get the true realization. i wish this can help u. |
Just one predator in the map, can this predator get a positive reward? or can the predator attack any preys if and only if there is one predator in the map? |
I generated a game with a 10 * 10 map - pursuit. There are one predator with my own a2c model and two preys with random actor. By training, predator's total reward per episode converges to zero, never higher than zero. Does it mean predator never chooses to attack any preys? How can predator get a positive reward?
The text was updated successfully, but these errors were encountered: