The problem of reward performance #115

Louise599 · 2024-05-08T03:12:49Z

Hello, thank you very much for open sourcing such a great project. I am running the code:
python experiments.py evaluate configs/IntersectionEnv/env.json using the command
configs/IntersectionEnv/agents/DQNAgent/baseline.json
--train --episodes=4000 --name-from-config,
the reward graph I get is unstable. I hope to get your help, thanks a lot!

kongxincaizi · 2024-06-05T02:29:13Z

hi，I also encountered the same problem.
In tensorboard, all of my curves did not converge.
Have you solved this problem now？

kongxincaizi · 2024-06-05T04:06:22Z

Perhaps I have found a solution. Adjust the smoothness index in tensorboard

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The problem of reward performance #115

The problem of reward performance #115

Louise599 commented May 8, 2024

kongxincaizi commented Jun 5, 2024

kongxincaizi commented Jun 5, 2024

The problem of reward performance #115

The problem of reward performance #115

Comments

Louise599 commented May 8, 2024

kongxincaizi commented Jun 5, 2024

kongxincaizi commented Jun 5, 2024