[Bug Report] Vector env return value #3253

Root970103 · 2023-12-21T10:30:40Z

Describe the bug
Hi, I used the vector env api in gym to train Atari-PongNoFrameSkip-v4. After the agent interacts with the environment for a period of time, I discovered a strange phenomenon. The cumulative reward was 21.0, but the corresponding done status was still False.

An intuitive example is described below:

rewards: [21.0, 19.0, 17.0, 21.0, 18.0, 21.0]
done: [True, False, False, True, False, False]

In this case, the env reached the max reward cannot be set done. And the cumulative reward would increased. Is this situation normal?

System Info
gym==0.18

The text was updated successfully, but these errors were encountered:

pseudo-rnd-thoughts · 2023-12-25T20:52:42Z

Without more of your code it is difficult to tell what is happening also this is for v0.18 which is several years old so we wouldn't be updated any code unless this is still an issue now

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug Report] Vector env return value #3253

[Bug Report] Vector env return value #3253

Root970103 commented Dec 21, 2023

pseudo-rnd-thoughts commented Dec 25, 2023

[Bug Report] Vector env return value #3253

[Bug Report] Vector env return value #3253

Comments

Root970103 commented Dec 21, 2023

pseudo-rnd-thoughts commented Dec 25, 2023