Hi @moribots,
I am reproducing this project and building on it for a university project, but there is an important discrepancy I can't figure out.
When I run the code as-is, I get a reward per step of 5.4 during the first training steps. However, in Figure 6 of your paper (see attached figure), the reward starts at -2.0 and never exceeds 0.5. Could you tell me where this difference in reward comes from?
Thanks in advance!
Leon