-
-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TD time step parameter #87
Comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
currently multi-step TD has an incorrect parameter (JuliaReinforcementLearning/ReinforcementLearning.jl#648).
ReinforcementLearningAnIntroduction.jl/notebooks/Chapter09_Random_Walk.jl
Lines 193 to 216 in e83f540
as an example, the
n
is used as the number of time steps. however it currently corresponds to the number of time steps plus one.run_once(1, α)
thus is notTD(0)
which has a time step parameter of 1, but rather a 2-step TD method. depending on how upstream is resolved an update might be needed here.The text was updated successfully, but these errors were encountered: