-
Notifications
You must be signed in to change notification settings - Fork 251
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
max_iteration parameters is not respected in the ValueIteration Algorithm #13
Comments
Hi, this is actually a feature. There is a method on the ValueIteration class called _boundIter that computes the upper bound of how many iterations the algorithm will need to complete given a certain tolerance. This tolerance is called However using the small example might help here:
You will notice that the smaller I would suggest that Hope this helps. |
Thanks for the explanation. It is very useful. The reason I wanted to specify the max_iterations is that I can observe the convergence of each algorithm and not all of them have epsilon. What worked for me is setting the max_iter after the object has been initialized, like
In this case it wont go beyond those 10 iterations. Great job, I liked playing with the forest example |
Ah yes, that is a good solution. Another option that might work is to modify the |
Using the forest example from the quickstart guide and modifying the max_iter does not seem to work.
It looks like it always defaults to 39 and stops.
I am experimenting with number of iterations to see how the algorithm is performing and it would be very beneficial to collect values at each iteration
Update:
Setting MDP to verbose shows that only 4 iterations were executed. So what does 39 means here?
The text was updated successfully, but these errors were encountered: