Skip to content

For a model of Markov Decision Process, Policy creation via two methods : Value Iteration and Linear Programming

Notifications You must be signed in to change notification settings

chaitanya100100/MDP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MDP

For a model of Markov Decision Process, Policy creation via two methods : Value Iteration and Linear Programming

Model Description

Model world has 4*4 block grid, one positive terminal state, one negative terminal state. Here is total description of the world.

Value Iteration

  • value_iteration.py contains the code which runs value iteration algorithm to find the utilities of all states and then final policy. It prints the result of every iteration.
  • value_iteration2.py has the same code but with different world model than given problem statement.

Linear Programming

  • It is another approach to find policy. LP.ods has the linear solver's output.
  • Final output of the solver gives expected utility of start state and policy for the world.

About

For a model of Markov Decision Process, Policy creation via two methods : Value Iteration and Linear Programming

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages