GitHub - razor08/RL-Project: Implementation of REINFORCE with Baseline and Monte-Carlo Tree Search algorithms along with Multi-Armed Bandits.

We present implementations of the REINFORCE with Baseline and Monte-Carlo Tree Search algorithms on three MDPs: Cartpole, CS687-Gridworld, and Mountain Car. For extra credit, we implemented a previously unexplored MDP, Mountain Car, and we present a performance analysis of four algorithms on multi-armed bandits: Epsilon-Greedy, Epsilon-Decreasing Greedy, Upper Confidence Bound (UCB), and Thompson Sampling.
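
As a rough illustration of two of the bandit strategies named above, here is a minimal, self-contained sketch of Epsilon-Greedy and UCB1 on a Bernoulli bandit. It is not taken from `mab_algorithms.py`. The function names (`run_bandit`, `epsilon_greedy`, `ucb1`), the arm probabilities in `true_means`, and the step count are all assumptions made for this example.

```python
import math
import random


def run_bandit(select_arm, true_means, steps, seed=0):
    """Play a Bernoulli bandit for `steps` rounds; return (pull counts, value estimates)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # number of pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm
    for t in range(1, steps + 1):
        arm = select_arm(counts, values, t, rng)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean update
    return counts, values


def epsilon_greedy(epsilon):
    """Build a selector that explores uniformly with probability `epsilon`."""
    def select(counts, values, t, rng):
        if rng.random() < epsilon:
            return rng.randrange(len(values))  # explore: random arm
        return max(range(len(values)), key=values.__getitem__)  # exploit: best estimate
    return select


def ucb1(counts, values, t, rng):
    """UCB1 selector: pull each arm once, then maximize mean plus confidence bonus."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(
        range(len(values)),
        key=lambda a: values[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
    )


# Hypothetical arm success probabilities, chosen only for this illustration.
true_means = [0.2, 0.5, 0.8]
eg_counts, eg_values = run_bandit(epsilon_greedy(0.1), true_means, steps=5000)
ucb_counts, ucb_values = run_bandit(ucb1, true_means, steps=5000)
print("epsilon-greedy pulls per arm:", eg_counts)
print("UCB1 pulls per arm:", ucb_counts)
```

Both selectors share the same signature, so other strategies (for example a decaying epsilon or Thompson Sampling) could be dropped into `run_bandit` the same way.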

Folders and files (27 commits):

mcts_figs
multi_armed_bandits
reinforce_with_baseline
.gitignore
CS687-Gridworld.png
CS687FinalReport.pdf
README.md
cartpole.py
cs687_gridworld.py
mab_algorithms.py
mcts.py
mcts_main.py
mountain_car.py
multi_armed_bandits.py
reinforce_baseline.py
