Multiple implementations of PPO using CartPole-v1 and a simple actor-critic model.
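For concreteness, an actor-critic model for CartPole-v1 can be as small as two two-layer heads. The sketch below is illustrative only; the layer sizes, names, and methods are assumptions, not this repo's exact architecture:

```python
import torch.nn as nn
from torch.distributions import Categorical

class ActorCritic(nn.Module):
    """Illustrative actor-critic with separate policy and value heads."""

    def __init__(self, state_dim=4, action_dim=2, hidden_dim=64):
        super().__init__()
        # Policy head: maps a state to a distribution over actions.
        self.actor = nn.Sequential(
            nn.Linear(state_dim, hidden_dim), nn.Tanh(),
            nn.Linear(hidden_dim, action_dim), nn.Softmax(dim=-1),
        )
        # Value head: maps a state to a scalar state-value estimate.
        self.critic = nn.Sequential(
            nn.Linear(state_dim, hidden_dim), nn.Tanh(),
            nn.Linear(hidden_dim, 1),
        )

    def act(self, state):
        """Sample an action and return it with its log-probability."""
        dist = Categorical(self.actor(state))
        action = dist.sample()
        return action, dist.log_prob(action)

    def evaluate(self, states, actions):
        """Log-probs, entropies, and values for a batch of transitions."""
        dist = Categorical(self.actor(states))
        return dist.log_prob(actions), dist.entropy(), self.critic(states).squeeze(-1)
```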
Instead of GAE, this implementation estimates advantages from plain Monte-Carlo returns. It is inspired heavily by PPO-Pytorch.
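Monte-Carlo estimation means the target for each step is the full discounted return of its episode, computed by scanning the rollout buffer backwards, rather than a bootstrapped GAE estimate. A minimal sketch of that computation (function and variable names are illustrative, not the repo's):

```python
import torch

def monte_carlo_returns(rewards, terminals, gamma=0.99):
    """Discounted returns computed backwards over a finished rollout.

    `rewards` and `terminals` are per-step lists; a terminal flag
    resets the running return at episode boundaries.
    """
    returns = []
    discounted = 0.0
    for reward, done in zip(reversed(rewards), reversed(terminals)):
        if done:
            discounted = 0.0  # new episode: restart the running return
        discounted = reward + gamma * discounted
        returns.insert(0, discounted)
    returns = torch.tensor(returns, dtype=torch.float32)
    # Normalizing the returns tends to stabilize training in practice.
    return (returns - returns.mean()) / (returns.std() + 1e-7)
```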
The simplest implementation is simple_ppo/ppo.py, and the code there is documented in detail.
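For orientation, the heart of any such PPO file is the clipped-surrogate update over stored transitions. The sketch below shows that update using the ActorCritic sketch above; the signature, loss coefficients, and epoch count are assumptions, not simple_ppo/ppo.py's actual API:

```python
import torch

def ppo_update(policy, optimizer, states, actions, old_logprobs, returns,
               clip_eps=0.2, epochs=4):
    """Several epochs of the clipped-surrogate PPO update."""
    for _ in range(epochs):
        logprobs, entropy, values = policy.evaluate(states, actions)
        advantages = returns - values.detach()       # Monte-Carlo advantage
        ratios = torch.exp(logprobs - old_logprobs)  # pi_new / pi_old
        surr1 = ratios * advantages
        surr2 = torch.clamp(ratios, 1 - clip_eps, 1 + clip_eps) * advantages
        loss = (-torch.min(surr1, surr2)             # clipped policy loss
                + 0.5 * (values - returns).pow(2)    # value-function loss
                - 0.01 * entropy).mean()             # entropy bonus
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```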
The project also handles multi-agent environments. The environment used is a custom wrapper around "CartPole-v1" that holds multiple instances of the environment and steps them together. This solution is deliberately minimal and is intended only to illustrate the logic needed to work with multi-agent environments.
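Such a wrapper can be as simple as a list of independent CartPole-v1 instances stepped in lockstep. A minimal sketch using the gymnasium API (the class name and the auto-reset behavior are assumptions about the wrapper, not its actual code):

```python
import gymnasium as gym
import numpy as np

class MultiCartPole:
    """Holds several independent CartPole-v1 instances, stepped together."""

    def __init__(self, num_envs=4):
        self.envs = [gym.make("CartPole-v1") for _ in range(num_envs)]

    def reset(self):
        # Stack the initial observations into a (num_envs, 4) array.
        return np.stack([env.reset()[0] for env in self.envs])

    def step(self, actions):
        """Step every instance with its own action; auto-reset finished ones."""
        obs, rewards, dones = [], [], []
        for env, action in zip(self.envs, actions):
            o, r, terminated, truncated, _ = env.step(action)
            done = terminated or truncated
            if done:
                o = env.reset()[0]  # restart the episode for this agent
            obs.append(o)
            rewards.append(r)
            dones.append(done)
        return np.stack(obs), np.array(rewards), np.array(dones)
```

In practice, gymnasium.vector.SyncVectorEnv provides the same batched-stepping behavior out of the box; the hand-rolled version above only exists to make the logic explicit.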