This project is implementation of multiple AI agents based on different Reinforcement Learning methods to OpenAI Gymnasium Lunar-Lander environment which is classic rocket landing trajectory optimization problem.
-
Updated
Dec 31, 2022 - Python
This project is implementation of multiple AI agents based on different Reinforcement Learning methods to OpenAI Gymnasium Lunar-Lander environment which is classic rocket landing trajectory optimization problem.
Topics in Machine Learning @ IIIT Hyderabad (Fall 2021)
Pytorch implementation of classic and latest Model-Free RL algorithms.
Example VPG implementation with ReLAx
This notebook trains an agent to navigate a maze and reach a desired destination. It uses the Gym-MiniGrid's fourRoom-v0 environment as the maze. The agent is trained by using reiforcement learning's vanilla policy gradient (REINFORCE) algorithm.
The pytorch implementation of vpg
Vanilla Policy Gradient (REINFORCE) implementation with PyTorch
Add a description, image, and links to the vanilla-policy-gradient topic page so that developers can more easily learn about it.
To associate your repository with the vanilla-policy-gradient topic, visit your repo's landing page and select "manage topics."