Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment
-
Updated
May 2, 2017 - Python
Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment
A collection of RL algorithms written in PyTorch
Example TRPO implementation with ReLAx
Example PPO implementation with ReLAx
Deep Q network and Policy gradient reinforcement learning alogrithms to play pacman
Ben Gurion University "Deep Reinforcement Learning (372.2.5910)" course assignments & solutions
My reports for the reinforcement learning class given at the ENS
Implementing Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning". using TensorFlow
policy gradient for pong
[Reinforcement Learning, forked from Stable-baselines3] Étude des performances des algorithmes de Reinforcement Learning sur Pendulum
Programming Assignments for Reinforcement Learning Specialization
Scheduling TRPO's KL Divergence Constraint
This repository provides an implementation of Othello game playing agents trained using reinforcement learning techniques.
Quickly learn policies for continuous control in sparse reward environments
Code for some fun exercises in the textbook 'Reinforcement Learning - An Introduction'
Deep Q-Network, Actor-critic , Policy gradient implementation in python
Example A2C implementation with ReLAx
Reinforcement learning algs for Open AI gym games.
Add a description, image, and links to the policy-gradient topic page so that developers can more easily learn about it.
To associate your repository with the policy-gradient topic, visit your repo's landing page and select "manage topics."