policy-gradient

Here are 410 public repositories matching this topic...

yashbhutwala / pong-ai

Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment

python tensorflow numpy pong openai-gym policy-gradient deep-q-learning

Updated May 2, 2017
Python

ethanmclark1 / rl_toolkit

A collection of RL algorithms written in PyTorch

reinforcement-learning deep-reinforcement-learning openai-gym pytorch policy-gradient reproducibility q-table dueling-dqn a2c rl-algorithms

Updated Aug 25, 2023
Python

nslyubaykin / relax_trpo_example

Example TRPO implementation with ReLAx

reinforcement-learning gae policy-gradient reinforcement-learning-algorithms continuous-control trpo generalized-advantage-estimation discrete-control

Updated Aug 29, 2022
Jupyter Notebook

nslyubaykin / relax_ppo_example

Example PPO implementation with ReLAx

reinforcement-learning gae policy-gradient reinforcement-learning-algorithms continuous-control proximal-policy-optimization ppo generalized-advantage-estimation discrete-control

Updated Aug 29, 2022
Jupyter Notebook

TUJJIEVE / Reinforcement-Learning-Pacman

Deep Q-network and policy gradient reinforcement learning algorithms to play Pac-Man

dqn policy-gradient reinforcement-learning-algorithms qlearning-algorithm sarsa-learning

Updated Jan 5, 2023
Jupyter Notebook

ShaharShc / DeepReinforcementLearningCourse

Ben Gurion University "Deep Reinforcement Learning (372.2.5910)" course assignments & solutions

deep-learning deep-reinforcement-learning policy-gradient transfer-learning deep-q-learning meta-learning

Updated Jan 30, 2024
Python

Twice22 / Reinforcement-Learning

My reports for the reinforcement learning class given at the ENS

reinforcement-learning policy-gradient reinforce policy-iteration value-iteration ucb1

Updated Jan 16, 2018
Jupyter Notebook

KokoMind / A3C-TF

Implementation of Asynchronous Advantage Actor-Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning", using TensorFlow

reinforcement-learning policy-gradient a3c

Updated Aug 7, 2017
Python

mynameisvinn / pypong

Policy gradient for Pong

machine-learning reinforcement-learning pong policy-gradient gym-environment

Updated Feb 12, 2018
Python

SIakovlev / Continuous-Control

deep-reinforcement-learning policy-gradient ddpg continuous-control unity-environment

Updated Dec 25, 2018
Jupyter Notebook

vcentfu / ReinfLearnPendulum

[Reinforcement Learning, forked from Stable-baselines3] Study of the performance of reinforcement learning algorithms on Pendulum

policy-gradient reinforcement-learning-algorithms soft-actor-critic

Updated Jan 31, 2022
Python

SwamiKannan / Reinforcement-Learning-Specialization

Programming Assignments for Reinforcement Learning Specialization

reinforcement-learning q-learning coursera dqn policy-gradient coursera-machine-learning td-learning specialization monte-carlo-sampling capstone-project actor-critic-methods sarsa-learning lunar-lander actor-critic-algorithm university-of-alberta amii alberta-machine-learning-institute

Updated Oct 25, 2022
Jupyter Notebook

nslyubaykin / trpo_schedule_kl

Scheduling TRPO's KL Divergence Constraint

reinforcement-learning scheduling policy-gradient reinforcement-learning-algorithms continuous-control trpo kl-divergence trust-region-policy-optimization

Updated Aug 29, 2022
Jupyter Notebook

farkoo / PG-PPO-OthelloSolver

This repository provides an implementation of Othello game playing agents trained using reinforcement learning techniques.

python reinforcement-learning tensorflow othello policy-gradient proximal-policy-optimization

Updated Jul 7, 2023
Python

rbrigden / multi-goal-policy-gradients

Quickly learn policies for continuous control in sparse reward environments

reinforcement-learning deep-reinforcement-learning policy-gradient

Updated Feb 17, 2021
Python

mohith-sakthivel / learn_rl_notebook

Code for some fun exercises in the textbook 'Reinforcement Learning - An Introduction'

reinforcement-learning policy-gradient cartpole mountain-car policy-evaluation workbook policy-iteration rl-algorithms rich-sutton-textbook-examples

Updated Jun 7, 2020
Jupyter Notebook

ganjalipour / Reinforcement-learning

Deep Q-Network, actor-critic, and policy gradient implementations in Python

reinforcement-learning policy-gradient actor-critic-algorithm

Updated May 17, 2020
Python

nslyubaykin / relax_a2c_example

Example A2C implementation with ReLAx

reinforcement-learning policy-gradient reinforcement-learning-algorithms continuous-control advantage-actor-critic discrete-control

Updated Aug 29, 2022
Jupyter Notebook

vkramanuj / atari-rl

Reinforcement learning algorithms for OpenAI Gym games.

reinforcement-learning tensorflow policy-gradient

Updated Mar 12, 2017
Python

BhanuPrakashPebbeti / Reinforce-Algorithm

python reinforcement-learning deep-learning pytorch artificial-intelligence policy-gradient reinforce-algorithm

Updated Mar 12, 2022
Python
