off-policy

Star

Here are 39 public repositories matching this topic...

MishaLaskin / curl

Star

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Updated Oct 28, 2020
Python

TianhongDai / hindsight-experience-replay

Star

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

reinforcement-learning exploration ddpg her pytorch-implmention off-policy hindsight-experience-replay

Updated Dec 11, 2021
Python

MishaLaskin / rad

Star

RAD: Reinforcement Learning with Augmented Data

Updated Mar 29, 2021
Jupyter Notebook

denisyarats / drq

Star

DrQ: Data regularized Q

python control reinforcement-learning deep-learning pixel deep-reinforcement-learning pytorch gym rl data-augmentation sac actor-critic mujoco model-free off-policy dm-control drq soft-actor-crit

Updated Jan 13, 2023
Jupyter Notebook

pokaxpoka / sunrise

Star

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning rainbow rl codebase deep-q-network sac deep-q-learning mujoco model-free off-policy dm-control soft-actor-critic

Updated Mar 21, 2021
Python

zhihanyang2022 / off-policy-continuous-control

Star

Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)

reinforcement-learning pytorch continuous-control actor-critic rdpg recurrent-neural-network off-policy rtd3 rsac

Updated Nov 21, 2023
Python

denisyarats / exorl

Star

ExORL: Exploratory Data for Offline Reinforcement Learning

python control reinforcement-learning deep-learning pytorch datasets mujoco model-free off-policy offline-rl unsupevised exporation

Updated Feb 8, 2022
Python

instadeepai / flashbax

Star

⚡ Flashbax: Accelerated Replay Buffers in JAX

machine-learning reinforcement-learning hpc buffers rl jax off-policy

Updated May 29, 2024
Python

Rosefintech / Rosefintech-RosefinAIEngine

Star

RosefinAIEngine of Rosfintech

ai tensorflow engine off-policy

Updated Aug 17, 2021
Python

ccnets-team / causal-rl

Star

Causal RL: Reverse-Environment Network Integrated Actor-Critic Algorithm

reinforcement-learning pytorch gpt causal actor-critic-algorithm off-policy invertible-policy reverse-environment-network cooperative-network causal-mask

Updated May 22, 2024
Python

baturaysaglam / LA3P

Star

Actor Prioritized Experience Replay

deep-reinforcement-learning actor-critic prioritized-experience-replay off-policy

Updated Nov 20, 2023
Python

lionelblonde / sam-pytorch

Star

PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Nov 22, 2019
Python

lionelblonde / sam-tf

Star

TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

reinforcement-learning tensorflow gan imitation-learning gail off-policy

Updated Dec 8, 2022
Python

lionelblonde / liayn-pytorch

Star

PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Apr 19, 2022
Python

MohammadAsadolahi / Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python

Star

solving a simple 4*4 Gridworld almost similar to openAI gym FrozenLake using Qlearning Temporal difference method Reinforcement Learning

reinforcement-learning qlearning off-policy qlearning-on-gridworld

Updated Jul 16, 2023
Jupyter Notebook

TheUnsolvedDev / ReinforcementLearning

Star

Repository containing basic algorithm applied in python.

algorithm reinforcement-learning monte-carlo policy-evaluation policy-iteration bandit-algorithms on-policy off-policy

Updated Dec 3, 2023
Jupyter Notebook

mabirck / CS294-DeepRL

Star

My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning pytorch neural-networks policy-gradient reinforcement pytorch-tutorials cs294 on-policy off-policy

Updated Jan 15, 2018
Python

lionelblonde / sam-pytorch-complete-history

Star

PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Aug 9, 2021
Python

SaminYeasar / off_policy_ac

Star

Contains PyTorch Implementation of the following off policy actor critic algorithms

reinforcement-learning pytorch ddpg sac actor-critic mujoco off-policy td3

Updated Aug 5, 2021
Python

baturaysaglam / DASE

Star

Safe and Robust Experience Sharing for Deterministic Policy Gradient Algorithms

deep-reinforcement-learning actor-critic off-policy experience-replay multi-agent-reinforcement-learning

Updated Aug 11, 2022
Python

Improve this page

Add a description, image, and links to the off-policy topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the off-policy topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

off-policy

Here are 39 public repositories matching this topic...

MishaLaskin / curl

TianhongDai / hindsight-experience-replay

MishaLaskin / rad

denisyarats / drq

pokaxpoka / sunrise

zhihanyang2022 / off-policy-continuous-control

denisyarats / exorl

instadeepai / flashbax

Rosefintech / Rosefintech-RosefinAIEngine

ccnets-team / causal-rl

baturaysaglam / LA3P

lionelblonde / sam-pytorch

lionelblonde / sam-tf

lionelblonde / liayn-pytorch

MohammadAsadolahi / Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python

TheUnsolvedDev / ReinforcementLearning

mabirck / CS294-DeepRL

lionelblonde / sam-pytorch-complete-history

SaminYeasar / off_policy_ac

baturaysaglam / DASE

Improve this page

Add this topic to your repo