Each file is a self-contained RL algorithm, tuned to solve that particular game.
An algorithm may have one or more extensions, and the file name tells you which algorithm it implements and which extensions it adds.
The core algorithms are based on cleanrl and come with the usual cleanrl goodies: TensorBoard logging, gameplay video capture,
and experiment management with Weights & Biases.
- ppo - Proximal Policy Optimization (https://arxiv.org/abs/1707.06347)
- ppo_lstm - PPO with recurrent policy using LSTM
- ppo_gru - PPO with recurrent policy using GRU
- sac_dis - Soft Actor-Critic for discrete action settings (https://arxiv.org/abs/1910.07207)
- a2c - Advantage Actor-Critic
- dqn - Deep Q-Network (https://arxiv.org/abs/1312.5602)
- sac - Soft Actor-Critic (https://arxiv.org/abs/1801.01290)
- ddqn - Double DQN (https://arxiv.org/abs/1509.06461); see the target-computation sketch after this list
- dueling_dqn - Dueling DQN (https://arxiv.org/abs/1511.06581)
- ppo_separate - separate networks for the actor and the critic
- frame_stacking - stacking four consecutive frames
- vt - Vision Transformer as the encoder
- n_step - n-step returns, from Asynchronous Methods for Deep Reinforcement Learning (https://arxiv.org/pdf/1602.01783)
- relational - relational deep reinforcement learning (https://arxiv.org/abs/1806.01830)
- sil - self-imitation learning (https://arxiv.org/abs/1806.05635)
- icm - curiosity-driven exploration (https://arxiv.org/pdf/1705.05363.pdf)
- branching - action branching (https://arxiv.org/pdf/1711.08946.pdf)
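
To illustrate how two of the listed algorithms differ, here is a minimal sketch of the bootstrapped targets used by dqn and ddqn. It assumes PyTorch and uses illustrative names (`q_network`, `target_network`, `rewards`, `next_obs`, `dones`) that are not taken from the files in this repo.

```python
import torch


def dqn_targets(target_network, rewards, next_obs, dones, gamma=0.99):
    # Vanilla DQN: the target network both selects and evaluates the next action.
    with torch.no_grad():
        next_q = target_network(next_obs).max(dim=1).values
        return rewards + gamma * next_q * (1.0 - dones)


def double_dqn_targets(q_network, target_network, rewards, next_obs, dones, gamma=0.99):
    # Double DQN: the online network selects the action and the target network
    # evaluates it, which reduces the overestimation bias of vanilla DQN.
    with torch.no_grad():
        best_actions = q_network(next_obs).argmax(dim=1, keepdim=True)
        next_q = target_network(next_obs).gather(1, best_actions).squeeze(1)
        return rewards + gamma * next_q * (1.0 - dones)
```

The only change in Double DQN is that action selection moves from the target network to the online network; everything else in the training loop stays the same.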