DQN-tensorflow

Tensorflow implementation of Deep Q-Network (DQN) and Behavior Cloning (BC) to learn how to defeat humans in a FlappyBird game.

Tensorflow implementation of DQN similar to the paper Human-level control through deep reinforcement learning [Mnih et al., 2015].
Some visualization tools for analysing experimental results.
Possibility to learn from expert dataset (Behavior Cloning).

Experiments

The environnement used for the experiments is the flappyBird_cnn Gym environnement from this repository [blavad].

Which parts of the inputs were decisive when the AI won against human at Flappy Bird Game ?

We can visually explain actions taken by the agent via Gradient-based Localization [Servaraju et al., 2016].

More details: Internship report (in French)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
imgs		imgs
per		per
README.md		README.md
data.py		data.py
data_visualisation.py		data_visualisation.py
dqn_bc.ipynb		dqn_bc.ipynb
main.py		main.py
resultats-test.ipynb		resultats-test.ipynb
setup.py		setup.py
tf_tools.py		tf_tools.py