pong-ai

A Pong game played by two AIs.

AI1 is a Q-learning agent and AI2 is a near-perfect opponent. Unlike previous related work, which trains Pong RL agents by combining Q-learning with deep learning in an algorithm known as Deep Q-Networks (DQN), this implementation exploits known constraints of its custom-made Pong environment to train the agent using one-step Q-learning alone.

This work shows that one-step Q-learning, a model-free, off-policy reinforcement learning algorithm typically relegated to simple maze-world environments, can be combined with a partially observable Markov decision process (POMDP) and a novel technique called state distillation to train a Q-agent that plays Pong and converges to the optimal policy.
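For readers unfamiliar with the algorithm, here is a minimal sketch of the generic tabular one-step Q-learning update, `Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))`. It is not the code in `src`: the action names, the state representation, the hyperparameter values, and the helper names `make_q_table`, `choose_action`, and `q_update` are all assumptions made for illustration, and the project's state distillation step is not modeled here.

```python
import random
from collections import defaultdict

# Hypothetical paddle actions; the real project may use a different set.
ACTIONS = ("up", "stay", "down")


def make_q_table():
    """Return a Q-table mapping any hashable state to per-action values (all 0.0)."""
    return defaultdict(lambda: {a: 0.0 for a in ACTIONS})


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy selection: explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    values = q[state]
    return max(ACTIONS, key=lambda a: values[a])


def q_update(q, state, action, reward, next_state,
             alpha=0.1, gamma=0.9, done=False):
    """Apply one one-step Q-learning update and return the new Q(state, action).

    The target bootstraps from the greedy value of next_state regardless of
    which action the agent actually takes next, which is what makes the
    method off-policy. Terminal transitions (done=True) do not bootstrap.
    """
    best_next = 0.0 if done else max(q[next_state].values())
    td_target = reward + gamma * best_next
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]
```

In a training loop, the agent would call `choose_action` on the current state, step the environment, and pass the observed reward and next state to `q_update`. States only need to be hashable, so a tuple of discretized ball and paddle coordinates would work as a key.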

For more information, see the project's thesis paper, Akash-Kumar-Masters-Thesis-Playing-Pong-Using-Q-Learning.pdf.

Folders and files:

img
qnetworks
qtables
src
visited_graphs
Akash-Kumar-Masters-Thesis-Playing-Pong-Using-Q-Learning.pdf
LICENSE
README.md

KumarUniverse/pong-ai
