dmu-project

Repository for my Spring 2023 Decision Making Under Uncertainty final project. I investigated the performance of a reinforcement learning agent trained using a policy gradient to play Atari Pong. You can read the final report here.

Agent Demo

Here is a demo of one of the trained agents (a53b7b) playing and winning against the computer.

pong-agent.mp4

Source Files

File	Description
model_helpers.py	Helper functions for saving and loading PyTorch models.
play.py	Fun script for playing games in the Atari gym.
pong_test.py	Test script that tests the trained Pong agent.
pong_train.py	Train script that trains Pong agent via policy gradient.
visualization_and_stats.ipynb	Notebook for help visualizing models, and computing various statistics.

Models

Models were trained with PyTorch using Reinforcement Learning and a policy gradient. They are stored in the models directory. Complete metrics were collected with Aim.

Agent	ID	Train Time	Win Rate	Total Fames	Reward-to-Go	Baseline Subtraction	Max Steps / Episode	Total Layers	Hidden Dim	Episodes	Gamma	Learning Rate
4	f0d3a1baf6a04e9380841bc5	109hrs	~62%	506,007,959	✅	✅	None	3	300	20000	0.99	0.0001
3	2b9c7df2eca04bb49e31404f	35hrs	~95%	367,556,707	✅	✅	5000	3	200	20000	0.99	0.0001
2	a53b7b3457f14f4e99172150	38hrs	~95%	335,928,545	✅	❌	5000	3	200	20000	0.99	0.0001
1	b60ba6f06be54de99c2f890f	12hrs 29min	~28%	79,861,221	✅	❌	1000	3	200	20000	0.99	0.0001

Win rate calculated over the result of 100 games played. See pong_test.py for how this is computed.
Agent 4 not included in final report since it took too long to train.

Create and Activate Environment

python3 -m venv env
source env/bin/activate

Install Dependencies

python -m pip install -r requirements.txt
pip install gym\[accept-rom-license\]

Run Model

python pong_test.py

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
models		models
videos		videos
.gitignore		.gitignore
README.md		README.md
final-report.pdf		final-report.pdf
model_helpers.py		model_helpers.py
play.py		play.py
pong_test.py		pong_test.py
pong_train.py		pong_train.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
visualization_and_stats.ipynb		visualization_and_stats.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

models

models

videos

videos

.gitignore

.gitignore

README.md

README.md

final-report.pdf

final-report.pdf

model_helpers.py

model_helpers.py

play.py

play.py

pong_test.py

pong_test.py

pong_train.py

pong_train.py

pyproject.toml

pyproject.toml

requirements.txt

requirements.txt

visualization_and_stats.ipynb

visualization_and_stats.ipynb

Repository files navigation

dmu-project

Agent Demo

Source Files

Models

Create and Activate Environment

Install Dependencies

Run Model

About

Languages

camargo/dmu-project

Folders and files

Latest commit

History

Repository files navigation

dmu-project

Agent Demo

Source Files

Models

Create and Activate Environment

Install Dependencies

Run Model

About

Topics

Resources

Stars

Watchers

Forks

Languages