English Version

This project was developed for the subject 'Laboratório de IACD' by the students André Sousa, António Cardoso, Antónia Brito and Paulo Silva.

This project recreates an adversarial model, inspired by DeepMind's AlphaZero, that is capable of learning to play the games of Go and Ataxx.

To achieve this, we created a machine learning model based on the AlphaZero approach, combining a Convolutional Neural Network with a Monte Carlo Tree Search (MCTS) algorithm.
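As a rough illustration of how the network and the search fit together in AlphaZero-style MCTS, the sketch below scores each child node with the PUCT rule: the mean value seen so far plus an exploration bonus weighted by the network's prior. The function names, the `c_puct` value, and the `children` layout are assumptions made for this example and are not taken from aMCTS_parallel.py.

```python
import math

def puct_score(parent_visits, child_visits, child_value_sum, prior, c_puct=1.5):
    """Mean value of the child plus a prior-weighted exploration bonus.

    c_puct=1.5 is an illustrative constant, not the project's setting.
    """
    q = child_value_sum / child_visits if child_visits > 0 else 0.0
    u = c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)
    return q + u

def select_child(parent_visits, children):
    """Pick the action with the highest PUCT score.

    children maps action -> (visits, value_sum, prior); this layout is
    hypothetical and chosen only to keep the example short.
    """
    return max(children, key=lambda a: puct_score(parent_visits, *children[a]))
```

With this rule, an unvisited move with a high prior is tried before a well-explored move with a moderate value, which is how the network's policy output steers the search.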

On top of that, we implemented some features of our own:

- Multiple games can be played in parallel
- A model that can play on boards of different sizes
- Data augmentation by applying transformations to board states
- Introduction of noise to mitigate overfitting
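Two of the features above can be sketched in a few lines. The example assumes a square board and a policy with one probability per cell (a simplification: Go also has a pass move, and Ataxx moves need a richer encoding). It generates the 8 rotations and reflections of a position and mixes noise into move priors. Using Dirichlet noise, as the original AlphaZero does at the search root, is an assumption here, as are all function names and the `alpha` and `epsilon` values.

```python
import random

def rotate90(grid):
    """Rotate a square grid (list of rows) 90 degrees counterclockwise."""
    return [list(row) for row in zip(*grid)][::-1]

def flip(grid):
    """Mirror a grid left to right."""
    return [list(reversed(row)) for row in grid]

def symmetries(board, policy):
    """Return the 8 symmetric (board, policy) pairs of a square position.

    The same transformation is applied to board and policy so that each
    probability stays attached to the cell it refers to.
    """
    out = []
    for _ in range(4):
        out.append((board, policy))
        out.append((flip(board), flip(policy)))
        board, policy = rotate90(board), rotate90(policy)
    return out

def add_dirichlet_noise(priors, alpha=0.3, epsilon=0.25, rng=random):
    """Blend priors with a Dirichlet sample (normalised gamma draws).

    alpha and epsilon are illustrative values, not the project's settings.
    """
    gammas = [rng.gammavariate(alpha, 1.0) for _ in priors]
    total = sum(gammas)
    return [(1 - epsilon) * p + epsilon * g / total
            for p, g in zip(priors, gammas)]
```

Each self-play position then yields up to eight training samples instead of one, and the noisy priors still sum to 1 whenever the input priors do.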

The final work is distributed across these files:

- aMCTS_parallel.py: implements the MCTS
- az_parallel2.py: contains the AlphaZero class
- ataxx.py and go.py: implement the rules of the games
- graphics.py: displays the pygame interface
- play_AI.py: lets you play against a trained model
- ataxx4x4.py, ataxx5x5.py, ataxx6x6.py, ataxxflex.py, go7x7.py and go9x9.py: train each model

To play against your models, set the game characteristics, the model characteristics and the model path in play_AI.py, then run that file.


anfisou/AlphaZero_Recreation

Folders and files

Latest commit

History

Repository files navigation

English Version

Versão Portuguesa

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages