iBlackJack

Training an agent to play Blackjack using Monte Carlo control.

Introduction

This is a simple example of applying Monte Carlo methods to a game. The agent plays Blackjack repeatedly and averages the observed returns to estimate the expected value of each possible state. Actions are then selected based on these estimates.
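As a rough illustration of the averaging idea, here is a minimal sketch of a first-visit Monte Carlo update. It is not taken from `Agent.py`: the function name `first_visit_mc_update`, the `q` and `counts` tables, and the choice to estimate values per (state, action) pair rather than per state are all assumptions made for this example. States follow Gym's Blackjack observation layout (player sum, dealer's visible card, usable ace).

```python
from collections import defaultdict


def first_visit_mc_update(q, counts, episode, gamma=1.0):
    """Update action-value estimates in place from one finished episode.

    episode is a list of (state, action, reward) tuples in time order.
    q and counts are hypothetical tables keyed by (state, action).
    """
    # Record the first time step at which each (state, action) pair occurs.
    first_visit = {}
    for t, (state, action, _reward) in enumerate(episode):
        first_visit.setdefault((state, action), t)

    g = 0.0  # return, accumulated backwards from the end of the episode
    for t in range(len(episode) - 1, -1, -1):
        state, action, reward = episode[t]
        g = gamma * g + reward
        if first_visit[(state, action)] == t:
            key = (state, action)
            counts[key] += 1
            # Incremental mean: Q <- Q + (G - Q) / N
            q[key] += (g - q[key]) / counts[key]


q = defaultdict(float)
counts = defaultdict(int)

# Action 1 = hit, 0 = stick. The player hits on 13, then sticks on 18 and wins.
episode = [((13, 10, False), 1, 0.0), ((18, 10, False), 0, 1.0)]
first_visit_mc_update(q, counts, episode)
```

Each call folds one more sampled return into the running average, so the estimates become more reliable as more games are played.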

To simulate the game, I use Gym's Blackjack environment. See the documentation for details: https://www.gymlibrary.dev/environments/toy_text/blackjack/
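For orientation, here is a hedged sketch of how one episode could be collected from such an environment. It assumes the Gym 0.26-style API, where `reset()` returns `(observation, info)` and `step()` returns `(observation, reward, terminated, truncated, info)`; older Gym versions return different tuples. The helpers `run_episode` and `random_policy` are hypothetical names for this example and are not part of the repository.

```python
import random


def run_episode(env, policy):
    """Play one episode and return its (state, action, reward) steps.

    Assumes the Gym >= 0.26 API: reset() returns (observation, info) and
    step() returns (observation, reward, terminated, truncated, info).
    """
    episode = []
    state, _info = env.reset()
    done = False
    while not done:
        action = policy(state)
        next_state, reward, terminated, truncated, _info = env.step(action)
        episode.append((state, action, reward))
        state = next_state
        done = terminated or truncated
    return episode


def random_policy(state):
    """Pick stick (0) or hit (1) uniformly at random, ignoring the state."""
    return random.choice([0, 1])


# With Gym installed, an episode could be generated like this:
#   import gym
#   env = gym.make("Blackjack-v1")
#   print(run_episode(env, random_policy))
```

Keeping the environment as a parameter means the same loop works with any object that exposes `reset()` and `step()` in this form.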

Strategy Overview

The final implementation uses an epsilon-soft action selection strategy:

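$$ \text{Exploratory action probability} \gets \frac{\epsilon}{|\mathcal{A}(s)|} $$

$$ \text{Greedy action probability} \gets 1 - \epsilon + \frac{\epsilon}{|\mathcal{A}(s)|} $$

Here $|\mathcal{A}(s)|$ is the number of actions available in state $s$ (two in Blackjack: stick or hit), so the probabilities sum to 1. The greed increases over time, that is, $\epsilon$ shrinks as training progresses.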
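The selection rule can be sketched in a few lines of Python. This is an illustrative sketch, not the code in `Agent.py`: it assumes the exploration mass is split evenly over the available actions, and the function names and the decay schedule (`start`, `end`, `decay`) are hypothetical choices made for this example.

```python
import random


def epsilon_soft_probs(q_values, epsilon):
    """Return epsilon-soft action probabilities for one state.

    Every action gets epsilon / n; the greedy action gets an extra 1 - epsilon.
    """
    n = len(q_values)
    greedy = max(range(n), key=lambda a: q_values[a])
    probs = [epsilon / n] * n
    probs[greedy] += 1.0 - epsilon
    return probs


def choose_action(q_values, epsilon, rng=random):
    """Sample an action index according to the epsilon-soft probabilities."""
    probs = epsilon_soft_probs(q_values, epsilon)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def decayed_epsilon(episode_index, start=1.0, end=0.05, decay=0.999):
    """Hypothetical schedule: epsilon shrinks geometrically down to a floor."""
    return max(end, start * decay ** episode_index)


# Estimated values for [stick, hit] in some state; hit looks better here.
probs = epsilon_soft_probs([0.2, 0.5], epsilon=0.1)
```

With `epsilon=0.1` and two actions, the greedy action is chosen with probability about 0.95 and the other with about 0.05. Lowering `epsilon` over episodes makes the policy greedier.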

Earlier commits contain simpler action selection strategies.

Let's Get Started!

Feel free to explore, experiment, and have fun! 🎮😄

mshokrnezhad/iBlackJack

Folders and files

Latest commit

History

Repository files navigation

iBlackJack

Introduction

Strategy Overview

Let's Get Started!

About

Topics

Resources

Stars

Watchers

Forks

Languages