Artificial intelligence is no match for natural stupidity
― Albert Einstein
This is a simplified version of the game of Blackjack, suitable for tabular RL methods, without card counting or shuffle tracking.
Allowed actions:
- hit
- stand
The game is represented as a stationary environment:
- no multiple decks
- every hand is dealt from a full deck
- cards are not put aside after an action
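The environment described above could be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the class name `SimpleBlackjack`, the `(player total, dealer's visible card, usable ace)` state, the +1 / 0 / -1 rewards and the dealer drawing until 17 are all assumptions. "Cards are not put aside" is read here as drawing with replacement, which keeps the card distribution the same at every step.

```python
import random

# One full deck by value: ace = 1 (may count as 11), and 10, J, Q, K all = 10.
FULL_DECK = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 10, 10, 10] * 4

HIT, STAND = 0, 1


def hand_value(cards):
    """Return (total, usable_ace) for a list of card values."""
    total = sum(cards)
    if 1 in cards and total + 10 <= 21:
        return total + 10, True  # count one ace as 11
    return total, False


class SimpleBlackjack:
    """Hypothetical stationary Blackjack environment (hit / stand only)."""

    def __init__(self, seed=None):
        self.rng = random.Random(seed)
        self.player = []
        self.dealer = []

    def _draw(self):
        # Assumption: drawing with replacement, so the deck never changes.
        return self.rng.choice(FULL_DECK)

    def _state(self):
        total, usable = hand_value(self.player)
        return (total, self.dealer[0], usable)

    def reset(self):
        """Deal a new hand and return the initial state."""
        self.player = [self._draw(), self._draw()]
        self.dealer = [self._draw(), self._draw()]
        return self._state()

    def step(self, action):
        """Apply an action and return (state, reward, done)."""
        if action == HIT:
            self.player.append(self._draw())
            if hand_value(self.player)[0] > 21:
                return self._state(), -1.0, True  # player busts
            return self._state(), 0.0, False
        # STAND: the dealer draws until reaching 17 or more (assumed rule).
        while hand_value(self.dealer)[0] < 17:
            self.dealer.append(self._draw())
        p = hand_value(self.player)[0]
        d = hand_value(self.dealer)[0]
        if d > 21 or p > d:
            return self._state(), 1.0, True
        return self._state(), (0.0 if p == d else -1.0), True
```

A hand is then played by calling `reset()` once and `step(HIT)` or `step(STAND)` until `done` is true.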
Implemented methods:
- First-visit Monte Carlo method
- Q-learning method
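As a rough illustration of how the two methods differ, the sketch below runs both on a tiny stand-in game. It is not the project's implementation: the helper names (`total`, `state_of`, `play_step`, `epsilon_greedy`), the epsilon-greedy exploration, the hyperparameters, the reward scheme and the dealer rule are all assumptions made for the example. First-visit Monte Carlo waits for the end of the hand and averages full returns; Q-learning updates after every action using a bootstrapped target.

```python
import random
from collections import defaultdict

CARDS = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 10, 10, 10]  # ace = 1; 10, J, Q, K = 10
HIT, STAND = 0, 1
ACTIONS = (HIT, STAND)


def total(cards):
    """Best hand total, counting one ace as 11 when that does not bust."""
    s = sum(cards)
    return s + 10 if 1 in cards and s + 10 <= 21 else s


def state_of(player, dealer):
    """State: (player total, dealer's visible card, usable ace)."""
    return (total(player), dealer[0], total(player) != sum(player))


def deal(rng):
    """Deal two cards each, drawing with replacement (assumed)."""
    return [rng.choice(CARDS), rng.choice(CARDS)], [rng.choice(CARDS), rng.choice(CARDS)]


def play_step(player, dealer, action, rng):
    """Mutate the hands and return (reward, done). Dealer rule is assumed."""
    if action == HIT:
        player.append(rng.choice(CARDS))
        return (-1.0, True) if total(player) > 21 else (0.0, False)
    while total(dealer) < 17:
        dealer.append(rng.choice(CARDS))
    p, d = total(player), total(dealer)
    return (1.0 if d > 21 or p > d else 0.0 if p == d else -1.0), True


def epsilon_greedy(Q, state, eps, rng):
    if rng.random() < eps:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: Q[(state, a)])


def first_visit_mc(episodes, eps=0.1, seed=0):
    rng = random.Random(seed)
    Q, counts = defaultdict(float), defaultdict(int)
    for _ in range(episodes):
        player, dealer = deal(rng)
        trajectory, done = [], False
        while not done:
            s = state_of(player, dealer)
            a = epsilon_greedy(Q, s, eps, rng)
            r, done = play_step(player, dealer, a, rng)
            trajectory.append((s, a, r))
        # Walk backwards accumulating the undiscounted return; later
        # overwrites leave the return from the FIRST visit of each pair.
        g, first_return = 0.0, {}
        for s, a, r in reversed(trajectory):
            g += r
            first_return[(s, a)] = g
        for key, ret in first_return.items():
            counts[key] += 1
            Q[key] += (ret - Q[key]) / counts[key]  # incremental mean
    return Q


def q_learning(episodes, alpha=0.05, gamma=1.0, eps=0.1, seed=0):
    rng = random.Random(seed)
    Q = defaultdict(float)
    for _ in range(episodes):
        player, dealer = deal(rng)
        done = False
        while not done:
            s = state_of(player, dealer)
            a = epsilon_greedy(Q, s, eps, rng)
            r, done = play_step(player, dealer, a, rng)
            s2 = state_of(player, dealer)
            # Bootstrapped target: reward plus best estimated next value.
            target = r if done else r + gamma * max(Q[(s2, b)] for b in ACTIONS)
            Q[(s, a)] += alpha * (target - Q[(s, a)])
    return Q
```

Either function returns a table keyed by `(state, action)`; a greedy policy can be read off by comparing `Q[(state, HIT)]` with `Q[(state, STAND)]` for each state.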
To do:
- add new algorithms
- add a non-stationary environment and enable card counting
Reinforcement Learning: An Introduction
by Richard S. Sutton and Andrew G. Barto