This is the second of three projects in the AI programming course at NTNU. The group built an Actor reinforcement learner and applied it to different instances of the game Hex. The actor is a deep neural network trained on the move probabilities produced by the MCTS.
Figure 1 provides a high-level view of the actor.
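As a rough illustration of that training target, the sketch below shows a minimal Keras-style policy network fitted with cross-entropy against the MCTS move distribution. The state encoding, layer sizes, and optimizer are illustrative assumptions, not the project's actual architecture.

```python
import tensorflow as tf

def build_actor(board_size: int) -> tf.keras.Model:
    """Policy network mapping a board state to a distribution over moves.

    Assumed encoding (hypothetical): one input per cell plus a
    player-to-move indicator; one output per cell of the Hex board.
    """
    n_cells = board_size ** 2
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(n_cells + 1,)),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dense(n_cells, activation="softmax"),
    ])
    # Cross-entropy against the MCTS-derived distribution makes the
    # network imitate the search's move preferences.
    model.compile(optimizer="adam", loss="categorical_crossentropy")
    return model
```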
File structure:
- agent
  - actor.py - the neural network actor
- environment
  - hexboard.py - the logic of the board
  - game_manager.py - updates the state of the game
- simulator
  - mcts.py - gives the probability distribution over moves for a state, equal to the softmax of the total visit counts of all child nodes of that state (see the sketch after this list)
  - tree_node.py - a node in the MCTS tree
- tournament
  - tournament.py - used for playing models trained to different levels against each other
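A minimal sketch of the visit-count-to-distribution step mentioned for mcts.py, assuming NumPy; the function name is hypothetical:

```python
import numpy as np

def visit_distribution(visit_counts: np.ndarray) -> np.ndarray:
    """Softmax of the total visit counts of a node's children.

    Subtracting the max before exponentiating is a standard
    numerical-stability trick and does not change the result.
    """
    exps = np.exp(visit_counts - visit_counts.max())
    return exps / exps.sum()

# Example: a root whose three children were visited 5, 3, and 2 times.
print(visit_distribution(np.array([5.0, 3.0, 2.0])))
```

Note that a softmax over raw counts is sharper than plain normalization: as the counts grow, nearly all probability mass concentrates on the most-visited child.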
The config folder contains the different configs used for the different instances of the game. main.py reads in these configs and starts the whole training loop. The oht folder is used for playing on the server created by the course administrators, to test our models against theirs. The nim folder was used while developing the MCTS, to verify that it worked on a less complicated game. Profiling was used to track the runtime of each part of the project.
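The config-driven startup in main.py could look roughly like the sketch below. The YAML format, file name, and keys (board_size, num_episodes, mcts_rollouts) are all assumptions for illustration; the repository's actual config format may differ.

```python
import yaml  # assumes PyYAML; the project may use another format

def load_config(path: str) -> dict:
    """Read one game/training configuration from the config folder."""
    with open(path) as f:
        return yaml.safe_load(f)

# Hypothetical usage; file name and keys are illustrative.
config = load_config("config/hex_7x7.yaml")
board_size = config["board_size"]        # e.g. 7
num_episodes = config["num_episodes"]    # training episodes to run
mcts_rollouts = config["mcts_rollouts"]  # simulations per move
```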
Below, the progression of learning and a visualization of game play are shown. For the progression of learning, models were saved at different stages of training and then played against each other; a higher model number corresponds to a higher level of learning. The visualization of game play shows a fully trained actor (blue) against an actor trained for only a few episodes (red).
| Progression of Learning | Visualization of Game Play |
| --- | --- |
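As a sketch of the tournament idea (checkpoints saved at different training stages playing each other), the round-robin loop below tallies wins per model. The play_game callable and its return convention are assumptions for illustration, not taken from tournament.py.

```python
import itertools

def round_robin(models: list, play_game) -> dict:
    """Play every pair of saved models against each other and tally wins.

    `models` holds checkpoints saved at increasing stages of training;
    `play_game(a, b)` is assumed to return 0 or 1, the index of the winner.
    """
    wins = {i: 0 for i in range(len(models))}
    for i, j in itertools.combinations(range(len(models)), 2):
        winner = play_game(models[i], models[j])
        wins[i if winner == 0 else j] += 1
    return wins
```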