TromisNet

A deep reinforcement learning agent to play Tromis, a simplified version of two-player Tetris. The model learns exclusively through self-play.

The neural network is built with PyTorch and trains against a C++ implementation of Tromis.

A blog post explaining an early version of this, which used the same DNN structure, can be found here.

Since then, the agent has improved significantly through hyper-parameter tuning, off-policy training with V-trace, and other methods.
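For readers unfamiliar with V-trace, the following is a minimal sketch of how its value targets are computed for a single trajectory. It is not taken from this repository: the function name `vtrace_targets`, its parameters, and the use of plain Python lists instead of PyTorch tensors are all assumptions made so the example runs without dependencies. It follows the standard recursion with importance weights clipped at `rho_bar` and `c_bar`.

```python
import math


def vtrace_targets(behaviour_logp, target_logp, rewards, values,
                   bootstrap_value, gamma=0.99, rho_bar=1.0, c_bar=1.0):
    """Illustrative V-trace targets for one trajectory of length n.

    behaviour_logp / target_logp: log-probabilities of the taken actions under
    the rollout policy and the current policy. values: V(x_t) for each step.
    bootstrap_value: V(x_n), or 0.0 if the game ended. All names are hypothetical.
    """
    n = len(rewards)
    # Importance ratios pi(a|x) / mu(a|x), clipped from above.
    ratios = [math.exp(t - b) for t, b in zip(target_logp, behaviour_logp)]
    rhos = [min(rho_bar, r) for r in ratios]
    cs = [min(c_bar, r) for r in ratios]

    # Clipped temporal-difference errors.
    next_values = list(values[1:]) + [bootstrap_value]
    deltas = [rhos[t] * (rewards[t] + gamma * next_values[t] - values[t])
              for t in range(n)]

    # Backward recursion: v_t - V(x_t) = delta_t + gamma * c_t * (v_{t+1} - V(x_{t+1})).
    vs = [0.0] * n
    acc = 0.0
    for t in reversed(range(n)):
        acc = deltas[t] + gamma * cs[t] * acc
        vs[t] = values[t] + acc

    # Advantages for the policy-gradient term.
    next_vs = vs[1:] + [bootstrap_value]
    advantages = [rhos[t] * (rewards[t] + gamma * next_vs[t] - values[t])
                  for t in range(n)]
    return vs, advantages
```

When the rollout policy and the current policy agree, every ratio is 1 and the targets reduce to ordinary n-step returns, which is a convenient sanity check for any implementation.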

Currently it achieves a 99.7% win rate, so new benchmarks are needed to measure its performance.

The source contains three different Python trainers. The best one, experimentally, is trainer_self.py, which optimises GPU utilisation by doing off-policy rollouts during game simulations.

Nightcrab/tromis-net
