Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mixture of Experts MCTS (MoE MCTS) #216

Merged
merged 57 commits into from
May 3, 2024
Merged
Changes from 1 commit
Commits
Show all changes
57 commits
Select commit Hold shift + click to select a range
f066178
- added game phase detection file
HelpstoneX Jun 22, 2023
e7b9f00
- changed openspiel git
HelpstoneX Jun 22, 2023
ebfa472
- changed openspiel git
HelpstoneX Jun 24, 2023
ae18d66
fixed phase ids
HelpstoneX Jun 24, 2023
8eb237f
added dataset creation option for specific phases
HelpstoneX Jun 24, 2023
738cd8d
Merge branch 'dataset_creation' of https://github.com/HelpstoneX/Craz…
HelpstoneX Jun 29, 2023
3b56cee
param changes in train_cnn.ipynb
HelpstoneX Jul 3, 2023
826f3b6
- fixed plys_to_end list to only include values for moves that really…
HelpstoneX Jul 4, 2023
b92a8a8
Merge branch 'dataset_creation'
HelpstoneX Jul 5, 2023
cdbe3ab
- changes to train_cnn to make it compatible
HelpstoneX Aug 3, 2023
ce89f69
updated fork
HelpstoneX Aug 10, 2023
e55c318
mcts phase integration working, some improvements missing
HelpstoneX Sep 14, 2023
a2a9a2c
- added phase_to_nets map to make sure the right net is used for each…
HelpstoneX Sep 20, 2023
0be67ff
- added game phase vector to created datasets
HelpstoneX Sep 27, 2023
9e9b009
minor fixes for weighted training
HelpstoneX Sep 27, 2023
1700818
- fixes and improvements to prs.py from cutechess-cli
HelpstoneX Sep 28, 2023
cb575ce
- changes for continuing training from tar file (pytorch)
HelpstoneX Oct 2, 2023
964a982
- added python file for training (exported notebook)
HelpstoneX Oct 2, 2023
c3a2a25
- added python file for executing cutechess shell commands
HelpstoneX Oct 3, 2023
d05228d
- added the option to specify additional eval sets (unweighted) to pa…
HelpstoneX Oct 7, 2023
87456f2
- minor changes
HelpstoneX Oct 7, 2023
4229a04
- minor changes for debugging
HelpstoneX Oct 7, 2023
e9bed77
- bugfix in train_cnn.py for additional dataloaders
HelpstoneX Oct 7, 2023
c13d0be
- bugfix in to correctly determine train iterations
HelpstoneX Oct 9, 2023
4a11bd8
- minor changese in prs.py
HelpstoneX Oct 14, 2023
614a4c9
- minor changes for chess 960
HelpstoneX Oct 14, 2023
5049a1e
- reverted mode and version back to 2 and 3
HelpstoneX Oct 14, 2023
d0cc5f4
fixed bug when executing isready multiple times consecutively while s…
HelpstoneX Oct 26, 2023
b47f44b
alternative bugfix attempt for linux
HelpstoneX Oct 27, 2023
57e9d7b
- temporary fix for chess960 wrong training representation
HelpstoneX Nov 1, 2023
982c650
- changes to incorporate 960 dataset analysis
HelpstoneX Dec 9, 2023
d18251b
chess960 input representation fix (c++ engine files still unadjusted …
HelpstoneX Dec 9, 2023
26cb920
- added plot generating notebooks to git (/etc folder)
HelpstoneX Dec 15, 2023
5e0df73
- added support for naive movecount phases
HelpstoneX Dec 17, 2023
f065280
- minor path fix in dataset_loader.py
HelpstoneX Dec 17, 2023
1a7ab6c
undone temporary fix for broken chess960 input representation
HelpstoneX Dec 19, 2023
69e002c
- added support for phases by movecount in c++ code (currently always…
HelpstoneX Jan 12, 2024
85167cd
- minor plotting adjustments
HelpstoneX Jan 17, 2024
2059aac
- adjusted run_cutechess_experiments.py to be able to do experiments …
HelpstoneX Mar 20, 2024
bf57009
- added documentation
HelpstoneX Apr 11, 2024
09c6810
- minor assertion change in train_cnn.py
HelpstoneX Apr 11, 2024
9d5fa1f
- cleaned code and removed sections that are not needed anymore
HelpstoneX Apr 20, 2024
336f785
- changed underscore naming to camelCase naming in several cases
HelpstoneX Apr 20, 2024
fa3dfe6
- added UCI option Game_Phase_Definition with options "lichess" and "…
HelpstoneX Apr 20, 2024
9c4d507
- added searchSettings to RawNetAgent to access selected gamePhaseDef…
HelpstoneX Apr 22, 2024
687ef8a
- aligned train_cnn.ipynb with code inside train_cnn.py
HelpstoneX Apr 22, 2024
65a302b
- cleaned cell outputs of main notebooks
HelpstoneX May 1, 2024
2fae83a
- further notebook output cleanings
HelpstoneX May 1, 2024
2917f78
- removed files unnecessary for pull request and reverted several fil…
HelpstoneX May 1, 2024
df97fac
- reverted .gitignore and Dockerfile to older state
HelpstoneX May 1, 2024
2fd46bb
- .gitignore update to different previous state
HelpstoneX May 1, 2024
6df0f5a
Merge branch 'master' into pull_request_preparation
QueensGambit May 2, 2024
4030367
Update crazyara.cpp
QueensGambit May 2, 2024
81e1a94
Update board.cpp
QueensGambit May 2, 2024
3cbb403
Add GamePhase get_phase to states
QueensGambit May 3, 2024
fabc465
Add GamePhase OpenSpielState::get_phase()
QueensGambit May 3, 2024
a4853f5
Update get_data_loader() to load dict instead
QueensGambit May 3, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
- changed openspiel git
  • Loading branch information
HelpstoneX committed Jun 24, 2023
commit ebfa472a18984642768c1743ce42af512e59f585
Original file line number Diff line number Diff line change
@@ -1,3 +1,13 @@
"""
@file: game_phase_detector.py
Created on 08.06.2023
@project: CrazyAra
@author: HelpstoneX

Analyses a given board state defined by a python-chess object and outputs the game phase according to a given definition
"""


import chess
import chess.pgn
import numpy as np
@@ -59,22 +69,23 @@ def get_mixedness(board):


def get_game_phase(board, definition="lichess"):
"""
TODO fill docstring
"""
if definition == "lichess":
"""
returns the game phase based on the lichess definition implemented in:
https://github.com/lichess-org/scalachess/blob/master/src/main/scala/Divider.scala
"""
# returns the game phase based on the lichess definition implemented in:
# https://github.com/lichess-org/scalachess/blob/master/src/main/scala/Divider.scala

num_majors_and_minors = get_majors_and_minors_count(board)
backrank_sparse = is_backrank_sparse(board)
mixedness_score = get_mixedness(board)

if num_majors_and_minors <= 6:
return "endgame", num_majors_and_minors, backrank_sparse, mixedness_score
return "endgame", num_majors_and_minors, backrank_sparse, mixedness_score, 0
elif num_majors_and_minors <= 10 or backrank_sparse or (mixedness_score > 150):
return "midgame", num_majors_and_minors, backrank_sparse, mixedness_score
return "midgame", num_majors_and_minors, backrank_sparse, mixedness_score, 1
else:
return "opening", num_majors_and_minors, backrank_sparse, mixedness_score
return "opening", num_majors_and_minors, backrank_sparse, mixedness_score, 2

else:
return "not implemented yet"
@@ -118,7 +129,7 @@ def get_game_phase(board, definition="lichess"):

for idx, move in enumerate(curr_game.main_line()):

phase, num_maj_and_min, backrank_sparse, mix_score = get_game_phase(curr_board)
phase, num_maj_and_min, backrank_sparse, mix_score, _ = get_game_phase(curr_board)
print(get_game_phase(curr_board), move)

if curr_phase == "midgame" and phase == "opening":
38 changes: 22 additions & 16 deletions DeepCrazyhouse/src/preprocessing/pgn_converter_util.py
Original file line number Diff line number Diff line change
@@ -15,6 +15,7 @@
from DeepCrazyhouse.src.domain.variants.input_representation import board_to_planes
from DeepCrazyhouse.configs.main_config import main_config
from DeepCrazyhouse.src.domain.variants.game_state import mirror_policy
from DeepCrazyhouse.src.preprocessing.game_phase_detector import get_game_phase


NB_ITEMS_METADATA = 18 # constant which defines how many meta data items will be stored in a matrix
@@ -125,22 +126,27 @@ def get_planes_from_game(game, mate_in_one=False):

# check if you need to export a mate_in_one_scenario
if not mate_in_one or plys == len(all_moves) - 1:
# build the last move vector by putting the most recent move on top followed by the remaining past moves
last_moves = [None] * NB_LAST_MOVES
if plys != 0:
last_moves[0:min(plys, NB_LAST_MOVES)] = all_moves[max(plys-NB_LAST_MOVES, 0):plys][::-1]

# receive the board and the evaluation of the current position in plane representation
# We don't want to store float values because the integer datatype is cheaper,
# that's why normalize is set to false
x_cur = board_to_planes(board, board_occ, normalize=False, mode=main_config["mode"], last_moves=last_moves)

# add the evaluation of 1 position to the list
x.append(x_cur)
y_value.append(y_init)
# add the next move defined in policy vector notation to the policy list
# the network always sees the board as if he's the white player, that's the move is mirrored fro black
y_policy.append(move_to_policy(next_move, mirror_policy=mirror_policy(board)))

# if specified phase is not None
# check if the current game phase is the phase the dataset is created for

if main_config["phase"] is None or get_game_phase(board)[4] == main_config["phase"]:
# build the last move vector by putting the most recent move on top followed by the remaining past moves
last_moves = [None] * NB_LAST_MOVES
if plys != 0:
last_moves[0:min(plys, NB_LAST_MOVES)] = all_moves[max(plys-NB_LAST_MOVES, 0):plys][::-1]

# receive the board and the evaluation of the current position in plane representation
# We don't want to store float values because the integer datatype is cheaper,
# that's why normalize is set to false
x_cur = board_to_planes(board, board_occ, normalize=False, mode=main_config["mode"], last_moves=last_moves)

# add the evaluation of 1 position to the list
x.append(x_cur)
y_value.append(y_init)
# add the next move defined in policy vector notation to the policy list
# the network always sees the board as if he's the white player, that's the move is mirrored fro black
y_policy.append(move_to_policy(next_move, mirror_policy=mirror_policy(board)))

y_init *= -1 # flip the y_init value after each move
board.push(move) # push the next move on the board
1 change: 1 addition & 0 deletions engine/src/rl/Dockerfile
Original file line number Diff line number Diff line change
@@ -197,4 +197,5 @@ RUN apt-get update -y \
# && wget https://raw.githubusercontent.com/ianfab/books/master/crazyhouse.epd

CMD cd /root/CrazyAra/engine/build/ \
&& git pull \
&& /bin/bash