Support for multiplayer games? #159

kendonB · 2023-01-10T08:32:27Z

I saw a YouTube video suggest that this was difficult in principle due to the possibility of the agent forming cartels (i.e. it learns that it's always best to cooperate with position 2 if it finds itself in position 1 and vice versa).

This should be possible to avoid by just choosing the objective function to disincentivise collaboration.

So rather than having the agent maximise own win probability it could, for example, maximise the difference between own win probability and that of the opposing player most likely to win. Perhaps the negative weights could be applied to all other players weighted by their win probability.

kendonB · 2023-01-10T08:47:08Z

Mentioned in #101 and you make a similar point there. I don't think there needs to be a single correct objective function - I think it just needs to have some weight on own win probability and some negative weight on opponents in strong positions.

jonathan-laurent · 2023-01-10T13:05:42Z

Your idea may have potential but it is way too abstract in its present form for me to evaluate. I would encourage you to flesh it out using a concrete game as an example. Also, try and be specific about how each component of AlphaZero should be adapted to work with your idea (MCTS, network training objective, self-play...).

smart-fr mentioned this issue Jan 19, 2023

Stack overflow with lots of RAM still available #164

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for multiplayer games? #159

Support for multiplayer games? #159

kendonB commented Jan 10, 2023

kendonB commented Jan 10, 2023

jonathan-laurent commented Jan 10, 2023

Support for multiplayer games? #159

Support for multiplayer games? #159

Comments

kendonB commented Jan 10, 2023

kendonB commented Jan 10, 2023

jonathan-laurent commented Jan 10, 2023