title | abstract | layout | series | publisher | issn | id | month | tex_title | firstpage | lastpage | page | order | cycles | bibtex_editor | editor | bibtex_author | author | date | note | address | container-title | volume | genre | issued | extras | |||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Learning and solving many-player games through a cluster-based representation |
In addressing the challenge of exponential scaling with the number of agents we adopt a cluster-based representation to approximately solve asymmetric games of very many players. A cluster groups together agents with a similar "strategic view" of the game. We learn the clustered approximation from data consisting of strategy profiles and payoffs, which may be obtained from observations of play or access to a simulator. Using our clustering we construct a reduced "twins" game in which each cluster is associated with two players of the reduced game. This allows our representation to be individually-responsive because we align the interests of every individual agent with the strategy of its cluster. Our approach provides agents with higher payoffs and lower regret on average than model-free methods as well as previous cluster-based methods, and requires only few observations for learning to be successful. The "twins" approach is shown to be an important component of providing these low regret approximations. |
inproceedings |
Proceedings of Machine Learning Research |
PMLR |
2640-3498 |
ficici08a |
0 |
Learning and solving many-player games through a cluster-based representation |
188 |
195 |
188-195 |
188 |
false |
McAllester, David A. and Myllym{"a}ki, Petri |
|
Ficici, Sevan G. and Parkes, David C. and Pfeffer, Avi |
|
2008-07-09 |
Reissued by PMLR on 30 October 2024. |
Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence |
R6 |
inproceedings |
|