Skip to content

Latest commit

 

History

History
55 lines (55 loc) · 2.38 KB

2024-04-18-ahmed24a.md

File metadata and controls

55 lines (55 loc) · 2.38 KB
title software abstract layout series publisher issn id month tex_title firstpage lastpage page order cycles bibtex_author author date address container-title volume genre issued pdf extras
Privacy-Preserving Decentralized Actor-Critic for Cooperative Multi-Agent Reinforcement Learning
Multi-agent reinforcement learning has a wide range of applications in cooperative settings, but ensuring data privacy among agents is a significant challenge. To address this challenge, we propose Privacy-Preserving Decentralized Actor-Critic (PPDAC), an algorithm that motivates agents to cooperate while maintaining their data privacy. Leveraging trajectory ranking, PPDAC enables the agents to learn a cooperation reward that encourages agents to account for other agents’ preferences. Subsequently, each agent trains a policy that maximizes not only its local reward as in independent actor-critic (IAC) but also the cooperation reward, hence, increasing cooperation. Importantly, communication among agents is restricted to their ranking of trajectories that only include public identifiers without any private local data. Moreover, as an additional layer of privacy, the agents can perturb their rankings with the randomized response method. We evaluate PPDAC on the level-based foraging (LBF) environment and a coin-gathering environment. We compare with IAC and Shared Experience Actor-Critic (SEAC) which achieves SOTA results for the LBF environment. The results show that PPDAC consistently outperforms IAC. In addition, PPDAC outperforms SEAC in the coin-gathering environment and achieves similar performance in the LBF environment, all while providing better privacy.
inproceedings
Proceedings of Machine Learning Research
PMLR
2640-3498
ahmed24a
0
Privacy-Preserving Decentralized Actor-Critic for Cooperative Multi-Agent Reinforcement Learning
2755
2763
2755-2763
2755
false
Ahmed, Maheed A. and Ghasemi, Mahsa
given family
Maheed A.
Ahmed
given family
Mahsa
Ghasemi
2024-04-18
Proceedings of The 27th International Conference on Artificial Intelligence and Statistics
238
inproceedings
date-parts
2024
4
18