Mitigating Side Effects in Multi-Agent Systems Using Blame Assignment
Pulkit Rustagi, Sandhya Saisubramanian

TL;DR
This paper presents a decentralized approach for multi-agent systems to mitigate negative side effects by decomposing joint penalties into individual ones, improving scalability and safety in shared environments.
Contribution
It introduces a novel bi-objective lexicographic decentralized MDP model with credit assignment for NSE mitigation in multi-agent systems.
Findings
Effective in reducing negative side effects in simulations
Scalable approach demonstrated with mobile robots
Balances individual objectives with collective safety
Abstract
When independently trained or designed robots are deployed in a shared environment, their combined actions can lead to unintended negative side effects (NSEs). To ensure safe and efficient operation, robots must optimize task performance while minimizing the penalties associated with NSEs, balancing individual objectives with collective impact. We model the problem of mitigating NSEs in a cooperative multi-agent system as a bi-objective lexicographic decentralized Markov decision process. We assume independence of transitions and rewards with respect to the robots' tasks, but the joint NSE penalty creates a form of dependence in this setting. To improve scalability, the joint NSE penalty is decomposed into individual penalties for each robot using credit assignment, which facilitates decentralized policy computation. We empirically demonstrate, using mobile robots and in simulation, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Terrorism, Counterterrorism, and Political Violence
