Tackling Uncertainties in Multi-Agent Reinforcement Learning through   Integration of Agent Termination Dynamics

Somnath Hazra; Pallab Dasgupta; Soumyajit Dey

arXiv:2501.12061·cs.LG·January 22, 2025

Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics

Somnath Hazra, Pallab Dasgupta, Soumyajit Dey

PDF

Open Access 1 Repo

TL;DR

This paper presents a novel MARL approach that combines distributional learning with safety-focused loss functions, improving convergence and safety in complex multi-agent tasks like StarCraft II.

Contribution

It introduces a Barrier Function based loss integrating safety metrics into MARL, addressing uncertainty and risk during policy learning.

Findings

01

Enhanced convergence in StarCraft II benchmarks

02

Outperforms state-of-the-art baselines in safety and task completion

03

Demonstrates the effectiveness of safety-aware learning in MARL

Abstract

Multi-Agent Reinforcement Learning (MARL) has gained significant traction for solving complex real-world tasks, but the inherent stochasticity and uncertainty in these environments pose substantial challenges to efficient and robust policy learning. While Distributional Reinforcement Learning has been successfully applied in single-agent settings to address risk and uncertainty, its application in MARL is substantially limited. In this work, we propose a novel approach that integrates distributional learning with a safety-focused loss function to improve convergence in cooperative MARL tasks. Specifically, we introduce a Barrier Function based loss that leverages safety metrics, identified from inherent faults in the system, into the policy learning process. This additional loss term helps mitigate risks and encourages safer exploration during the early stages of training. We evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

somnathhazra/uncertainties_marl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics