SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning

Jianye Xu; Pan Hu; Bassam Alrifaee

arXiv:2408.07644 · cs.RO · April 11, 2025

TL;DR

SigmaRL is an open-source, decentralized multi-agent reinforcement learning framework for motion planning of connected and automated vehicles. Its information-dense observation design improves sample efficiency and generalization, cutting training time to under one hour on a single CPU and enabling zero-shot generalization to unseen traffic scenarios.

Contribution

The paper proposes five strategies for designing information-dense observations built on general features that apply to most traffic scenarios. These strategies improve sample efficiency and zero-shot generalization in multi-agent RL for motion planning.

Findings

01 Training time is reduced to under one hour on a single CPU.

02 RL agents zero-shot generalize to unseen traffic scenarios.

03 The proposed observation design strategies improve both sample efficiency and generalization.

Abstract

This paper introduces an open-source, decentralized framework named SigmaRL, designed to enhance both sample efficiency and generalization of multi-agent Reinforcement Learning (RL) for motion planning of connected and automated vehicles. Most RL agents exhibit a limited capacity to generalize, often focusing narrowly on specific scenarios, and are usually evaluated in similar or even the same scenarios seen during training. Various methods have been proposed to address these challenges, including experience replay and regularization. However, how observation design in RL affects sample efficiency and generalization remains an under-explored area. We address this gap by proposing five strategies to design information-dense observations, focusing on general features that are applicable to most traffic scenarios. We train our RL agents using these strategies on an intersection and…

Taxonomy

Methods: Experience Replay