MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

Rajeev Yasarla; Deepti Hegde; Hsin-Pai Cheng; Shizhong Han; Yunxiao Shi; Meysam Sadeghigooghari; Hanno Ackermann; Litian Liu; Pranav Desai; Fatih Porikli; Mohammad Ghavamzadeh; Hong Cai

arXiv:2605.14201·cs.RO·May 21, 2026

MAPLE: Latent Multi-Agent Play for End-to-End Autonomous Driving

Rajeev Yasarla, Deepti Hegde, Hsin-Pai Cheng, Shizhong Han, Yunxiao Shi, Meysam Sadeghigooghari, Hanno Ackermann, Litian Liu, Pranav Desai, Fatih Porikli, Mohammad Ghavamzadeh, Hong Cai

PDF

TL;DR

MAPLE is a scalable, closed-loop training framework for end-to-end autonomous driving that models multi-agent interactions in the latent space without external simulators.

Contribution

It introduces a novel two-stage training process with diversity rewards for reactive multi-agent planning in autonomous driving.

Findings

01

Achieves state-of-the-art performance on Bench2Drive.

02

Demonstrates scalable, closed-loop multi-agent play.

03

Does not require external simulators for training.

Abstract

Vision-language-action (VLA) models are effective as end-to-end motion planners, but can be brittle when evaluated in closed-loop settings due to being trained under traditional imitation learning framework. Existing closed-loop supervision approaches lack scalability and fail to completely model a reactive environment. We propose MAPLE, a novel framework for reactive, multi-agent rollout of a dynamic driving scenario in the latent space of the VLA model. The ego vehicle and nearby traffic agents are independently controlled over multi-step horizons, while being reactive to other agents in the scene, enabling closed-loop training. MAPLE consists of two training stages: (1) supervised fine-tuning on the latent rollouts based on ground-truth trajectories, followed by (2) reinforcement learning with global and agent -specific rewards that encourage safety, progress, and interaction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.