UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving

Ziyi Song; Chen Xia; Chenbing Wang; Haibao Yu; Sheng Zhou; Zhisheng Niu

arXiv:2511.09013·cs.RO·November 13, 2025

UniMM-V2X: MoE-Enhanced Multi-Level Fusion for End-to-End Cooperative Autonomous Driving

Ziyi Song, Chen Xia, Chenbing Wang, Haibao Yu, Sheng Zhou, Zhisheng Niu

PDF

Open Access

TL;DR

UniMM-V2X introduces a hierarchical multi-agent framework with MoE-enhanced multi-level fusion for end-to-end autonomous driving, significantly improving perception, prediction, and planning accuracy through cooperative reasoning.

Contribution

It presents a novel end-to-end multi-agent system with multi-level fusion and MoE architecture, enabling hierarchical cooperation across perception, prediction, and planning tasks.

Findings

01

39.7% improvement in perception accuracy

02

7.2% reduction in prediction error

03

33.2% improvement in planning performance

Abstract

Autonomous driving holds transformative potential but remains fundamentally constrained by the limited perception and isolated decision-making with standalone intelligence. While recent multi-agent approaches introduce cooperation, they often focus merely on perception-level tasks, overlooking the alignment with downstream planning and control, or fall short in leveraging the full capacity of the recent emerging end-to-end autonomous driving. In this paper, we present UniMM-V2X, a novel end-to-end multi-agent framework that enables hierarchical cooperation across perception, prediction, and planning. At the core of our framework is a multi-level fusion strategy that unifies perception and prediction cooperation, allowing agents to share queries and reason cooperatively for consistent and safe decision-making. To adapt to diverse downstream tasks and further enhance the quality of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning