Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning

Jiaqi Liu; Chengkai Xu; Peng Hang; Jian Sun; Wei Zhan; Masayoshi Tomizuka; Mingyu Ding

arXiv:2410.24152 · cs.RO · August 12, 2025

Open Access

TL;DR

This paper introduces LDPD, a novel language-driven policy distillation method that leverages Large Language Models to improve cooperative decision-making in multi-agent reinforcement learning for autonomous vehicles.

Contribution

The paper proposes a new framework combining LLMs with MARL to enhance learning efficiency and decision quality in cooperative driving tasks.

Findings

01. Students rapidly improve with minimal guidance
02. Students surpass teacher performance over time
03. Approach outperforms baseline methods in experiments

Abstract

The cooperative driving technology of Connected and Autonomous Vehicles (CAVs) is crucial for improving the efficiency and safety of transportation systems. Learning-based methods, such as Multi-Agent Reinforcement Learning (MARL), have demonstrated strong capabilities in cooperative decision-making tasks. However, existing MARL approaches still face challenges in terms of learning efficiency and performance. In recent years, Large Language Models (LLMs) have rapidly advanced and shown remarkable abilities in various sequential decision-making tasks. To enhance the learning capabilities of cooperative agents while ensuring decision-making efficiency and cost-effectiveness, we propose LDPD, a language-driven policy distillation method for guiding MARL exploration. In this framework, an LLM-based teacher agent trains smaller student agents to achieve cooperative decision-making through…

Taxonomy

Topics: Reinforcement Learning in Robotics · Transportation and Mobility Innovations · Multi-Agent Systems and Negotiation