Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning
Jiaqi Liu, Chengkai Xu, Peng Hang, Jian Sun, Wei Zhan, Masayoshi Tomizuka, Mingyu Ding

TL;DR
This paper introduces LDPD, a novel language-driven policy distillation method that leverages Large Language Models to improve cooperative decision-making in multi-agent reinforcement learning for autonomous vehicles.
Contribution
The paper proposes a new framework combining LLMs with MARL to enhance learning efficiency and decision quality in cooperative driving tasks.
Findings
Students rapidly improve with minimal guidance
Students surpass teacher performance over time
Approach outperforms baseline methods in experiments
Abstract
The cooperative driving technology of Connected and Autonomous Vehicles (CAVs) is crucial for improving the efficiency and safety of transportation systems. Learning-based methods, such as Multi-Agent Reinforcement Learning (MARL), have demonstrated strong capabilities in cooperative decision-making tasks. However, existing MARL approaches still face challenges in terms of learning efficiency and performance. In recent years, Large Language Models (LLMs) have rapidly advanced and shown remarkable abilities in various sequential decision-making tasks. To enhance the learning capabilities of cooperative agents while ensuring decision-making efficiency and cost-effectiveness, we propose LDPD, a language-driven policy distillation method for guiding MARL exploration. In this framework, a teacher agent based on LLM trains smaller student agents to achieve cooperative decision-making through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations · Multi-Agent Systems and Negotiation
