LearningFlow: Automated Policy Learning Workflow for Urban Driving with   Large Language Models

Zengqi Peng; Yubin Wang; Xu Han; Lei Zheng; Jun Ma

arXiv:2501.05057·cs.RO·January 10, 2025

LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models

Zengqi Peng, Yubin Wang, Xu Han, Lei Zheng, Jun Ma

PDF

Open Access

TL;DR

LearningFlow is an automated workflow that uses multiple large language models to improve reinforcement learning for urban driving, reducing manual effort and enhancing policy performance in complex environments.

Contribution

This paper introduces LearningFlow, a novel LLM-based framework that automates reward and curriculum generation for RL in urban driving, improving efficiency and generalization.

Findings

01

Outperforms existing methods in reward and curriculum generation

02

Achieves higher driving policy performance in CARLA simulator

03

Demonstrates robust generalization across tasks and RL algorithms

Abstract

Recent advancements in reinforcement learning (RL) demonstrate the significant potential in autonomous driving. Despite this promise, challenges such as the manual design of reward functions and low sample efficiency in complex environments continue to impede the development of safe and effective driving policies. To tackle these issues, we introduce LearningFlow, an innovative automated policy learning workflow tailored to urban driving. This framework leverages the collaboration of multiple large language model (LLM) agents throughout the RL training process. LearningFlow includes a curriculum sequence generation process and a reward generation process, which work in tandem to guide the RL policy by generating tailored training curricula and reward functions. Particularly, each process is supported by an analysis agent that evaluates training progress and provides critical insights to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Traffic Prediction and Management Techniques · Business Process Modeling and Analysis

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator