Multi-Timescale Hierarchical Reinforcement Learning for Unified Behavior and Control of Autonomous Driving

Guizhe Jin; Zhuoren Li; Bo Leng; Ran Yu; Lu Xiong; Chen Sun

arXiv:2506.23771·cs.RO·November 25, 2025

Multi-Timescale Hierarchical Reinforcement Learning for Unified Behavior and Control of Autonomous Driving

Guizhe Jin, Zhuoren Li, Bo Leng, Ran Yu, Lu Xiong, Chen Sun

PDF

TL;DR

This paper introduces a multi-timescale hierarchical reinforcement learning framework for autonomous driving that unifies long-term motion planning with short-term control, improving efficiency, consistency, and safety.

Contribution

It proposes a hierarchical RL policy structure with explicit long- and short-timescale policies, incorporating hybrid actions and safety mechanisms for better autonomous driving.

Findings

01

Enhanced driving efficiency and safety in simulations.

02

Improved action consistency across driving scenarios.

03

Effective multi-timescale safety enforcement.

Abstract

Reinforcement Learning (RL) is increasingly used in autonomous driving (AD) and shows clear advantages. However, most RL-based AD methods overlook policy structure design. An RL policy that only outputs short-timescale vehicle control commands results in fluctuating driving behavior due to fluctuations in network outputs, while one that only outputs long-timescale driving goals cannot achieve unified optimality of driving behavior and control. Therefore, we propose a multi-timescale hierarchical reinforcement learning approach. Our approach adopts a hierarchical policy structure, where high- and low-level RL policies are unified-trained to produce long-timescale motion guidance and short-timescale control commands, respectively. Therein, motion guidance is explicitly represented by hybrid actions to capture multimodal driving behaviors on structured road and support incremental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.