Extensive Exploration in Complex Traffic Scenarios using Hierarchical   Reinforcement Learning

Zhihao Zhang; Ekim Yurtsever; Keith A. Redmill

arXiv:2501.14992·cs.LG·January 28, 2025

Extensive Exploration in Complex Traffic Scenarios using Hierarchical Reinforcement Learning

Zhihao Zhang, Ekim Yurtsever, Keith A. Redmill

PDF

Open Access

TL;DR

This paper presents a hierarchical deep reinforcement learning framework for complex traffic scenarios, enabling better decision-making and control in autonomous driving through decomposed subtasks and separate training of controllers.

Contribution

Introduces a hierarchical DRL framework with a two-step training process for complex traffic environments, improving exploration and control capabilities.

Findings

01

Hierarchical controller outperforms baseline in complex highway scenarios.

02

Separate training of high-level and low-level controllers enhances learning efficiency.

03

Framework demonstrates improved handling of long-term delayed rewards.

Abstract

Developing an automated driving system capable of navigating complex traffic environments remains a formidable challenge. Unlike rule-based or supervised learning-based methods, Deep Reinforcement Learning (DRL) based controllers eliminate the need for domain-specific knowledge and datasets, thus providing adaptability to various scenarios. Nonetheless, a common limitation of existing studies on DRL-based controllers is their focus on driving scenarios with simple traffic patterns, which hinders their capability to effectively handle complex driving environments with delayed, long-term rewards, thus compromising the generalizability of their findings. In response to these limitations, our research introduces a pioneering hierarchical framework that efficiently decomposes intricate decision-making problems into manageable and interpretable subtasks. We adopt a two step training process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic control and management · Traffic Prediction and Management Techniques

MethodsADaptive gradient method with the OPTimal convergence rate · Focus