Automated Driving with Evolution Capability: A Reinforcement Learning   Method with Monotonic Performance Enhancement

Jia Hu; Xuerun Yan; Tian Xu; Haoran Wang

arXiv:2412.10822·eess.SY·December 17, 2024

Automated Driving with Evolution Capability: A Reinforcement Learning Method with Monotonic Performance Enhancement

Jia Hu, Xuerun Yan, Tian Xu, Haoran Wang

PDF

Open Access

TL;DR

This paper introduces HCPI-RL, a reinforcement learning method for automated driving that guarantees monotonic performance improvement, handles emergency scenarios, and significantly enhances decision-making and efficiency over existing planners.

Contribution

The paper proposes a novel HCPI-RL framework that ensures consistent performance improvement in automated driving, addressing risks associated with traditional RL methods.

Findings

01

Policy return increased by up to 108.2% in emergency scenarios

02

Driving efficiency improved by 19.2% over PPO planner

03

Monotonic performance enhancement achieved in diverse scenarios

Abstract

Reinforcement Learning (RL) offers a promising solution to enable evolutionary automated driving. However, the conventional RL method is always concerned with risk performance. The updated policy may not obtain a performance enhancement, even leading to performance deterioration. To address this challenge, this research proposes a High Confidence Policy Improvement Reinforcement Learning-based (HCPI-RL) planner. It is intended to achieve the monotonic evolution of automated driving. A novel RL policy update paradigm is designed to enable the newly learned policy performance consistently surpass that of previous policies, which is deemed as monotonic performance enhancement. Hence, the proposed HCPI-RL planner has the following features: i) Evolutionary automated driving with monotonic performance enhancement; ii) With the capability of handling scenarios with emergency; iii) With…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations · Energy, Environment, and Transportation Policies

MethodsEntropy Regularization · Proximal Policy Optimization