Automated Driving with Evolution Capability: A Reinforcement Learning Method with Monotonic Performance Enhancement
Jia Hu, Xuerun Yan, Tian Xu, Haoran Wang

TL;DR
This paper introduces HCPI-RL, a reinforcement learning method for automated driving that guarantees monotonic performance improvement, handles emergency scenarios, and significantly enhances decision-making and efficiency over existing planners.
Contribution
The paper proposes a novel HCPI-RL framework that ensures consistent performance improvement in automated driving, addressing risks associated with traditional RL methods.
Findings
Policy return increased by up to 108.2% in emergency scenarios
Driving efficiency improved by 19.2% over PPO planner
Monotonic performance enhancement achieved in diverse scenarios
Abstract
Reinforcement Learning (RL) offers a promising solution to enable evolutionary automated driving. However, the conventional RL method is always concerned with risk performance. The updated policy may not obtain a performance enhancement, even leading to performance deterioration. To address this challenge, this research proposes a High Confidence Policy Improvement Reinforcement Learning-based (HCPI-RL) planner. It is intended to achieve the monotonic evolution of automated driving. A novel RL policy update paradigm is designed to enable the newly learned policy performance consistently surpass that of previous policies, which is deemed as monotonic performance enhancement. Hence, the proposed HCPI-RL planner has the following features: i) Evolutionary automated driving with monotonic performance enhancement; ii) With the capability of handling scenarios with emergency; iii) With…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Transportation and Mobility Innovations · Energy, Environment, and Transportation Policies
MethodsEntropy Regularization · Proximal Policy Optimization
