Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications

Bin Xu; Jun Hou; Junzhe Shi; Huayi Li; Dhruvang Rathod; Zhe Wang; Zoran Filipi

arXiv:2010.14575·cs.RO·October 29, 2020

TL;DR

This paper introduces warm start methods for Q-learning in hybrid electric vehicle supervisory control, significantly reducing learning time and improving initial fuel consumption performance, facilitating real-world deployment.

Contribution

It proposes initializing Q-learning from existing supervisory control strategies instead of zero or random Q values, which reduces the required learning iterations by 68.8% and improves fuel efficiency during the early learning stages in HEV applications.
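
A minimal sketch of the warm-start idea, assuming a tabular Q-function over discretized states and actions. The exact initialization procedure used in the paper is not given on this page, so the function names (`warm_start_q_table`, `q_update`, `toy_baseline`), the `bonus` value, and the toy baseline rule are all illustrative assumptions. The baseline stands in for a supervisory controller such as the Equivalent Consumption Minimization Strategy.

```python
import numpy as np


def warm_start_q_table(n_states, n_actions, baseline_policy, bonus=1.0):
    """Build a Q-table whose greedy action in every state is the baseline's action.

    Assumption: giving the baseline action a positive bonus over zero-valued
    alternatives is one simple way to seed the table; it is not the paper's method.
    """
    q = np.zeros((n_states, n_actions))
    for s in range(n_states):
        q[s, baseline_policy(s)] = bonus
    return q


def q_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.95):
    """Standard tabular Q-learning update, applied in place."""
    td_target = reward + gamma * np.max(q[s_next])
    q[s, a] += alpha * (td_target - q[s, a])


def toy_baseline(s):
    """Hypothetical stand-in for a rule-based supervisory controller."""
    return s % 3


# Cold start: all zeros, so the initial greedy policy is arbitrary.
q_cold = np.zeros((5, 3))

# Warm start: the initial greedy policy already reproduces the baseline.
q_warm = warm_start_q_table(5, 3, toy_baseline)
greedy_actions = q_warm.argmax(axis=1)
```

With this seeding, the agent's first greedy decisions match the baseline controller rather than an arbitrary choice, which is the intuition behind better early-stage fuel consumption; subsequent calls to `q_update` then refine the table as usual.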

Findings

01. 68.8% fewer learning iterations needed

02. 10-16% MPG improvement over baseline controls

03. Validated in different driving cycles

Abstract

Reinforcement Learning (RL) is widely used in robotics and is gradually being adopted for Hybrid Electric Vehicle (HEV) supervisory control. Although RL performs very well at minimizing fuel consumption in simulation, the large number of learning iterations requires a long learning time, which makes it hard to apply in real-world vehicles. In addition, fuel consumption during the initial learning phases is much worse than with baseline controls. This study aims to reduce the learning iterations of Q-learning in HEV applications and to improve fuel consumption in the initial learning phases by using warm start methods. Unlike previous studies, which initialized Q-learning with zero or random Q values, this study initializes Q-learning with different supervisory controls (i.e., Equivalent Consumption Minimization Strategy control and…

Taxonomy

Methods: Q-Learning