Energy Management of Multi-mode Plug-in Hybrid Electric Vehicle using Multi-agent Deep Reinforcement Learning
Min Hua, Cetengfei Zhang, Fanggang Zhang, Zhi Li, Xiaoli Yu, Hongming, Xu, Quan Zhou

TL;DR
This paper introduces a multi-agent deep reinforcement learning approach for energy management in multi-mode PHEVs, achieving better global optimization and energy savings over traditional methods.
Contribution
It develops a collaborative MADRL framework with a relevance ratio and hand-shaking strategy for improved MIMO control in PHEVs, outperforming existing decoupled methods.
Findings
Up to 4% energy savings over single-agent systems.
Up to 23.54% energy savings compared to rule-based systems.
Identified learning rate as the most influential factor for performance.
Abstract
The recently emerging multi-mode plug-in hybrid electric vehicle (PHEV) technology is one of the pathways making contributions to decarbonization, and its energy management requires multiple-input and multipleoutput (MIMO) control. At the present, the existing methods usually decouple the MIMO control into singleoutput (MISO) control and can only achieve its local optimal performance. To optimize the multi-mode vehicle globally, this paper studies a MIMO control method for energy management of the multi-mode PHEV based on multi-agent deep reinforcement learning (MADRL). By introducing a relevance ratio, a hand-shaking strategy is proposed to enable two learning agents to work collaboratively under the MADRL framework using the deep deterministic policy gradient (DDPG) algorithm. Unified settings for the DDPG agents are obtained through a sensitivity analysis of the influencing factors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectric Vehicles and Infrastructure · Electric and Hybrid Vehicle Technologies · Advanced Battery Technologies Research
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Experience Replay · Weight Decay · Adam · Convolution · Dense Connections · Deep Deterministic Policy Gradient
