Sequence Aware SAC Control for Engine Fuel Consumption Optimization in Electrified Powertrain

Wafeeq Jaleel; Md Ragib Rownak; Athar Hanif; Sidra Ghayour Bhatti; Qadeer Ahmed

arXiv:2508.04874·eess.SY·August 8, 2025

Sequence Aware SAC Control for Engine Fuel Consumption Optimization in Electrified Powertrain

Wafeeq Jaleel, Md Ragib Rownak, Athar Hanif, Sidra Ghayour Bhatti, Qadeer Ahmed

PDF

TL;DR

This paper introduces a sequence-aware reinforcement learning framework using SAC, GRUs, and Decision Transformers to optimize engine control in hybrid electric trucks, achieving near-optimal fuel savings and robustness across diverse driving conditions.

Contribution

It develops a novel RL approach incorporating sequence modeling techniques for improved engine control in HEVs, outperforming traditional methods in fuel efficiency and adaptability.

Findings

01

SAC with DT-based actor and GRU critic achieved within 1.8% of DP in fuel savings.

02

Sequence-aware agents outperformed feedforward networks on unseen drive cycles.

03

Models demonstrated robustness and generalization across diverse initial states and drive conditions.

Abstract

As hybrid electric vehicles (HEVs) gain traction in heavy-duty trucks, adaptive and efficient energy management is critical for reducing fuel consumption while maintaining battery charge for long operation times. We present a new reinforcement learning (RL) framework based on the Soft Actor-Critic (SAC) algorithm to optimize engine control in series HEVs. We reformulate the control task as a sequential decision-making problem and enhance SAC by incorporating Gated Recurrent Units (GRUs) and Decision Transformers (DTs) into both actor and critic networks to capture temporal dependencies and improve planning over time. To evaluate robustness and generalization, we train the models under diverse initial battery states, drive cycle durations, power demands, and input sequence lengths. Experiments show that the SAC agent with a DT-based actor and GRU-based critic was within 1.8% of Dynamic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.