Adaptive BESS and Grid Setpoints Optimization: A Model-Free Framework for Efficient Battery Management under Dynamic Tariff Pricing
Alaa Selim, Huadong Mo, Hemanshu Pota, Daoyi Dong

TL;DR
This paper presents a model-free DRL framework using an enhanced Soft Actor-Critic algorithm for efficient, safe, and adaptive BESS management under dynamic tariffs, significantly reducing optimization time and costs.
Contribution
It introduces a novel DRL-based approach with reward refinement and safety mechanisms for BESS control, outperforming traditional gradient-based methods in speed and cost efficiency.
Findings
SOC exceeds 50% at day's end, ensuring readiness for next day
DRL approach reduces optimization time by 50%
Cost savings average 40% over benchmarks
Abstract
This paper introduces an enhanced framework for managing Battery Energy Storage Systems (BESS) in residential communities. The non-convex BESS control problem is first addressed using a gradient-based optimizer, providing a benchmark solution. Subsequently, the problem is tackled using multiple Deep Reinforcement Learning (DRL) agents, with a specific emphasis on the off-policy Soft Actor-Critic (SAC) algorithm. This version of SAC incorporates reward refinement based on this non-convex problem, applying logarithmic scaling to enhance convergence rates. Additionally, a safety mechanism selects only feasible actions from the action space, aimed at improving the learning curve, accelerating convergence, and reducing computation times. Moreover, the state representation of this DRL approach now includes uncertainties quantified in the entropy term, enhancing the model's adaptability across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Electric Vehicles and Infrastructure · Transportation and Mobility Innovations
MethodsGlobal Average Pooling · 1x1 Convolution · Dilated Convolution · Average Pooling · Convolution · Switchable Atrous Convolution
