Reinforcement Learning for Joint Design and Control of Battery-PV Systems
Marine Cauz, Adrien Bolland, Bardhyl Miftari, Lionel Perret,, Christophe Ballif, Nicolas Wyrsch

TL;DR
This paper explores the application of reinforcement learning for the joint design and control of PV and battery systems, comparing it with traditional MILP optimization to evaluate benefits and challenges.
Contribution
It introduces a novel approach combining RL with system design, bridging the gap between control strategies and design optimization in renewable energy systems.
Findings
RL and MILP converge to similar control solutions
Investment decisions differ between RL and MILP
RL offers a stochastic approach to system optimization
Abstract
The decentralisation and unpredictability of new renewable energy sources require rethinking our energy system. Data-driven approaches, such as reinforcement learning (RL), have emerged as new control strategies for operating these systems, but they have not yet been applied to system design. This paper aims to bridge this gap by studying the use of an RL-based method for joint design and control of a real-world PV and battery system. The design problem is first formulated as a mixed-integer linear programming problem (MILP). The optimal MILP solution is then used to evaluate the performance of an RL agent trained in a surrogate environment designed for applying an existing data-driven algorithm. The main difference between the two models lies in their optimization approaches: while MILP finds a solution that minimizes the total costs for a one-year operation given the deterministic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Electric Vehicles and Infrastructure · Energy Efficiency and Management
