Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility
Flora Charbonnier, Thomas Morstyn, Malcolm D. McCulloch

TL;DR
This paper introduces a scalable multi-agent reinforcement learning framework for distributed control of residential energy resources, improving coordination and reducing costs without sharing personal data.
Contribution
It presents a novel combination of offline convex optimization and marginal contribution-based rewards for scalable, privacy-preserving multi-agent energy management.
Findings
Reduces energy costs, losses, and emissions.
Improves coordination stability at scale.
Maintains privacy without central data sharing.
Abstract
This paper proposes a novel scalable type of multi-agent reinforcement learning-based coordination for distributed residential energy. Cooperating agents learn to control the flexibility offered by electric vehicles, space heating and flexible loads in a partially observable stochastic environment. In the standard independent Q-learning approach, the coordination performance of agents under partial observability drops at scale in stochastic environments. Here, the novel combination of learning from off-line convex optimisations on historical data and isolating marginal contributions to total rewards in reward signals increases stability and performance at scale. Using fixed-size Q-tables, prosumers are able to assess their marginal impact on total system objectives without sharing personal data either with each other or with a central coordinator. Case studies are used to assess the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsQ-Learning
