Scalable multi-agent reinforcement learning for distributed control of   residential energy flexibility

Flora Charbonnier; Thomas Morstyn; Malcolm D. McCulloch

arXiv:2203.03417·eess.SY·March 29, 2022

Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility

Flora Charbonnier, Thomas Morstyn, Malcolm D. McCulloch

PDF

TL;DR

This paper introduces a scalable multi-agent reinforcement learning framework for distributed control of residential energy resources, improving coordination and reducing costs without sharing personal data.

Contribution

It presents a novel combination of offline convex optimization and marginal contribution-based rewards for scalable, privacy-preserving multi-agent energy management.

Findings

01

Reduces energy costs, losses, and emissions.

02

Improves coordination stability at scale.

03

Maintains privacy without central data sharing.

Abstract

This paper proposes a novel scalable type of multi-agent reinforcement learning-based coordination for distributed residential energy. Cooperating agents learn to control the flexibility offered by electric vehicles, space heating and flexible loads in a partially observable stochastic environment. In the standard independent Q-learning approach, the coordination performance of agents under partial observability drops at scale in stochastic environments. Here, the novel combination of learning from off-line convex optimisations on historical data and isolating marginal contributions to total rewards in reward signals increases stability and performance at scale. Using fixed-size Q-tables, prosumers are able to assess their marginal impact on total system objectives without sharing personal data either with each other or with a central coordinator. Case studies are used to assess the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsQ-Learning