Multi-agent Reinforcement Learning for Low-Carbon P2P Energy Trading among Self-Interested Microgrids

Junhao Ren; Honglin Gao; Lan Zhao; Qiyu Kang; Gaoxi Xiao; Yajuan Sun

arXiv:2604.08973·cs.MA·April 13, 2026

TL;DR

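This paper introduces a multi-agent reinforcement learning framework in which self-interested microgrids learn price and quantity bids for P2P electricity trading. In simulation, the learned policies increase renewable utilization and community economic welfare while reducing reliance on high-carbon electricity.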

Contribution

The paper proposes a multi-agent RL framework paired with a market-clearing mechanism that coordinates P2P trades among microgrids and promotes incentive compatibility, with the aim of raising renewable penetration and economic benefits.

Findings

01. Learned bidding policies improve renewable utilization.

02. Learned bidding policies reduce reliance on high-carbon electricity.

03. Learned bidding policies increase community-level economic welfare.

Abstract

Uncertainties in renewable generation and demand dynamics challenge day-ahead scheduling. To enhance renewable penetration and maintain intra-day balance, we develop a multi-agent reinforcement learning framework for self-interested microgrids participating in peer-to-peer (P2P) electricity trading. Each microgrid independently bids both price and quantity while optimizing its own profit via storage arbitrage under time-varying main-grid prices. A market-clearing mechanism coordinating trades and promoting incentive compatibility is proposed. Simulation results show that the learned bidding policy improves renewable utilization and reduces reliance on high-carbon electricity, while increasing community-level economic welfare, delivering a win-win situation in emission reduction and local prosperity.
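
To make the bidding setup in the abstract concrete, here is a minimal sketch of how price-and-quantity bids from several microgrids could be cleared. The abstract does not specify the clearing rule, so this example swaps in a generic double auction with midpoint pricing purely for illustration; it is not the authors' mechanism and does not claim the incentive-compatibility properties the paper targets. The names `Bid`, `clear_market`, and the agent labels are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Bid:
    agent: str       # hypothetical microgrid identifier
    price: float     # bid or ask price, e.g. in $/kWh
    quantity: float  # energy in kWh, assumed strictly positive


def clear_market(buys, sells, eps=1e-9):
    """Illustrative double auction (an assumption, not the paper's rule).

    Matches the highest-priced buy bids with the lowest-priced sell
    offers while the buy price covers the sell price. Each match trades
    at the midpoint of the two prices.
    """
    buys = sorted(buys, key=lambda b: -b.price)
    sells = sorted(sells, key=lambda s: s.price)
    buy_left = [b.quantity for b in buys]
    sell_left = [s.quantity for s in sells]

    trades = []  # (buyer, seller, quantity, price)
    i = j = 0
    while i < len(buys) and j < len(sells) and buys[i].price >= sells[j].price:
        qty = min(buy_left[i], sell_left[j])
        price = 0.5 * (buys[i].price + sells[j].price)
        trades.append((buys[i].agent, sells[j].agent, qty, price))
        buy_left[i] -= qty
        sell_left[j] -= qty
        if buy_left[i] <= eps:
            i += 1
        if sell_left[j] <= eps:
            j += 1

    # Residual quantities would be settled elsewhere, e.g. with the
    # main grid or storage; that step is outside this sketch.
    unmet = {b.agent: q for b, q in zip(buys, buy_left) if q > eps}
    unsold = {s.agent: q for s, q in zip(sells, sell_left) if q > eps}
    return trades, unmet, unsold


if __name__ == "__main__":
    buys = [Bid("mg1", 0.20, 5.0), Bid("mg2", 0.12, 3.0)]
    sells = [Bid("mg3", 0.10, 4.0), Bid("mg4", 0.15, 4.0)]
    trades, unmet, unsold = clear_market(buys, sells)
    for buyer, seller, qty, price in trades:
        print(f"{seller} -> {buyer}: {qty:.1f} kWh at {price:.3f}")
    print("unmet demand:", unmet)
    print("unsold supply:", unsold)
```

In this toy run, `mg2` bids below every remaining ask and `mg4` is left with surplus, so both would fall back on storage or the main grid. In a learning setup of the kind the abstract describes, each agent's policy would choose the `price` and `quantity` fields.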
