Scalable Fairness Shaping with LLM-Guided Multi-Agent Reinforcement Learning for Peer-to-Peer Electricity Markets

Shrenik Jadhav; Birva Sevak; Srijita Das; Akhtar Hussain; Wencong Su; Van-Hai Bui

arXiv:2508.18610·eess.SY·August 27, 2025

Scalable Fairness Shaping with LLM-Guided Multi-Agent Reinforcement Learning for Peer-to-Peer Electricity Markets

Shrenik Jadhav, Birva Sevak, Srijita Das, Akhtar Hussain, Wencong Su, Van-Hai Bui

PDF

TL;DR

This paper introduces FairMarket-RL, a multi-agent reinforcement learning framework guided by large language models to promote fairness in peer-to-peer electricity trading, balancing economic efficiency with social equity.

Contribution

It presents a novel LLM-guided fairness shaping approach for P2P energy markets, integrating fairness scores into reinforcement learning to achieve equitable and efficient trading outcomes.

Findings

01

Shifts energy exchanges toward local P2P trades.

02

Reduces consumer costs compared to grid-only procurement.

03

Maintains strong fairness and utility viability across diverse scenarios.

Abstract

Peer-to-peer (P2P) energy trading is becoming central to modern distribution systems as rooftop PV and home energy management systems become pervasive, yet most existing market and reinforcement learning designs emphasize efficiency or private profit and offer little real-time guidance to ensure equitable outcomes under uncertainty. To address this gap, a fairness-aware multiagent reinforcement learning framework, FairMarket-RL, is proposed in which a large language model (LLM) critic shapes bidding policies within a continuous double auction under partial observability and discrete price-quantity actions. After each trading slot, the LLM returns normalized fairness scores Fairness-to-Grid (FTG), Fairness-Between-Sellers (FBS), and Fairness-of-Pricing (FPP) that are integrated into the reward via ramped coefficients and tunable scaling, so that fairness guidance complements, rather than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.