Safe Decentralized Operation of EV Virtual Power Plant with Limited Network Visibility via Multi-Agent Reinforcement Learning
Chenghao Huang, Jiarong Fan, Weiqing Wang, Hao Wang

TL;DR
This paper introduces a multi-agent reinforcement learning framework for EV virtual power plants that ensures voltage security and economic operation under limited network visibility, using transformer-assisted policies.
Contribution
It develops TL-MAPPO, a decentralized control method with transformer-based embeddings and centralized training, improving voltage stability and cost efficiency in VPPs.
Findings
Reduces voltage violations by ~45% in simulations.
Lowers operational costs by ~10% compared to baselines.
Demonstrates effectiveness under partial network visibility constraints.
Abstract
As power systems advance toward net-zero targets, behind-the-meter renewables are driving rapid growth in distributed energy resources (DERs). Virtual power plants (VPPs) increasingly coordinate these resources to support power distribution network (PDN) operation, with EV charging stations (EVCSs) emerging as a key asset due to their strong impact on local voltages. However, in practice, VPPs must make operational decisions with only partial visibility of PDN states, relying on limited, aggregated information shared by the distribution system operator. This work proposes a safety-enhanced VPP framework for coordinating multiple EVCSs under such realistic information constraints to ensure voltage security while maintaining economic operation. We develop Transformer-assisted Lagrangian Multi-Agent Proximal Policy Optimization (TL-MAPPO), in which EVCS agents learn decentralized charging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
