Multi-Agent Reinforcement Learning for Dynamic Pricing: Balancing Profitability,Stability and Fairness

Krishna Kumar Neelakanta Pillai Santha Kumari Amma

arXiv:2603.16888·cs.LG·March 19, 2026

Multi-Agent Reinforcement Learning for Dynamic Pricing: Balancing Profitability,Stability and Fairness

Krishna Kumar Neelakanta Pillai Santha Kumari Amma

PDF

Open Access

TL;DR

This paper systematically evaluates multi-agent reinforcement learning algorithms, MAPPO and MADDPG, for dynamic retail pricing, demonstrating MAPPO's superior stability and profitability in competitive simulated markets.

Contribution

It provides the first comprehensive empirical comparison of MARL algorithms for dynamic pricing, highlighting MAPPO's advantages over existing independent learning baselines.

Findings

01

MAPPO achieves highest average returns with low variance.

02

MADDPG offers the fairest profit distribution among agents.

03

MARL methods are scalable and stable for dynamic retail pricing.

Abstract

Dynamic pricing in competitive retail markets requires strategies that adapt to fluctuating demand and competitor behavior. In this work, we present a systematic empirical evaluation of multi-agent reinforcement learning (MARL) approaches-specifically MAPPO and MADDPG-for dynamic price optimization under competition. Using a simulated marketplace environment derived from real-world retail data, we benchmark these algorithms against an Independent DDPG (IDDPG) baseline, a widely used independent learner in MARL literature. We evaluate profit performance, stability across random seeds, fairness, and training efficiency. Our results show that MAPPO consistently achieves the highest average returns with low variance, offering a stable and reproducible approach for competitive price optimization, while MADDPG achieves slightly lower profit but the fairest profit distribution among agents.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConsumer Market Behavior and Pricing · Auction Theory and Applications · Supply Chain and Inventory Management