Approximate Nash Equilibrium Learning for n-Player Markov Games in Dynamic Pricing
Larkin Liu

TL;DR
This paper introduces a model-free, neural network-based method for learning approximate Nash equilibria in multi-agent Markov games, especially applied to dynamic pricing where exact solutions are computationally infeasible.
Contribution
A novel gradient-free optimization approach combined with neural networks to estimate approximate Nash equilibria in complex, multi-agent Markov games.
Findings
Successfully learned approximate Nash equilibria in dynamic pricing scenarios.
Demonstrated the method's effectiveness in high-dimensional, multi-agent environments.
Provided a scalable approach to equilibrium computation where exact solutions are intractable.
Abstract
We investigate Nash equilibrium learning in a competitive Markov Game (MG) environment, where multiple agents compete, and multiple Nash equilibria can exist. In particular, for an oligopolistic dynamic pricing environment, exact Nash equilibria are difficult to obtain due to the curse-of-dimensionality. We develop a new model-free method to find approximate Nash equilibria. Gradient-free black box optimization is then applied to estimate , the maximum reward advantage of an agent unilaterally deviating from any joint policy, and to also estimate the -minimizing policy for any given state. The policy- correspondence and the state to -minimizing policy are represented by neural networks, the latter being the Nash Policy Net. During batch update, we perform Nash Q learning on the system, by adjusting the action probabilities using the Nash Policy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Economic theories and models · Auction Theory and Applications
