Convergence of Stochastic First-Order Algorithms in Bertrand Competition Under Incomplete Information
Martin Bichler, Jan-Sebastian Hoehener

TL;DR
This paper proves that stochastic first-order algorithms converge to equilibrium in Bayesian Bertrand competition, providing rigorous guarantees and challenging claims of collusion in algorithmic pricing.
Contribution
It establishes convergence of Regularized Robbins-Monro algorithms to Bayes-Nash equilibrium in incomplete information settings, with explicit stability analysis.
Findings
RRM algorithms converge almost surely to the unique equilibrium
A global Lyapunov function is constructed for the dynamics
Provides rigorous convergence guarantees in Bayesian pricing games
Abstract
Autonomous pricing agents are widely deployed in online marketplaces, making algorithmic pricing a prominent application of multi-agent learning. Experimental studies often report collusive outcomes, but these findings typically rely on Q-learning in complete-information environments and lack rigorous convergence guarantees. In this paper, we study the stochastic learning dynamics of Regularized Robbins-Monro (RRM) algorithms in a Bayesian Bertrand competition with private costs. We show that this setting violates standard stability conditions, including monotonicity and the Minty variational inequality, rendering classical convergence results for gradient-based learning inapplicable. Despite this, we prove that Euclidean RRM algorithms converge almost surely to the unique, efficient Bayes-Nash equilibrium within a finite-dimensional approximation of the strategy space. By analyzing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
