No-Regret Learning from Partially Observed Data in Repeated Auctions
Orcun Karaca, Pier Giuseppe Sessa, Anna Leidi, Maryam Kamgarpour

TL;DR
This paper develops and analyzes modified no-regret algorithms for repeated auctions with partial feedback, improving convergence guarantees and applicability in electricity markets.
Contribution
It introduces a novel approach to leverage partial utility observations in auction settings, enhancing regret bounds over traditional bandit algorithms.
Findings
Modified algorithms exploit additional feedback for better utility estimation.
Improved regret guarantees compared to standard bandit algorithms.
Validated on realistic electricity market models.
Abstract
We study a general class of repeated auctions, such as the ones found in electricity markets, as multi-agent games between the bidders. In such a repeated setting, bidders can adapt their strategies online based on the data observed in the previous auction rounds. Moreover, if no-regret algorithms are employed by the bidders to update their strategies, the game is known to converge to a coarse-correlated equilibrium, which generalizes the notion of Nash equilibrium to a probabilistic view of the auction state. Well-studied no-regret algorithms depend on the feedback information available at every round, and can be mainly distinguished as bandit (or payoff-based), and full-information. However, the information structure found in auctions lies in between these two models, since participants can often obtain partial observations of their utilities under different strategies. To this end,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
