On the Learning Curves of Revenue Maximization
Steve Hanneke, Alkis Kalavasis, Shay Moran, Grigoris Velegkas

TL;DR
This paper studies the rate at which revenue-maximizing algorithms improve with more data, revealing that convergence can be arbitrarily slow or fast depending on distribution properties.
Contribution
It provides a near-complete characterization of learning curve decay rates in revenue maximization for a single item and buyer, highlighting differences from PAC learning.
Findings
Existence of a Bayes-consistent algorithm with arbitrarily slow convergence.
When optimal revenue is achieved by a finite price, decay rate is roughly 1/√n.
For discrete distributions, learning curves decay almost exponentially fast.
Abstract
Learning curves are a fundamental primitive in supervised learning, describing how an algorithm's performance improves with more data and providing a quantitative measure of its generalization ability. Formally, a learning curve plots the decay of an algorithm's error for a fixed underlying distribution as a function of the number of training samples. Prior work on revenue-maximizing learning algorithms, starting with the seminal work of Cole and Roughgarden [STOC, 2014], adopts a distribution-free perspective, which parallels the PAC learning framework in learning theory. This approach evaluates performance against the hardest possible sequence of valuation distributions, one for each sample size, effectively defining the upper envelope of learning curves over all possible distributions, thus leading to error bounds that do not capture the shape of the learning curves. In this work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
