Online Market Making and the Value of Observing the Order Book
Davide Maran, Marcello Restelli

TL;DR
This paper investigates online market-making with action-dependent feedback, proposing algorithms that leverage order book observations to achieve improved regret bounds in stochastic and adversarial settings.
Contribution
It introduces new algorithms for online market making that utilize limited order book feedback, significantly enhancing regret guarantees over traditional bandit models.
Findings
Achieves $O(\sqrt T)$ regret in stochastic settings without smoothness assumptions.
Extends results to mean-reverting price processes with high-probability bounds.
Designs an explore-then-perturb algorithm with $O(T^{2/3})$ regret in adversarial settings.
Abstract
We study an online market-making problem in which a learner sequentially posts bid and ask prices for a single asset while interacting with traders holding private valuations. Unlike existing online learning formulations that assume fully censored feedback, we introduce an action-dependent feedback model inspired by real limit order books: when a trade occurs, the trader's valuation remains hidden, whereas when no trade occurs, informative feedback about supply and demand is revealed. We show that this additional information fundamentally changes the learnability of the problem. In the stochastic setting with i.i.d. market prices, we propose an elimination-based algorithm that achieves regret with high probability, without requiring any smoothness assumptions on the distribution of trader valuations. We then extend this result to a broad class of mean-reverting price…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
