Nonparametric Contextual Online Bilateral Trade
Emanuele Coccia, Martino Bernasconi, Andrea Celli

TL;DR
This paper introduces a nonparametric approach to online bilateral trade with contextual information, achieving near-optimal regret bounds under minimal feedback and budget constraints.
Contribution
It develops a hierarchical tree-based algorithm for nonparametric contextual trade, extending beyond linear models, with proven regret bounds and matching lower bounds.
Findings
Regret bound of O(T^{(d-1)/d}) for the proposed algorithm
Algorithm operates effectively with one-bit feedback and no market subsidies
Matching lower bound established in the full-feedback setting
Abstract
We study the problem of contextual online bilateral trade. At each round, the learner faces a seller-buyer pair and must propose a trade price without observing their private valuations for the item being sold. The goal of the learner is to post prices to facilitate trades between the two parties. Before posting a price, the learner observes a -dimensional context vector that influences the agent's valuations. Prior work in the contextual setting has focused on linear models. In this work, we tackle a general nonparametric setting in which the buyer's and seller's valuations behave according to arbitrary Lipschitz functions of the context. We design an algorithm that leverages contextual information through a hierarchical tree construction and guarantees regret . Remarkably, our algorithm operates under two stringent features of the setting: (1) one-bit…
Peer Reviews
Decision·ICLR 2026 Poster
- This paper extends the contextual online bilateral trade framework from linear valuations to a general nonparametric setting where valuations are arbitrary Lipschitz functions of the context. The main results are achieved under the restrictive setting of one-bit feedback and strong budget balance, a setup where prior work often required relaxing one of these constraints. - A major strength is the tightness of the results. The paper establishes an upper bound on regret of $\tilde{O}(T^{(d-1)/d}
- The assumption that valuations are deterministic functions of the context ($s_t = f_s(x_t)$, $b_t = f_b(x_t)$) is a significant limitation. Real-world valuations are typically subject to noise. The authors correctly note in the conclusion that adding noise is an open question, but the implications of this assumption on the main results could be discussed more prominently. The deterministic nature is precisely what allows certain proof techniques to work and is a key reason why the tight regret
The paper is generally well-written and clear. The collection of theoretical results is non-trivial and technically strong. The authors provide a complete picture for regret in contextual online bilateral trade in a nonparametric setting.
1. It seems that the overhead in the regret bound is pretty large, which might make the proposed algorithm not applicable in practice. 2. The paper would be stronger if the authors can conduct empirical studies to validate the effectiveness of the proposed algorithm for practical instances. 3. This paper could be better suited for other theory and/or econ conferences, such as EC/SODA/STOC/FOCS.
The results claimed are clearly presented and their proofs are succinct yet rigourous. Overall the paper's soundness is commendable from a mathematical standpoint. I would also like to highlight that the authors openly point out a limitation of their main algorithm (dependence on knowledge of the Lipschitz constant of the valuation functions) and provide an adapted algorithm. The presentation is clear and the text is easy to follow, mostly due to the relatively new nature of the online bilatera
The main weakness for me is in the contribution, or perhaps its contextualisation. While technically sound, the results of the paper didn’t bring me any exciting new insight into online learning/estimation. While bilateral trade is an interesting setting, I didn’t find a clear characterisation of what key properties it has, to what extent they are unique or generalise, say to/from online auction problems. I think a broader characterisation of the problem and its properties (or a clear applicatio
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Consumer Market Behavior and Pricing
