Nonparametric Contextual Online Bilateral Trade

Emanuele Coccia; Martino Bernasconi; Andrea Celli

arXiv:2602.12904·cs.GT·February 16, 2026

Nonparametric Contextual Online Bilateral Trade

Emanuele Coccia, Martino Bernasconi, Andrea Celli

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a nonparametric approach to online bilateral trade with contextual information, achieving near-optimal regret bounds under minimal feedback and budget constraints.

Contribution

It develops a hierarchical tree-based algorithm for nonparametric contextual trade, extending beyond linear models, with proven regret bounds and matching lower bounds.

Findings

01

Regret bound of O(T^{(d-1)/d}) for the proposed algorithm

02

Algorithm operates effectively with one-bit feedback and no market subsidies

03

Matching lower bound established in the full-feedback setting

Abstract

We study the problem of contextual online bilateral trade. At each round, the learner faces a seller-buyer pair and must propose a trade price without observing their private valuations for the item being sold. The goal of the learner is to post prices to facilitate trades between the two parties. Before posting a price, the learner observes a $d$ -dimensional context vector that influences the agent's valuations. Prior work in the contextual setting has focused on linear models. In this work, we tackle a general nonparametric setting in which the buyer's and seller's valuations behave according to arbitrary Lipschitz functions of the context. We design an algorithm that leverages contextual information through a hierarchical tree construction and guarantees regret $O (T^{(d - 1) / d})$ . Remarkably, our algorithm operates under two stringent features of the setting: (1) one-bit…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 3

Strengths

- This paper extends the contextual online bilateral trade framework from linear valuations to a general nonparametric setting where valuations are arbitrary Lipschitz functions of the context. The main results are achieved under the restrictive setting of one-bit feedback and strong budget balance, a setup where prior work often required relaxing one of these constraints. - A major strength is the tightness of the results. The paper establishes an upper bound on regret of $\tilde{O}(T^{(d-1)/d}

Weaknesses

- The assumption that valuations are deterministic functions of the context ($s_t = f_s(x_t)$, $b_t = f_b(x_t)$) is a significant limitation. Real-world valuations are typically subject to noise. The authors correctly note in the conclusion that adding noise is an open question, but the implications of this assumption on the main results could be discussed more prominently. The deterministic nature is precisely what allows certain proof techniques to work and is a key reason why the tight regret

Reviewer 02Rating 6Confidence 3

Strengths

The paper is generally well-written and clear. The collection of theoretical results is non-trivial and technically strong. The authors provide a complete picture for regret in contextual online bilateral trade in a nonparametric setting.

Weaknesses

1. It seems that the overhead in the regret bound is pretty large, which might make the proposed algorithm not applicable in practice. 2. The paper would be stronger if the authors can conduct empirical studies to validate the effectiveness of the proposed algorithm for practical instances. 3. This paper could be better suited for other theory and/or econ conferences, such as EC/SODA/STOC/FOCS.

Reviewer 03Rating 4Confidence 3

Strengths

The results claimed are clearly presented and their proofs are succinct yet rigourous. Overall the paper's soundness is commendable from a mathematical standpoint. I would also like to highlight that the authors openly point out a limitation of their main algorithm (dependence on knowledge of the Lipschitz constant of the valuation functions) and provide an adapted algorithm. The presentation is clear and the text is easy to follow, mostly due to the relatively new nature of the online bilatera

Weaknesses

The main weakness for me is in the contribution, or perhaps its contextualisation. While technically sound, the results of the paper didn’t bring me any exciting new insight into online learning/estimation. While bilateral trade is an interesting setting, I didn’t find a clear characterisation of what key properties it has, to what extent they are unique or generalise, say to/from online auction problems. I think a broader characterisation of the problem and its properties (or a clear applicatio

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Consumer Market Behavior and Pricing