Strategy-robust Online Learning in Contextual Pricing

Joon Suk Huh; Kirthevasan Kandasamy

arXiv:2511.19842·cs.GT·February 18, 2026

Strategy-robust Online Learning in Contextual Pricing

Joon Suk Huh, Kirthevasan Kandasamy

PDF

Open Access

TL;DR

This paper develops a strategy-robust online learning framework for contextual pricing, addressing strategic buyer behavior and introducing a polynomial-time approximation scheme and a novel Sparse Update Mechanism to ensure robustness.

Contribution

It introduces a strategy-robust notion of regret, a PTAS for linear pricing policies, and the Sparse Update Mechanism that guarantees robustness against strategic buyers.

Findings

01

The PTAS efficiently learns pricing policies in adversarial environments.

02

The Sparse Update Mechanism ensures robustness to all Nash equilibria.

03

A black-box reduction from online experts to strategy-robust learners is provided.

Abstract

Learning effective pricing strategies is crucial in digital marketplaces, especially when buyers' valuations are unknown and must be inferred through interaction. We study the online contextual pricing problem, where a seller observes a stream of context-valuation pairs and dynamically sets prices. Moreover, departing from traditional online learning frameworks, we consider a strategic setting in which buyers may misreport valuations to influence future prices, a challenge known as strategic overfitting (Amin et al. 2013). We introduce a strategy-robust notion of regret for multi-buyer online environments, capturing worst-case strategic behavior in the spirit of the Price of Anarchy. Our first contribution is a polynomial-time approximation scheme (PTAS) for learning linear pricing policies in adversarial, adaptive environments, enabled by a novel online sketching technique. Building…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Age of Information Optimization