From Confounding to Learning: Dynamic Service Fee Pricing on Third-Party Platforms

Rui Ai; David Simchi-Levi; Feng Zhu

arXiv:2512.22749·cs.LG·December 30, 2025

From Confounding to Learning: Dynamic Service Fee Pricing on Third-Party Platforms

Rui Ai, David Simchi-Levi, Feng Zhu

PDF

Open Access

TL;DR

This paper develops a novel demand learning algorithm for third-party platform pricing, addressing confounding issues and supply noise, with theoretical guarantees and real-world validation.

Contribution

It introduces an optimal regret algorithm for demand learning under confounding, utilizing instrumental variables and neural networks, with practical applications demonstrated.

Findings

01

Optimal regret of ilde{ ext{O}}(\sqrt{T} ext{ or } \sigma_S^{-2})

02

Supply-side noise causes a phase transition in demand learnability

03

First efficiency guarantee for neural network-based demand estimation

Abstract

We study the pricing behavior of third-party platforms facing strategic agents. Assuming the platform is a revenue maximizer, it observes market features that generally affect demand. Since only the equilibrium price and quantity are observable, this presents a general demand learning problem under confounding. Mathematically, we develop an algorithm with optimal regret of $\Tilde \cO (T \land σ_{S}^{- 2})$ . Our results reveal that supply-side noise fundamentally affects the learnability of demand, leading to a phase transition in regret. Technically, we show that non-i.i.d. actions can serve as instrumental variables for learning demand. We also propose a novel homeomorphic construction that allows us to establish estimation bounds without assuming star-shapedness, providing the first efficiency guarantee for learning demand with deep neural networks. Finally, we demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Age of Information Optimization