Robust Learning with Private Information
Kyohei Okumura

TL;DR
This paper investigates how standard learning algorithms can be exploited by strategic, adaptive platform policies to extract private information, and proposes a robust algorithm that prevents such rent extraction while maintaining optimal performance under stationarity.
Contribution
It introduces a misspecification-robust learning algorithm that tests for stationarity and resists strategic manipulation by adaptive principals.
Findings
Standard algorithms can be exploited for full surplus extraction.
The proposed robust algorithm achieves optimal payoff under stationarity.
It guarantees minimum utility for each type against adaptive strategies.
Abstract
Firms increasingly delegate decisions to learning algorithms in platform markets. Standard algorithms perform well when platform policies are stationary, but firms often face ambiguity about whether policies are stationary or adapt strategically to their behavior. When policies adapt, efficient learning under stationarity may backfire: it may reveal a firm's persistent private information, allowing the platform to personalize terms and extract information rents. We study a repeated screening problem in which an agent with a fixed private type commits ex ante to a learning algorithm, facing ambiguity about the principal's policy. We show that a broad class of standard algorithms, including all no-external-regret algorithms, can be manipulated by adaptive principals and permit asymptotic full surplus extraction. We then construct a misspecification-robust learning algorithm that treats…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Platforms and Economics · Auction Theory and Applications · Game Theory and Applications
