Incentivizing Truthful Data Contributions in a Marketplace for Mean Estimation
Keran Chen, Alex Clinton, Kirthevasan Kandasamy

TL;DR
This paper designs a mechanism for a data marketplace that incentivizes truthful data contribution and reporting to estimate a mean, balancing welfare, profit, and incentive constraints.
Contribution
It introduces a mechanism that achieves equilibrium with truthful reporting and data collection, despite the absence of dominant strategies.
Findings
The mechanism achieves a Nash equilibrium with the two lowest-cost contributors collecting all data.
No nontrivial dominant-strategy incentive-compatible mechanism exists for this problem.
The proposed mechanism is optimal in Nash equilibrium compared to any other.
Abstract
We study a data marketplace where a broker intermediates between buyers, who seek to estimate the mean \(\mu\) of an unknown normal distribution \(\Ncal(\mu, \sigma^2)\), and contributors, who can collect data from this distribution at a cost. The broker delegates data collection work to contributors, aggregates reported datasets, sells it to buyers, and redistributes revenue as payments to contributors. We aim to maximize welfare or profit under key constraints: individual rationality for buyers and contributors, incentive compatibility (contributors are incentivized to comply with data collection instructions and truthfully report the collected data), and budget balance (total contributor payments equals total revenue). We first compute welfare/profit-optimal prices under truthful reporting; however, to incentivize data collection and truthful data reporting, we adjust them based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
