An Instrumental Value for Data Production and its Application to Data Pricing
Rui Ai, Boxiang Lyu, Zhaoran Wang, Zhuoran Yang, Haifeng Xu

TL;DR
This paper introduces a formal approach to quantify the instrumental value of data production processes based on decision-making context and prior information, and applies it to data pricing and revenue maximization.
Contribution
It develops a valuation framework for data's instrumental value, connecting it to information theory, and designs mechanisms for near-optimal data pricing in monopoly settings.
Findings
Seller can achieve full surplus with perfect data customization.
Limited data pools restrict revenue extraction, but mechanisms can approach optimal revenue.
In multi-armed bandits, the seller can attain first-best revenue.
Abstract
How much value does a dataset or a data production process have to an agent who wishes to use the data to assist decision-making? This is a fundamental question towards understanding the value of data as well as further pricing of data. This paper develops an approach for capturing the instrumental value of data production processes, which takes two key factors into account: (a) the context of the agent's decision-making problem; (b) prior data or information the agent already possesses. We ''micro-found'' our valuation concepts by showing how they connect to classic notions of information design and signals in information economics. When instantiated in the domain of Bayesian linear regression, our value naturally corresponds to information gain. Based on our designed data value, we then study a basic monopoly pricing setting with a buyer looking to purchase from a seller some labeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing
