An Information-Theoretic Intersectional Data Valuation Theory
Eduardo C. Garrido-Merch\'an

TL;DR
This paper introduces an intersectional data valuation framework using information theory to quantify privacy externalities, enabling fairer data trading and empowering vulnerable groups.
Contribution
It proposes a formal, model-independent pricing rule based on mutual information to internalize intersectional privacy losses in digital markets.
Findings
A mutual information-based surcharge discourages harmful data trades.
The method operates independently of the statistical model of intersectional variables.
Regulators can calibrate the surcharge to reflect societal values.
Abstract
In contemporary digital markets, personal data often reveals not just isolated traits, but complex, intersectional identities based on combinations of race, gender, disability, and other protected characteristics. This exposure generates a privacy externality: firms benefit economically from profiling, prediction, and personalization, while users face hidden costs in the form of social risk and discrimination. We introduce a formal pricing rule that quantifies and internalizes this intersectional privacy loss using mutual information, assigning monetary value to the entropy reduction induced by each datum. The result is a Pigouvian-style surcharge that discourages harmful data trades and rewards transparency. Our formulation has the advantage that it operates independently of the underlying statistical model of the intersectional variables, be it parametric, nonparametric, or learned,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
