The Cost of Balanced Training-Data Production in an Online Data Market
Augustin Chaintreau, Roland Maio, Juba Ziani

TL;DR
This paper analyzes the economic implications of implementing ethical fairness interventions in online data markets, revealing that market size and conditions significantly influence the cost and feasibility of such ethical measures.
Contribution
It introduces a stylized model to study the impact of fairness interventions on data market dynamics and costs, highlighting the importance of market size and conditions.
Findings
In small markets, fairness interventions can increase costs and push producers out.
In large markets, the cost of fairness diminishes as the market grows.
Market conditions critically determine the economic feasibility of ethical data production.
Abstract
Many ethical issues in machine learning are connected to the training data. Online data markets are an important source of training data, facilitating both production and distribution. Recently, a trend has emerged of for-profit "ethical" participants in online data markets. This trend raises a fascinating question: Can online data markets sustainably and efficiently address ethical issues in the broader machine-learning economy? In this work, we study this question in a stylized model of an online data market. We investigate the effects of intervening in the data market to achieve balanced training-data production. The model reveals the crucial role of market conditions. In small and emerging markets, an intervention can drive the data producers out of the market, so that the cost of fairness is maximal. Yet, in large and established markets, the cost of fairness can vanish (as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Timetabling Solutions · AI and HR Technologies · Education Methods and Technologies
