Data Measurements for Decentralized Data Markets
Charles Lu, Mohammad Mohammadi Amiri, Ramesh Raskar

TL;DR
This paper introduces federated data measurement techniques for decentralized data markets, enabling buyers to efficiently identify relevant and diverse datasets from sellers without intermediaries or specialized models.
Contribution
It proposes and benchmarks novel federated data measurement methods that facilitate seller selection based on diversity and relevance in decentralized markets.
Findings
Federated measurements effectively identify relevant datasets
Diversity measures improve dataset selection quality
Benchmark results demonstrate practical viability
Abstract
Decentralized data markets can provide more equitable forms of data acquisition for machine learning. However, to realize practical marketplaces, efficient techniques for seller selection need to be developed. We propose and benchmark federated data measurements to allow a data buyer to find sellers with relevant and diverse datasets. Diversity and relevance measures enable a buyer to make relative comparisons between sellers without requiring intermediate brokers and training task-dependent models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Platforms and Economics · Auction Theory and Applications · Complex Systems and Time Series Analysis
