Delegating Data Collection in Decentralized Machine Learning

Nivasini Ananthakrishnan; Stephen Bates; Michael I. Jordan; and Nika; Haghtalab

arXiv:2309.01837·cs.LG·November 21, 2024

Delegating Data Collection in Decentralized Machine Learning

Nivasini Ananthakrishnan, Stephen Bates, Michael I. Jordan, and Nika, Haghtalab

PDF

Open Access

TL;DR

This paper develops contract-based mechanisms for data collection in decentralized machine learning, addressing information asymmetries and providing adaptive solutions that improve utility.

Contribution

It introduces optimal and near-optimal contract designs for decentralized ML data collection, including adaptive convex programming methods and analysis of multiple interaction settings.

Findings

01

Linear contracts achieve 1-1/e of optimal utility.

02

Convex programming efficiently computes adaptive optimal contracts.

03

Analysis extends to multiple interaction scenarios.

Abstract

Motivated by the emergence of decentralized machine learning (ML) ecosystems, we study the delegation of data collection. Taking the field of contract theory as our starting point, we design optimal and near-optimal contracts that deal with two fundamental information asymmetries that arise in decentralized ML: uncertainty in the assessment of model quality and uncertainty regarding the optimal performance of any model. We show that a principal can cope with such asymmetry via simple linear contracts that achieve 1-1/e fraction of the optimal utility. To address the lack of a priori knowledge regarding the optimal performance, we give a convex program that can adaptively and efficiently compute the optimal contract. We also study linear contracts and derive the optimal utility in the more complex setting of multiple interactions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Distributed Sensor Networks and Detection Algorithms · Age of Information Optimization