Low-Cost Learning via Active Data Procurement

Jacob Abernethy; Yiling Chen; Chien-Ju Ho; Bo Waggoner

arXiv:1502.05774·cs.GT·June 9, 2015

Low-Cost Learning via Active Data Procurement

Jacob Abernethy, Yiling Chen, Chien-Ju Ho, Bo Waggoner

PDF

TL;DR

This paper introduces mechanisms for actively procuring data from strategic agents under budget constraints, providing guarantees on predictive error that improve with increased budget and leveraging data-cost correlations.

Contribution

It develops a framework converting no-regret algorithms into active data procurement mechanisms with robust risk bounds under strategic behavior and budget limits.

Findings

01

Achieves risk bounds of order 1/√B with budget B.

02

Provides regret bounds of order T/√B for the procurement process.

03

Demonstrates lower bounds matching the regret bounds.

Abstract

We design mechanisms for online procurement of data held by strategic agents for machine learning tasks. The challenge is to use past data to actively price future data and give learning guarantees even when an agent's cost for revealing her data may depend arbitrarily on the data itself. We achieve this goal by showing how to convert a large class of no-regret algorithms into online posted-price and learning mechanisms. Our results in a sense parallel classic sample complexity guarantees, but with the key resource being money rather than quantity of data: With a budget constraint $B$ , we give robust risk (predictive error) bounds on the order of $1/ B$ . Because we use an active approach, we can often guarantee to do significantly better by leveraging correlations between costs and data. Our algorithms and analysis go through a model of no-regret learning with $T$ arriving pairs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.