Online Active Regression
Cheng Chen, Yi Li, Yiming Sun

TL;DR
This paper introduces online algorithms for active regression that efficiently maintain linear models with minimal label queries, suitable for large and streaming datasets, and demonstrates their effectiveness both theoretically and empirically.
Contribution
The paper presents novel online active regression algorithms for $ ext{ell}_p$ loss that require fewer label queries and work efficiently in streaming settings, extending prior offline methods.
Findings
Algorithms achieve $(1+\epsilon)$-approximate solutions with $ ilde{O}(\epsilon^{-1} d \log(n\kappa))$ label queries.
Numerical experiments confirm theoretical bounds and show comparable performance to offline methods.
Methods are effective for large-scale, streaming data with limited label budgets.
Abstract
Active regression considers a linear regression problem where the learner receives a large number of data points but can only observe a small number of labels. Since online algorithms can deal with incremental training data and take advantage of low computational cost, we consider an online extension of the active regression problem: the learner receives data points one by one and immediately decides whether it should collect the corresponding labels. The goal is to efficiently maintain the regression of received data points with a small budget of label queries. We propose novel algorithms for this problem under loss where . To achieve a -approximate solution, our proposed algorithms only require queries of labels, where is the number of data points and is a quantity, called the condition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Optimization and Search Problems
MethodsLinear Regression
