Semi-supervised Active Regression

Fnu Devvrit; Nived Rajaraman; Pranjal Awasthi

arXiv:2106.06676·cs.LG·June 15, 2021

Semi-supervised Active Regression

Fnu Devvrit, Nived Rajaraman, Pranjal Awasthi

PDF

Open Access

TL;DR

This paper introduces a semi-supervised active learning framework for linear regression that minimizes label queries by leveraging partially labeled data, achieving near-optimal bounds based on an instance-dependent parameter called reduced rank.

Contribution

The paper formalizes semi-supervised active regression, introduces the reduced rank parameter, and provides an efficient algorithm with optimal query complexity bounds for ridge and kernel ridge regression.

Findings

01

Proposes an algorithm with query complexity O(R_X/ε)

02

Establishes matching lower bounds for active ridge regression

03

Improves bounds for ridge and kernel ridge regression cases

Abstract

Labelled data often comes at a high cost as it may require recruiting human labelers or running costly experiments. At the same time, in many practical scenarios, one already has access to a partially labelled, potentially biased dataset that can help with the learning task at hand. Motivated by such settings, we formally initiate a study of $se mi - s u p er v i se d$ $a c t i v e$ $l e a r nin g$ through the frame of linear regression. In this setting, the learner has access to a dataset $X \in R^{(n_{1} + n_{2}) \times d}$ which is composed of $n_{1}$ unlabelled examples that an algorithm can actively query, and $n_{2}$ examples labelled a-priori. Concretely, denoting the true labels by $Y \in R^{n_{1} + n_{2}}$ , the learner's objective is to find $β \in R^{d}$ such that, \begin{equation} \| X \widehat{\beta} - Y \|_2^2 \le (1 + \epsilon) \min_{\beta \in \mathbb{R}^d} \| X…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Sparse and Compressive Sensing Techniques