Large deviations for the perceptron model and consequences for active   learning

Hugo Cui; Luca Saglietti; Lenka Zdeborov\'a

arXiv:1912.03927·cond-mat.dis-nn·September 4, 2020

Large deviations for the perceptron model and consequences for active learning

Hugo Cui, Luca Saglietti, Lenka Zdeborov\'a

PDF

TL;DR

This paper analyzes the theoretical limits of active learning performance using large deviation theory and replica methods, demonstrating that simple algorithms can approach optimal accuracy boundaries.

Contribution

It introduces a novel theoretical framework for understanding active learning limits via large deviations and replica analysis, and shows simple algorithms can nearly reach these bounds.

Findings

01

Optimal performance boundaries are derived for active learning.

02

Simple message-passing algorithms can approach these optimal bounds.

03

Comparison shows certain strategies are close to theoretical limits.

Abstract

Active learning is a branch of machine learning that deals with problems where unlabeled data is abundant yet obtaining labels is expensive. The learning algorithm has the possibility of querying a limited number of samples to obtain the corresponding labels, subsequently used for supervised learning. In this work, we consider the task of choosing the subset of samples to be labeled from a fixed finite pool of samples. We assume the pool of samples to be a random matrix and the ground truth labels to be generated by a single-layer teacher random neural network. We employ replica methods to analyze the large deviations for the accuracy achieved after supervised learning on a subset of the original pool. These large deviations then provide optimal achievable performance boundaries for any active learning algorithm. We show that the optimal learning performance can be efficiently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.