The Power of Comparisons for Actively Learning Linear Classifiers
Max Hopkins, Daniel M. Kane, Shachar Lovett

TL;DR
This paper demonstrates that comparison queries significantly enhance active learning efficiency for linear classifiers under certain assumptions, reducing sample complexity exponentially compared to traditional methods.
Contribution
It introduces the use of comparison queries in active learning, showing they enable exponential improvements in learning linear classifiers with fewer samples.
Findings
Comparison queries reduce sample complexity exponentially.
Active learning with comparison queries outperforms traditional label-only methods.
Results hold for both PAC and RPU learning models.
Abstract
In the world of big data, large but costly to label datasets dominate many fields. Active learning, a semi-supervised alternative to the standard PAC-learning model, was introduced to explore whether adaptive labeling could learn concepts with exponentially fewer labeled samples. While previous results show that active learning performs no better than its supervised alternative for important concept classes such as linear separators, we show that by adding weak distributional assumptions and allowing comparison queries, active learning requires exponentially fewer samples. Further, we show that these results hold as well for a stronger model of learning called Reliable and Probably Useful (RPU) learning. In this model, our learner is not allowed to make mistakes, but may instead answer "I don't know." While previous negative results showed this model to have intractably large sample…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Machine Learning and Data Classification
