Feedback Coding for Active Learning
Gregory Canal, Matthieu Bloch, Christopher Rozell

TL;DR
This paper introduces a novel feedback coding approach, Approximate Posterior Matching (APM), for active learning, leveraging information theory concepts to improve efficiency and reduce computational costs in selecting data for labeling.
Contribution
The paper formulates active learning as a feedback coding problem, developing the APM algorithm that applies optimal transport theory to enhance data selection efficiency.
Findings
APM achieves comparable learning performance to existing methods.
APM reduces computational costs in active learning tasks.
The approach demonstrates the potential of information theory in active learning design.
Abstract
The iterative selection of examples for labeling in active machine learning is conceptually similar to feedback channel coding in information theory: in both tasks, the objective is to seek a minimal sequence of actions to encode information in the presence of noise. While this high-level overlap has been previously noted, there remain open questions on how to best formulate active learning as a communications system to leverage existing analysis and algorithms in feedback coding. In this work, we formally identify and leverage the structural commonalities between the two problems, including the characterization of encoder and noisy channel components, to design a new algorithm. Specifically, we develop an optimal transport-based feedback coding scheme called Approximate Posterior Matching (APM) for the task of active example selection and explore its application to Bayesian logistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Machine Learning and Data Classification
