Reciprocal Learning
Julian Rodemann, Christoph Jansen, Georg Schollmeyer

TL;DR
This paper unifies various machine learning algorithms under the reciprocal learning paradigm, showing they iteratively update data based on model fit, and analyzes their convergence properties using decision theory.
Contribution
It introduces reciprocal learning as a general framework, providing convergence analysis and conditions for a broad class of algorithms like active learning and self-training.
Findings
Reciprocal learning algorithms converge linearly under mild conditions.
Probabilistic predictions and non-greedy, regularized sample updates ensure convergence.
The framework unifies and generalizes multiple existing learning algorithms.
Abstract
We demonstrate that a wide array of machine learning algorithms are specific instances of one single paradigm: reciprocal learning. These instances range from active learning over multi-armed bandits to self-training. We show that all these algorithms do not only learn parameters from data but also vice versa: They iteratively alter training data in a way that depends on the current model fit. We introduce reciprocal learning as a generalization of these algorithms using the language of decision theory. This allows us to study under what conditions they converge. The key is to guarantee that reciprocal learning contracts such that the Banach fixed-point theorem applies. In this way, we find that reciprocal learning algorithms converge at linear rates to an approximately optimal model under relatively mild assumptions on the loss function, if their predictions are probabilistic and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference
