Monotone Learning
Olivier Bousquet, Amit Daniely, Haim Kaplan, Yishay Mansour, and Shay Moran, Uri Stemmer

TL;DR
This paper proves that any multiclass classification algorithm can be transformed into a monotone one with similar performance, extending previous binary results and ensuring monotonicity without performance loss.
Contribution
It introduces an efficient black-box transformation that makes any learning algorithm monotone in multiclass classification, answering longstanding open questions.
Findings
Every learning algorithm can be transformed into a monotone one with similar performance.
The transformation is efficient and uses only black-box access to the original algorithm.
Monotone learners are achievable in distribution-free settings like PAC learning.
Abstract
The amount of training-data is one of the key factors which determines the generalization capacity of learning algorithms. Intuitively, one expects the error rate to decrease as the amount of training-data increases. Perhaps surprisingly, natural attempts to formalize this intuition give rise to interesting and challenging mathematical questions. For example, in their classical book on pattern recognition, Devroye, Gyorfi, and Lugosi (1996) ask whether there exists a {monotone} Bayes-consistent algorithm. This question remained open for over 25 years, until recently Pestov (2021) resolved it for binary classification, using an intricate construction of a monotone Bayes-consistent algorithm. We derive a general result in multiclass classification, showing that every learning algorithm A can be transformed to a monotone one with similar performance. Further, the transformation is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
