Fast Cross-Validation for Incremental Learning
Pooria Joulani, Andr\'as Gy\"orgy, Csaba Szepesv\'ari

TL;DR
This paper introduces a general, efficient method for cross-validation in incremental learning that significantly reduces computational costs and scales logarithmically with the number of folds, applicable to various tasks.
Contribution
It presents a novel, broadly applicable approach to perform cross-validation efficiently for incremental learning algorithms, unlike previous methods limited to specific models or domains.
Findings
Reduces CV computation time from linear to logarithmic scale.
Applicable to a wide range of supervised and unsupervised tasks.
Demonstrated effectiveness on state-of-the-art incremental algorithms.
Abstract
Cross-validation (CV) is one of the main tools for performance estimation and parameter tuning in machine learning. The general recipe for computing CV estimate is to run a learning algorithm separately for each CV fold, a computationally expensive process. In this paper, we propose a new approach to reduce the computational burden of CV-based performance estimation. As opposed to all previous attempts, which are specific to a particular learning model or problem domain, we propose a general method applicable to a large class of incremental learning algorithms, which are uniquely fitted to big data problems. In particular, our method applies to a wide range of supervised and unsupervised learning tasks with different performance criteria, as long as the base learning algorithm is incremental. We show that the running time of the algorithm scales logarithmically, rather than linearly, in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Face and Expression Recognition · Text and Document Classification Technologies
