Checkpoint Ensembles: Ensemble Methods from a Single Training Process
Hugh Chen, Scott Lundberg, Su-In Lee

TL;DR
Checkpoint ensembles enable the creation of ensemble models from a single training run, improving neural network performance by averaging multiple checkpoints, thus reducing overfitting and capturing benefits of traditional ensemble methods.
Contribution
This paper introduces checkpoint ensembles, a novel technique that combines model selection and ensembling within a single training process for neural networks.
Findings
Checkpoint ensembles outperform model selection by minimum validation score.
They achieve performance gains similar to traditional ensemble methods.
Effective across text, image, and health record datasets.
Abstract
We present the checkpoint ensembles method that can learn ensemble models on a single training process. Although checkpoint ensembles can be applied to any parametric iterative learning technique, here we focus on neural networks. Neural networks' composable and simple neurons make it possible to capture many individual and interaction effects among features. However, small sample sizes and sampling noise may result in patterns in the training data that are not representative of the true relationship between the features and the outcome. As a solution, regularization during training is often used (e.g. dropout). However, regularization is no panacea -- it does not perfectly address overfitting. Even with methods like dropout, two methodologies are commonly used in practice. First is to utilize a validation set independent to the training set as a way to decide when to stop training.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification
