Checkpoint Ensembles: Ensemble Methods from a Single Training Process

Hugh Chen; Scott Lundberg; Su-In Lee

arXiv:1710.03282·cs.LG·October 11, 2017·35 cites

Checkpoint Ensembles: Ensemble Methods from a Single Training Process

Hugh Chen, Scott Lundberg, Su-In Lee

PDF

Open Access 1 Repo

TL;DR

Checkpoint ensembles enable the creation of ensemble models from a single training run, improving neural network performance by averaging multiple checkpoints, thus reducing overfitting and capturing benefits of traditional ensemble methods.

Contribution

This paper introduces checkpoint ensembles, a novel technique that combines model selection and ensembling within a single training process for neural networks.

Findings

01

Checkpoint ensembles outperform model selection by minimum validation score.

02

They achieve performance gains similar to traditional ensemble methods.

03

Effective across text, image, and health record datasets.

Abstract

We present the checkpoint ensembles method that can learn ensemble models on a single training process. Although checkpoint ensembles can be applied to any parametric iterative learning technique, here we focus on neural networks. Neural networks' composable and simple neurons make it possible to capture many individual and interaction effects among features. However, small sample sizes and sampling noise may result in patterns in the training data that are not representative of the true relationship between the features and the outcome. As a solution, regularization during training is often used (e.g. dropout). However, regularization is no panacea -- it does not perfectly address overfitting. Even with methods like dropout, two methodologies are commonly used in practice. First is to utilize a validation set independent to the training set as a way to decide when to stop training.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ENSTA-U2IS-AI/torch-uncertainty
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification