Boost Neural Networks by Checkpoints

Feng Wang; Guoyizhe Wei; Qiao Liu; Jinxiang Ou; Xian Wei; Hairong Lv

arXiv:2110.00959·cs.LG·October 27, 2021

Boost Neural Networks by Checkpoints

Feng Wang, Guoyizhe Wei, Qiao Liu, Jinxiang Ou, Xian Wei, Hairong Lv

PDF

Open Access 1 Video

TL;DR

This paper introduces a boosting-based checkpoint ensemble method for deep neural networks that accelerates convergence, enhances diversity, and improves accuracy without increasing training costs, outperforming existing ensemble techniques.

Contribution

A novel boosting scheme for checkpoint ensembling that theoretically guarantees convergence and empirically improves accuracy and efficiency in deep neural network training.

Findings

01

Achieves 4.16% lower error on Cifar-100

02

Achieves 6.96% lower error on Tiny-ImageNet

03

Yields up to 5.02% higher accuracy on imbalanced datasets

Abstract

Training multiple deep neural networks (DNNs) and averaging their outputs is a simple way to improve the predictive performance. Nevertheless, the multiplied training cost prevents this ensemble method to be practical and efficient. Several recent works attempt to save and ensemble the checkpoints of DNNs, which only requires the same computational cost as training a single network. However, these methods suffer from either marginal accuracy improvements due to the low diversity of checkpoints or high risk of divergence due to the cyclical learning rates they adopted. In this paper, we propose a novel method to ensemble the checkpoints, where a boosting scheme is utilized to accelerate model convergence and maximize the checkpoint diversity. We theoretically prove that it converges by reducing exponential loss. The empirical evaluation also indicates our proposed ensemble outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Boost Neural Networks by Checkpoints· slideslive

Taxonomy

TopicsImbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications · Advanced Neural Network Applications