Boosted Dynamic Neural Networks

Haichao Yu; Haoxiang Li; Gang Hua; Gao Huang; Humphrey Shi

arXiv:2211.16726·cs.LG·December 1, 2022

Boosted Dynamic Neural Networks

Haichao Yu, Haoxiang Li, Gang Hua, Gao Huang, Humphrey Shi

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces BoostNet, a novel approach to early-exiting dynamic neural networks inspired by gradient boosting, which addresses training-test mismatch issues and achieves state-of-the-art results on CIFAR100 and ImageNet.

Contribution

The paper proposes BoostNet, a new additive model for EDNNs that mitigates training-test mismatch and improves prediction accuracy through multiple training techniques.

Findings

01

Achieves state-of-the-art performance on CIFAR100.

02

Outperforms existing EDNN methods on ImageNet.

03

Effective in both anytime and budgeted-batch prediction modes.

Abstract

Early-exiting dynamic neural networks (EDNN), as one type of dynamic neural networks, has been widely studied recently. A typical EDNN has multiple prediction heads at different layers of the network backbone. During inference, the model will exit at either the last prediction head or an intermediate prediction head where the prediction confidence is higher than a predefined threshold. To optimize the model, these prediction heads together with the network backbone are trained on every batch of training data. This brings a train-test mismatch problem that all the prediction heads are optimized on all types of data in training phase while the deeper heads will only see difficult inputs in testing phase. Treating training and testing inputs differently at the two phases will cause the mismatch between training and testing data distributions. To mitigate this problem, we formulate an EDNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SHI-Labs/Boosted-Dynamic-Networks
pytorchOfficial

Videos

Boosted Dynamic Neural Networks· underline

Taxonomy

TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI