Towards Inference Efficient Deep Ensemble Learning
Ziyue Li, Kan Ren, Yifan Yang, Xinyang Jiang, Yuqing Yang, Dongsheng, Li

TL;DR
This paper introduces a novel inference-efficient ensemble learning approach that dynamically halts ensemble inference per sample, significantly reducing computational costs while maintaining performance.
Contribution
It proposes a sequential inference framework with a learned halting mechanism, optimizing both effectiveness and efficiency in ensemble models.
Findings
Up to 56% reduction in inference cost
Maintains comparable performance to full ensembles
Outperforms other baseline methods in efficiency
Abstract
Ensemble methods can deliver surprising performance gains but also bring significantly higher computational costs, e.g., can be up to 2048X in large-scale ensemble tasks. However, we found that the majority of computations in ensemble methods are redundant. For instance, over 77% of samples in CIFAR-100 dataset can be correctly classified with only a single ResNet-18 model, which indicates that only around 23% of the samples need an ensemble of extra models. To this end, we propose an inference efficient ensemble learning method, to simultaneously optimize for effectiveness and efficiency in ensemble learning. More specifically, we regard ensemble of models as a sequential inference process and learn the optimal halting event for inference on a specific sample. At each timestep of the inference process, a common selector judges if the current ensemble has reached ensemble effectiveness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Advanced Neural Network Applications · Machine Learning and Data Classification
MethodsBalanced Selection
