Evaluation of Confidence-based Ensembling in Deep Learning Image Classification
Rafael Rosales, Peter Popov, Michael Paulitsch

TL;DR
This paper evaluates confidence-based ensembling (Conf-Ensemble) in complex image classification tasks using ImageNet, revealing its limitations and proposing improvements to enhance its effectiveness in multi-class scenarios.
Contribution
The study extends Conf-Ensemble to large-scale image classification, proposes an improvement to increase sample diversity, and assesses its performance on ImageNet.
Findings
Conf-Ensemble shows limited benefit in complex multi-label classification.
An improved three-member Conf-Ensemble can outperform a single model, but gains are modest.
Large data and complex features are crucial for the success of confidence-based ensembling.
Abstract
Ensembling is a successful technique to improve the performance of machine learning (ML) models. Conf-Ensemble is an adaptation to Boosting to create ensembles based on model confidence instead of model errors to better classify difficult edge-cases. The key idea is to create successive model experts for samples that were difficult (not necessarily incorrectly classified) by the preceding model. This technique has been shown to provide better results than boosting in binary-classification with a small feature space (~80 features). In this paper, we evaluate the Conf-Ensemble approach in the much more complex task of image classification with the ImageNet dataset (224x224x3 features with 1000 classes). Image classification is an important benchmark for AI-based perception and thus it helps to assess if this method can be used in safety-critical applications using ML ensembles. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning
