Diversified Ensemble of Independent Sub-Networks for Robust Self-Supervised Representation Learning
Amirhossein Vahidi, Lisa Wimmer, H\"useyin Anil G\"und\"uz, Bernd, Bischl, Eyke H\"ullermeier, Mina Rezaei

TL;DR
This paper introduces a novel ensemble method of independent sub-networks for self-supervised learning that improves robustness, calibration, and uncertainty estimation with minimal computational costs across diverse tasks and data modalities.
Contribution
It proposes a new training regime and loss function to efficiently create diverse sub-model ensembles for self-supervised learning, enhancing performance and reliability.
Findings
Improves out-of-distribution detection and dataset robustness.
Achieves better calibration and uncertainty estimates.
Enhances accuracy across vision, NLP, and genomics tasks.
Abstract
Ensembling a neural network is a widely recognized approach to enhance model performance, estimate uncertainty, and improve robustness in deep supervised learning. However, deep ensembles often come with high computational costs and memory demands. In addition, the efficiency of a deep ensemble is related to diversity among the ensemble members which is challenging for large, over-parameterized deep neural networks. Moreover, ensemble learning has not yet seen such widespread adoption, and it remains a challenging endeavor for self-supervised or unsupervised representation learning. Motivated by these challenges, we present a novel self-supervised training regime that leverages an ensemble of independent sub-networks, complemented by a new loss function designed to encourage diversity. Our method efficiently builds a sub-model ensemble with high diversity, leading to well-calibrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsDeep Ensembles
