Transfer learning for ensembles: reducing computation time and keeping the diversity
Ilya Shashkov, Nikita Balabin, Evgeny Burnaev, Alexey Zaytsev

TL;DR
This paper introduces a transfer learning method for ensembles that reduces training time and maintains model diversity by shifting encoder weights and fine-tuning, achieving competitive accuracy with less computation.
Contribution
The proposed approach efficiently transfers ensemble models by weight shifting and minimal fine-tuning, preserving diversity and reducing computational costs.
Findings
Speed-up in ensemble training process
Maintains high ensemble diversity
Achieves competitive accuracy with reduced computation
Abstract
Transferring a deep neural network trained on one problem to another requires only a small amount of data and little additional computation time. The same behaviour holds for ensembles of deep learning models typically superior to a single model. However, a transfer of deep neural networks ensemble demands relatively high computational expenses. The probability of overfitting also increases. Our approach for the transfer learning of ensembles consists of two steps: (a) shifting weights of encoders of all models in the ensemble by a single shift vector and (b) doing a tiny fine-tuning for each individual model afterwards. This strategy leads to a speed-up of the training process and gives an opportunity to add models to an ensemble with significantly reduced training time using the shift vector. We compare different strategies by computation time, the accuracy of an ensemble,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Neural Networks and Applications
