Transfer learning for ensembles: reducing computation time and keeping   the diversity

Ilya Shashkov; Nikita Balabin; Evgeny Burnaev; Alexey Zaytsev

arXiv:2206.13116·cs.LG·June 28, 2022

Transfer learning for ensembles: reducing computation time and keeping the diversity

Ilya Shashkov, Nikita Balabin, Evgeny Burnaev, Alexey Zaytsev

PDF

Open Access

TL;DR

This paper introduces a transfer learning method for ensembles that reduces training time and maintains model diversity by shifting encoder weights and fine-tuning, achieving competitive accuracy with less computation.

Contribution

The proposed approach efficiently transfers ensemble models by weight shifting and minimal fine-tuning, preserving diversity and reducing computational costs.

Findings

01

Speed-up in ensemble training process

02

Maintains high ensemble diversity

03

Achieves competitive accuracy with reduced computation

Abstract

Transferring a deep neural network trained on one problem to another requires only a small amount of data and little additional computation time. The same behaviour holds for ensembles of deep learning models typically superior to a single model. However, a transfer of deep neural networks ensemble demands relatively high computational expenses. The probability of overfitting also increases. Our approach for the transfer learning of ensembles consists of two steps: (a) shifting weights of encoders of all models in the ensemble by a single shift vector and (b) doing a tiny fine-tuning for each individual model afterwards. This strategy leads to a speed-up of the training process and gives an opportunity to add models to an ensemble with significantly reduced training time using the shift vector. We compare different strategies by computation time, the accuracy of an ensemble,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Neural Networks and Applications