Model Composition: Can Multiple Neural Networks Be Combined into a   Single Network Using Only Unlabeled Data?

Amin Banitalebi-Dehkordi; Xinyu Kang; and Yong Zhang

arXiv:2110.10369·cs.LG·October 22, 2021

Model Composition: Can Multiple Neural Networks Be Combined into a Single Network Using Only Unlabeled Data?

Amin Banitalebi-Dehkordi, Xinyu Kang, and Yong Zhang

PDF

Open Access 1 Repo

TL;DR

This paper proposes a method to combine multiple neural networks into a single model using only unlabeled data, improving efficiency and performance without relying on ground-truth labels.

Contribution

It introduces a novel approach for model combination via pseudo-label generation, filtering, and aggregation, supporting arbitrary models and architectures.

Findings

01

Effective model combination demonstrated on object detection tasks.

02

Achieved comparable performance to supervised training without labels.

03

Significant mAP improvements in semi-supervised fine-tuning.

Abstract

The diversity of deep learning applications, datasets, and neural network architectures necessitates a careful selection of the architecture and data that match best to a target application. As an attempt to mitigate this dilemma, this paper investigates the idea of combining multiple trained neural networks using unlabeled data. In addition, combining multiple models into one can speed up the inference, result in stronger, more capable models, and allows us to select efficient device-friendly target network architectures. To this end, the proposed method makes use of generation, filtering, and aggregation of reliable pseudo-labels collected from unlabeled data. Our method supports using an arbitrary number of input models with arbitrary architectures and categories. Extensive performance evaluations demonstrated that our method is very effective. For example, for the task of object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

abanitalebi/Model-Composition
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings