Model Composition: Can Multiple Neural Networks Be Combined into a Single Network Using Only Unlabeled Data?
Amin Banitalebi-Dehkordi, Xinyu Kang, and Yong Zhang

TL;DR
This paper proposes a method to combine multiple neural networks into a single model using only unlabeled data, improving efficiency and performance without relying on ground-truth labels.
Contribution
It introduces a novel approach for model combination via pseudo-label generation, filtering, and aggregation, supporting arbitrary models and architectures.
Findings
Effective model combination demonstrated on object detection tasks.
Achieved comparable performance to supervised training without labels.
Significant mAP improvements in semi-supervised fine-tuning.
Abstract
The diversity of deep learning applications, datasets, and neural network architectures necessitates a careful selection of the architecture and data that match best to a target application. As an attempt to mitigate this dilemma, this paper investigates the idea of combining multiple trained neural networks using unlabeled data. In addition, combining multiple models into one can speed up the inference, result in stronger, more capable models, and allows us to select efficient device-friendly target network architectures. To this end, the proposed method makes use of generation, filtering, and aggregation of reliable pseudo-labels collected from unlabeled data. Our method supports using an arbitrary number of input models with arbitrary architectures and categories. Extensive performance evaluations demonstrated that our method is very effective. For example, for the task of object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
