Intra-Model Collaborative Learning of Neural Networks

Shijie Fang; Tong Lin

arXiv:2105.09590·cs.CV·May 21, 2021

Intra-Model Collaborative Learning of Neural Networks

Shijie Fang, Tong Lin

PDF

Open Access

TL;DR

This paper introduces intra-model collaborative learning methods that enable different parts of a single neural network to learn collaboratively, improving accuracy and robustness without additional modules or memory overhead.

Contribution

It proposes four novel intra-model collaborative learning strategies that enhance neural network training efficiency, accuracy, and robustness, reducing memory requirements compared to multi-head approaches.

Findings

01

Significant error reduction on STL-10 dataset (up to 9.28%)

02

Improved robustness to label noise with 3.53% higher performance at 50% noise

03

Effective across multiple datasets including CIFAR and ImageNet32

Abstract

Recently, collaborative learning proposed by Song and Chai has achieved remarkable improvements in image classification tasks by simultaneously training multiple classifier heads. However, huge memory footprints required by such multi-head structures may hinder the training of large-capacity baseline models. The natural question is how to achieve collaborative learning within a single network without duplicating any modules. In this paper, we propose four ways of collaborative learning among different parts of a single network with negligible engineering efforts. To improve the robustness of the network, we leverage the consistency of the output layer and intermediate layers for training under the collaborative learning framework. Besides, the similarity of intermediate representation and convolution kernel is also introduced to reduce the reduce redundant in a neural network. Compared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsConvolution