Intra-Model Collaborative Learning of Neural Networks
Shijie Fang, Tong Lin

TL;DR
This paper introduces intra-model collaborative learning methods that enable different parts of a single neural network to learn collaboratively, improving accuracy and robustness without additional modules or memory overhead.
Contribution
It proposes four novel intra-model collaborative learning strategies that enhance neural network training efficiency, accuracy, and robustness, reducing memory requirements compared to multi-head approaches.
Findings
Significant error reduction on STL-10 dataset (up to 9.28%)
Improved robustness to label noise with 3.53% higher performance at 50% noise
Effective across multiple datasets including CIFAR and ImageNet32
Abstract
Recently, collaborative learning proposed by Song and Chai has achieved remarkable improvements in image classification tasks by simultaneously training multiple classifier heads. However, huge memory footprints required by such multi-head structures may hinder the training of large-capacity baseline models. The natural question is how to achieve collaborative learning within a single network without duplicating any modules. In this paper, we propose four ways of collaborative learning among different parts of a single network with negligible engineering efforts. To improve the robustness of the network, we leverage the consistency of the output layer and intermediate layers for training under the collaborative learning framework. Besides, the similarity of intermediate representation and convolution kernel is also introduced to reduce the reduce redundant in a neural network. Compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
MethodsConvolution
