Collaborative Learning for Deep Neural Networks
Guocong Song, Wei Chai

TL;DR
This paper proposes collaborative learning with multiple classifier heads trained simultaneously to enhance neural network generalization and robustness without additional inference costs, leveraging consensus and shared representations.
Contribution
It introduces a novel collaborative learning framework that combines multi-head training, consensus regularization, and intermediate-level representation sharing for improved performance.
Findings
Reduces generalization error on CIFAR and ImageNet
Increases robustness to label noise
Decreases training computational complexity
Abstract
We introduce collaborative learning in which multiple classifier heads of the same network are simultaneously trained on the same training data to improve generalization and robustness to label noise with no extra inference cost. It acquires the strengths from auxiliary training, multi-task learning and knowledge distillation. There are two important mechanisms involved in collaborative learning. First, the consensus of multiple views from different classifier heads on the same example provides supplementary information as well as regularization to each classifier, thereby improving generalization. Second, intermediate-level representation (ILR) sharing with backpropagation rescaling aggregates the gradient flows from all heads, which not only reduces training computational complexity, but also facilitates supervision to the shared layers. The empirical results on CIFAR and ImageNet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning
