Collaborative Learning for Deep Neural Networks

Guocong Song; Wei Chai

arXiv:1805.11761·stat.ML·November 8, 2018·102 cites

Collaborative Learning for Deep Neural Networks

Guocong Song, Wei Chai

PDF

Open Access

TL;DR

This paper proposes collaborative learning with multiple classifier heads trained simultaneously to enhance neural network generalization and robustness without additional inference costs, leveraging consensus and shared representations.

Contribution

It introduces a novel collaborative learning framework that combines multi-head training, consensus regularization, and intermediate-level representation sharing for improved performance.

Findings

01

Reduces generalization error on CIFAR and ImageNet

02

Increases robustness to label noise

03

Decreases training computational complexity

Abstract

We introduce collaborative learning in which multiple classifier heads of the same network are simultaneously trained on the same training data to improve generalization and robustness to label noise with no extra inference cost. It acquires the strengths from auxiliary training, multi-task learning and knowledge distillation. There are two important mechanisms involved in collaborative learning. First, the consensus of multiple views from different classifier heads on the same example provides supplementary information as well as regularization to each classifier, thereby improving generalization. Second, intermediate-level representation (ILR) sharing with backpropagation rescaling aggregates the gradient flows from all heads, which not only reduces training computational complexity, but also facilitates supervision to the shared layers. The empirical results on CIFAR and ImageNet…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Adversarial Robustness in Machine Learning