Deeply-Supervised Nets

Chen-Yu Lee; Saining Xie; Patrick Gallagher; Zhengyou Zhang; Zhuowen Tu

arXiv:1409.5185·stat.ML·April 26, 2017·1.0k cites

TL;DR

Deeply-supervised nets (DSN) enhance deep network training by adding auxiliary objectives to intermediate layers, improving transparency, feature robustness, and overall classification performance, with significant gains demonstrated on benchmark datasets.

Contribution

Introduces the deeply-supervised nets (DSN) framework, which attaches companion objectives to hidden layers to improve training stability and feature discriminativeness, and reports state-of-the-art results on MNIST, CIFAR-10, CIFAR-100, and SVHN.

Findings

01 Significant performance improvements on MNIST, CIFAR-10, CIFAR-100, and SVHN.

02 Enhanced transparency and robustness of learned features.

03 Better training convergence due to auxiliary supervision.

Abstract

Our proposed deeply-supervised nets (DSN) method simultaneously minimizes classification error while making the learning process of hidden layers direct and transparent. We make an attempt to boost the classification performance by studying a new formulation in deep networks. We look at three aspects of convolutional neural network (CNN) style architectures: (1) transparency of the intermediate layers to the overall classification; (2) discriminativeness and robustness of learned features, especially in the early layers; (3) effectiveness of training in the presence of exploding and vanishing gradients. We introduce a "companion objective" to the individual hidden layers, in addition to the overall objective at the output layer (a different strategy from layer-wise pre-training). We extend techniques from stochastic gradient methods to analyze our algorithm. The advantage…
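As a rough illustration of the companion-objective idea described in the abstract, the sketch below combines an output-layer loss with weighted losses computed at hidden layers. It is not the authors' formulation: the use of softmax cross-entropy, the plain weighted sum, and the names cross_entropy, deeply_supervised_loss, hidden_logits, and alphas are all assumptions made for this example. It also assumes each supervised hidden layer has its own auxiliary classifier that already produced a logit vector.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example, computed stably."""
    m = max(logits)  # subtract the max before exponentiating to avoid overflow
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def deeply_supervised_loss(output_logits, hidden_logits, label, alphas):
    """Output-layer loss plus weighted companion losses (illustrative only).

    output_logits: logits from the final classifier.
    hidden_logits: one logit vector per supervised hidden layer, each assumed
        to come from a hypothetical auxiliary classifier on that layer.
    alphas: hypothetical per-layer weights for the companion terms.
    """
    if len(hidden_logits) != len(alphas):
        raise ValueError("need exactly one weight per supervised hidden layer")
    overall = cross_entropy(output_logits, label)
    companion = sum(
        a * cross_entropy(h, label) for a, h in zip(alphas, hidden_logits)
    )
    return overall + companion


if __name__ == "__main__":
    # Two classes, two supervised hidden layers, true label 0.
    loss = deeply_supervised_loss(
        output_logits=[2.0, 0.5],
        hidden_logits=[[0.3, 0.1], [1.0, 0.2]],
        label=0,
        alphas=[0.5, 0.25],
    )
    print(f"combined loss: {loss:.4f}")
```

Setting every weight in alphas to zero recovers the ordinary output-only objective, which makes the companion terms easy to switch off for comparison.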

Code & Models

Repositories

ellisdg/3DUnetCNN
pytorch

Taxonomy

Topics: Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning