Training Deeper Convolutional Networks with Deep Supervision

Liwei Wang; Chen-Yu Lee; Zhuowen Tu; Svetlana Lazebnik

arXiv:1505.02496 · cs.CV · May 12, 2015 · 167 cites

TL;DR

This paper trains deeper convolutional networks by adding auxiliary supervision branches after certain intermediate layers during training, which makes training easier and improves classification accuracy on ImageNet and MIT Places.

Contribution

The paper proposes a practical way to train deeper CNNs: add auxiliary supervision branches during training, placed according to a simple rule of thumb, so that optimization is easier and classification results are better.

Findings

01. Deep supervision makes deeper CNNs much easier to train.

02. Classification results improve on ImageNet and the larger MIT Places dataset.

03. Auxiliary supervision branches at intermediate layers are an effective aid when training deeper models.

Abstract

One of the most promising ways of improving the performance of deep convolutional neural networks is by increasing the number of convolutional layers. However, adding layers makes training more difficult and computationally expensive. In order to train deeper networks, we propose to add auxiliary supervision branches after certain intermediate layers during training. We formulate a simple rule of thumb to determine where these branches should be added. The resulting deeply supervised structure makes the training much easier and also produces better classification results on ImageNet and the recently released, larger MIT Places dataset.
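To make the idea of auxiliary supervision concrete, here is a minimal sketch of how a training objective might combine the main classifier loss with losses from auxiliary branches. It is not the authors' implementation: the function names, the example logits, and the `aux_weight` value of 0.3 are assumptions chosen for illustration, and the abstract does not specify how the branch losses are weighted.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy of softmax(logits) against an integer class label."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def deeply_supervised_loss(main_logits, aux_logits_list, label, aux_weight=0.3):
    """Main loss plus weighted losses from auxiliary branches.

    aux_weight is a hypothetical hyperparameter, not a value from the paper.
    """
    main = softmax_cross_entropy(main_logits, label)
    aux = sum(softmax_cross_entropy(a, label) for a in aux_logits_list)
    return main + aux_weight * aux


# Made-up logits for a 3-class problem, with one auxiliary branch.
main_logits = [2.0, 0.5, -1.0]
aux_logits = [[1.0, 0.8, 0.1]]

# During training, the auxiliary branch contributes to the objective.
train_loss = deeply_supervised_loss(main_logits, aux_logits, label=0)

# With no auxiliary branches, only the main loss remains.
main_only_loss = deeply_supervised_loss(main_logits, [], label=0)
```

Because the abstract describes the branches as being added during training, the sketch treats them as optional: passing an empty list recovers the ordinary single-output loss.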

Code & Models

Repositories

LCWdmlearning/Improve-nsfw
tf

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques