Multi-scale Convolution Aggregation and Stochastic Feature Reuse for   DenseNets

Mingjie Wang; Jun Zhou; Wendong Mao; Minglun Gong

arXiv:1810.01373·cs.LG·October 4, 2018

Multi-scale Convolution Aggregation and Stochastic Feature Reuse for DenseNets

Mingjie Wang, Jun Zhou, Wendong Mao, Minglun Gong

PDF

Open Access

TL;DR

This paper introduces a multi-scale convolution aggregation module and a stochastic feature reuse regularization to enhance DenseNets, improving accuracy and reducing overfitting with fewer parameters across multiple vision benchmarks.

Contribution

The paper proposes a novel multi-scale convolution aggregation module and a stochastic feature reuse regularization for DenseNets, leading to better performance and efficiency.

Findings

01

Improved accuracy on CIFAR-10, CIFAR-100, and SVHN benchmarks.

02

Reduced overfitting through stochastic feature reuse.

03

Achieved higher efficiency with fewer parameters.

Abstract

Recently, Convolution Neural Networks (CNNs) obtained huge success in numerous vision tasks. In particular, DenseNets have demonstrated that feature reuse via dense skip connections can effectively alleviate the difficulty of training very deep networks and that reusing features generated by the initial layers in all subsequent layers has strong impact on performance. To feed even richer information into the network, a novel adaptive Multi-scale Convolution Aggregation module is presented in this paper. Composed of layers for multi-scale convolutions, trainable cross-scale aggregation, maxout, and concatenation, this module is highly non-linear and can boost the accuracy of DenseNet while using much fewer parameters. In addition, due to high model complexity, the network with extremely dense feature reuse is prone to overfitting. To address this problem, a regularization method named…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsConvolution