Multi-scale Convolution Aggregation and Stochastic Feature Reuse for DenseNets
Mingjie Wang, Jun Zhou, Wendong Mao, Minglun Gong

TL;DR
This paper introduces a multi-scale convolution aggregation module and a stochastic feature reuse regularization to enhance DenseNets, improving accuracy and reducing overfitting with fewer parameters across multiple vision benchmarks.
Contribution
The paper proposes a novel multi-scale convolution aggregation module and a stochastic feature reuse regularization for DenseNets, leading to better performance and efficiency.
Findings
Improved accuracy on CIFAR-10, CIFAR-100, and SVHN benchmarks.
Reduced overfitting through stochastic feature reuse.
Achieved higher efficiency with fewer parameters.
Abstract
Recently, Convolution Neural Networks (CNNs) obtained huge success in numerous vision tasks. In particular, DenseNets have demonstrated that feature reuse via dense skip connections can effectively alleviate the difficulty of training very deep networks and that reusing features generated by the initial layers in all subsequent layers has strong impact on performance. To feed even richer information into the network, a novel adaptive Multi-scale Convolution Aggregation module is presented in this paper. Composed of layers for multi-scale convolutions, trainable cross-scale aggregation, maxout, and concatenation, this module is highly non-linear and can boost the accuracy of DenseNet while using much fewer parameters. In addition, due to high model complexity, the network with extremely dense feature reuse is prone to overfitting. To address this problem, a regularization method named…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
MethodsConvolution
