Receptive Field Refinement for Convolutional Neural Networks Reliably   Improves Predictive Performance

Mats L. Richter; Christopher Pal

arXiv:2211.14487·cs.CV·November 29, 2022·1 cites

Receptive Field Refinement for Convolutional Neural Networks Reliably Improves Predictive Performance

Mats L. Richter, Christopher Pal

PDF

Open Access

TL;DR

This paper introduces a new receptive field analysis method for CNNs that predicts unproductive layers, enabling automated architecture optimization and leading to significant performance improvements across multiple state-of-the-art models on ImageNet1K.

Contribution

The paper formalizes receptive field expansion analysis, allowing automated identification of unproductive layers to optimize CNN architectures for better efficiency and accuracy.

Findings

01

Improved ImageNet1K performance across multiple CNN architectures.

02

Achieved new state-of-the-art results for each evaluated model class.

03

Method is simple, automated, and applicable to various architectures.

Abstract

Minimal changes to neural architectures (e.g. changing a single hyperparameter in a key layer), can lead to significant gains in predictive performance in Convolutional Neural Networks (CNNs). In this work, we present a new approach to receptive field analysis that can yield these types of theoretical and empirical performance gains across twenty well-known CNN architectures examined in our experiments. By further developing and formalizing the analysis of receptive field expansion in convolutional neural networks, we can predict unproductive layers in an automated manner before ever training a model. This allows us to optimize the parameter-efficiency of a given architecture at low cost. Our method is computationally simple and can be done in an automated manner or even manually with minimal effort for most common architectures. We demonstrate the effectiveness of this approach by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications

MethodsConvNeXt · Depthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Squeeze-and-Excitation Block · ReLU6 · Sigmoid Activation · Inverted Residual Block · RMSProp · Global Average Pooling