Towards a New Interpretation of Separable Convolutions

Tapabrata Ghosh

arXiv:1701.04489·cs.LG·January 18, 2017

Towards a New Interpretation of Separable Convolutions

Tapabrata Ghosh

PDF

TL;DR

This paper proposes a hybrid interpretative model to better understand why separable convolutions are effective in deep neural networks, addressing gaps in existing explanations.

Contribution

It introduces a new hybrid interpretation of separable convolutions, aiming to clarify their mechanism of action beyond current mathematical definitions.

Findings

01

Proposes a hybrid interpretation model for separable convolutions

02

Addresses limitations of previous explanations like the Inception hypothesis

03

Provides a more comprehensive understanding of separable convolutions' efficacy

Abstract

In recent times, the use of separable convolutions in deep convolutional neural network architectures has been explored. Several researchers, most notably (Chollet, 2016) and (Ghosh, 2017) have used separable convolutions in their deep architectures and have demonstrated state of the art or close to state of the art performance. However, the underlying mechanism of action of separable convolutions are still not fully understood. Although their mathematical definition is well understood as a depthwise convolution followed by a pointwise convolution, deeper interpretations such as the extreme Inception hypothesis (Chollet, 2016) have failed to provide a thorough explanation of their efficacy. In this paper, we propose a hybrid interpretation that we believe is a better model for explaining the efficacy of separable convolutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDepthwise Convolution · Convolution