Global Pooling, More than Meets the Eye: Position Information is Encoded   Channel-Wise in CNNs

Md Amirul Islam; Matthew Kowal; Sen Jia; Konstantinos G. Derpanis and; Neil D. B. Bruce

arXiv:2108.07884·cs.CV·August 19, 2021

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs

Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis and, Neil D. B. Bruce

PDF

Open Access 1 Repo

TL;DR

This paper reveals that global pooling in CNNs encodes positional information in channel order, impacting translation invariance and enabling targeted attacks, thus deepening understanding of CNN internal representations.

Contribution

It demonstrates that spatial information persists in global pooled features and introduces methods to analyze and manipulate position encoding in CNNs.

Findings

01

Positional information is encoded in channel order after global pooling.

02

Semantic information is largely unaffected by spatial dimension collapsing.

03

Region-specific attacks can degrade CNN performance in targeted input areas.

Abstract

In this paper, we challenge the common assumption that collapsing the spatial dimensions of a 3D (spatial-channel) tensor in a convolutional neural network (CNN) into a vector via global pooling removes all spatial information. Specifically, we demonstrate that positional information is encoded based on the ordering of the channel dimensions, while semantic information is largely not. Following this demonstration, we show the real world impact of these findings by applying them to two applications. First, we propose a simple yet effective data augmentation strategy and loss function which improves the translation invariance of a CNN's output. Second, we propose a method to efficiently determine which channels in the latent representation are responsible for (i) encoding overall position information or (ii) region-specific positions. We first show that semantic segmentation has a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

islamamirul/permutenet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition