An Intriguing Failing of Convolutional Neural Networks and the CoordConv   Solution

Rosanne Liu; Joel Lehman; Piero Molino; Felipe Petroski Such; Eric; Frank; Alex Sergeev; Jason Yosinski

arXiv:1807.03247·cs.CV·December 4, 2018·169 cites

An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution

Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric, Frank, Alex Sergeev, Jason Yosinski

PDF

Open Access 5 Repos

TL;DR

This paper identifies a fundamental failure of convolutional neural networks in coordinate transformation tasks and introduces CoordConv, a simple modification that enables CNNs to learn coordinate mappings efficiently and accurately.

Contribution

The paper presents CoordConv, a novel layer that incorporates coordinate information into CNNs, significantly improving their ability to learn coordinate transforms with fewer parameters and faster training.

Findings

01

CoordConv achieves perfect generalization on coordinate transform tasks.

02

Using CoordConv reduces training time by 150 times compared to traditional convolution.

03

CoordConv improves performance across diverse tasks, including GANs, object detection, and reinforcement learning.

Abstract

Few ideas have enjoyed as large an impact on deep learning as convolution. For any problem involving pixels or spatial representations, common intuition holds that convolutional neural networks may be appropriate. In this paper we show a striking counterexample to this intuition via the seemingly trivial coordinate transform problem, which simply requires learning a mapping between coordinates in (x,y) Cartesian space and one-hot pixel space. Although convolutional networks would seem appropriate for this task, we show that they fail spectacularly. We demonstrate and carefully analyze the failure first on a toy problem, at which point a simple fix becomes obvious. We call this solution CoordConv, which works by giving convolution access to its own input coordinates through the use of extra coordinate channels. Without sacrificing the computational and parametric efficiency of ordinary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis

MethodsPrioritized Experience Replay · Ape-X · USD Coin Customer Service Number +1-833-534-1729 · HuMan(Expedia)||How do I get a human at Expedia? · Deep Convolutional GAN · Average Pooling · Step Decay · Adam · Weight Decay · A2C