An Intriguing Failing of Convolutional Neural Networks and the CoordConv Solution
Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric, Frank, Alex Sergeev, Jason Yosinski

TL;DR
This paper identifies a fundamental failure of convolutional neural networks in coordinate transformation tasks and introduces CoordConv, a simple modification that enables CNNs to learn coordinate mappings efficiently and accurately.
Contribution
The paper presents CoordConv, a novel layer that incorporates coordinate information into CNNs, significantly improving their ability to learn coordinate transforms with fewer parameters and faster training.
Findings
CoordConv achieves perfect generalization on coordinate transform tasks.
Using CoordConv reduces training time by 150 times compared to traditional convolution.
CoordConv improves performance across diverse tasks, including GANs, object detection, and reinforcement learning.
Abstract
Few ideas have enjoyed as large an impact on deep learning as convolution. For any problem involving pixels or spatial representations, common intuition holds that convolutional neural networks may be appropriate. In this paper we show a striking counterexample to this intuition via the seemingly trivial coordinate transform problem, which simply requires learning a mapping between coordinates in (x,y) Cartesian space and one-hot pixel space. Although convolutional networks would seem appropriate for this task, we show that they fail spectacularly. We demonstrate and carefully analyze the failure first on a toy problem, at which point a simple fix becomes obvious. We call this solution CoordConv, which works by giving convolution access to its own input coordinates through the use of extra coordinate channels. Without sacrificing the computational and parametric efficiency of ordinary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis
MethodsPrioritized Experience Replay · Ape-X · USD Coin Customer Service Number +1-833-534-1729 · HuMan(Expedia)||How do I get a human at Expedia? · Deep Convolutional GAN · Average Pooling · Step Decay · Adam · Weight Decay · A2C
