Context Encoders: Feature Learning by Inpainting

Deepak Pathak; Philipp Krahenbuhl; Jeff Donahue; Trevor Darrell,; Alexei A. Efros

arXiv:1604.07379·cs.CV·November 22, 2016

Context Encoders: Feature Learning by Inpainting

Deepak Pathak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell,, Alexei A. Efros

PDF

5 Repos

TL;DR

This paper introduces Context Encoders, a convolutional neural network trained to predict missing parts of images based on surrounding context, enabling unsupervised feature learning and applications in inpainting and downstream vision tasks.

Contribution

The paper proposes a novel unsupervised learning method using context encoders with adversarial training to produce sharp, semantically meaningful image inpainting results.

Findings

01

Learned features improve CNN pre-training for classification, detection, and segmentation.

02

Adversarial loss yields sharper and more plausible inpainting results.

03

Context encoders capture both appearance and semantic information of visual structures.

Abstract

We present an unsupervised visual feature learning algorithm driven by context-based pixel prediction. By analogy with auto-encoders, we propose Context Encoders -- a convolutional neural network trained to generate the contents of an arbitrary image region conditioned on its surroundings. In order to succeed at this task, context encoders need to both understand the content of the entire image, as well as produce a plausible hypothesis for the missing part(s). When training context encoders, we have experimented with both a standard pixel-wise reconstruction loss, as well as a reconstruction plus an adversarial loss. The latter produces much sharper results because it can better handle multiple modes in the output. We found that a context encoder learns a representation that captures not just appearance but also the semantics of visual structures. We quantitatively demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInpainting