Understanding Deep Image Representations by Inverting Them

Aravindh Mahendran; Andrea Vedaldi

arXiv:1412.0035·cs.CV·December 2, 2014·85 cites

Understanding Deep Image Representations by Inverting Them

Aravindh Mahendran, Andrea Vedaldi

PDF

Open Access 5 Repos

TL;DR

This paper introduces a framework to invert various image representations, including CNNs, revealing how much visual information they retain and how invariant they are to geometric and photometric changes.

Contribution

It presents a novel, general method for inverting image representations, enabling analysis of the information preserved in CNNs and other features.

Findings

01

CNN layers retain photographically accurate information

02

Different CNN layers show varying degrees of invariance

03

The method outperforms previous approaches in inverting HOG and SIFT

Abstract

Image representations, from SIFT and Bag of Visual Words to Convolutional Neural Networks (CNNs), are a crucial component of almost any image understanding system. Nevertheless, our understanding of them remains limited. In this paper we conduct a direct analysis of the visual information contained in representations by asking the following question: given an encoding of an image, to which extent is it possible to reconstruct the image itself? To answer this question we contribute a general framework to invert representations. We show that this method can invert representations such as HOG and SIFT more accurately than recent alternatives while being applicable to CNNs too. We then use this technique to study the inverse of recent state-of-the-art CNN image representations for the first time. Among our findings, we show that several layers in CNNs retain photographically accurate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging