What Do Deep CNNs Learn About Objects?

Xingchao Peng; Baochen Sun; Karim Ali; and Kate Saenko

arXiv:1504.02485·cs.CV·April 13, 2015·ICLR·2 cites

What Do Deep CNNs Learn About Objects?

Xingchao Peng, Baochen Sun, Karim Ali, and Kate Saenko

PDF

Open Access

TL;DR

This paper investigates the invariance of deep CNNs to variations in object class caused by 3D shape, pose, and photorealism, aiming to understand what features these networks learn about objects.

Contribution

The study provides a detailed analysis of CNN invariance to 3D object variations, revealing insights into the learned representations beyond 2D transformations.

Findings

01

CNNs show limited invariance to 3D shape and pose changes.

02

Photorealism affects CNN recognition performance.

03

Deeper layers encode more object-specific features.

Abstract

Deep convolutional neural networks learn extremely powerful image representations, yet most of that power is hidden in the millions of deep-layer parameters. What exactly do these parameters represent? Recent work has started to analyse CNN representations, finding that, e.g., they are invariant to some 2D transformations Fischer et al. (2014), but are confused by particular types of image noise Nguyen et al. (2014). In this work, we delve deeper and ask: how invariant are CNNs to object-class variations caused by 3D shape, pose, and photorealism?

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis