Towards interpreting computer vision based on transformation invariant   optimization

Chen Li; Jinzhe Jiang; Xin Zhang; Tonghuan Zhang; Yaqian Zhao,; Dongdong Jiang; RenGang Li

arXiv:2106.09982·cs.CV·June 21, 2021

Towards interpreting computer vision based on transformation invariant optimization

Chen Li, Jinzhe Jiang, Xin Zhang, Tonghuan Zhang, Yaqian Zhao,, Dongdong Jiang, RenGang Li

PDF

TL;DR

This paper introduces a transformation-invariant visualization method for deep neural networks in computer vision, enhancing interpretability by incorporating rotation and scaling invariance into generated images that activate specific network predictions.

Contribution

It proposes a novel back-propagation based visualization technique that integrates transformation invariance, improving the clarity and insightfulness of neural network interpretation.

Findings

01

Transformation invariance improves visualization quality.

02

Generated images better activate target classes.

03

Method provides new insights into neural network behavior.

Abstract

Interpreting how does deep neural networks (DNNs) make predictions is a vital field in artificial intelligence, which hinders wide applications of DNNs. Visualization of learned representations helps we humans understand the vision of DNNs. In this work, visualized images that can activate the neural network to the target classes are generated by back-propagation method. Here, rotation and scaling operations are applied to introduce the transformation invariance in the image generating process, which we find a significant improvement on visualization effect. Finally, we show some cases that such method can help us to gain insight into neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.