AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN   Layers Using Adjoint Operators

Qing Wan; Yoonsuck Choe

arXiv:2012.09020·cs.CV·March 31, 2021

AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN Layers Using Adjoint Operators

Qing Wan, Yoonsuck Choe

PDF

Open Access

TL;DR

This paper introduces AdjointBackMap, a novel method using adjoint operators to reconstruct the decision hypersurfaces of CNN units, providing insights into their decision boundaries and susceptibility to adversarial inputs.

Contribution

It presents a new approach to reconstruct CNN decision surfaces using adjoint operators, enhancing interpretability of CNN inner workings.

Findings

01

Reconstructed hypersurfaces closely match CNN unit outputs.

02

Decision surfaces are highly conditioned on input data.

03

Insights into adversarial input effectiveness.

Abstract

There are several effective methods in explaining the inner workings of convolutional neural networks (CNNs). However, in general, finding the inverse of the function performed by CNNs as a whole is an ill-posed problem. In this paper, we propose a method based on adjoint operators to reconstruct, given an arbitrary unit in the CNN (except for the first convolutional layer), its effective hypersurface in the input space that replicates that unit's decision surface conditioned on a particular input image. Our results show that the hypersurface reconstructed this way, when multiplied by the original input image, would give nearly the exact output value of that unit. We find that the CNN unit's decision surface is largely conditioned on the input, and this may explain why adversarial inputs can effectively deceive CNNs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications