AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN Layers Using Adjoint Operators
Qing Wan, Yoonsuck Choe

TL;DR
This paper introduces AdjointBackMap, a novel method using adjoint operators to reconstruct the decision hypersurfaces of CNN units, providing insights into their decision boundaries and susceptibility to adversarial inputs.
Contribution
It presents a new approach to reconstruct CNN decision surfaces using adjoint operators, enhancing interpretability of CNN inner workings.
Findings
Reconstructed hypersurfaces closely match CNN unit outputs.
Decision surfaces are highly conditioned on input data.
Insights into adversarial input effectiveness.
Abstract
There are several effective methods in explaining the inner workings of convolutional neural networks (CNNs). However, in general, finding the inverse of the function performed by CNNs as a whole is an ill-posed problem. In this paper, we propose a method based on adjoint operators to reconstruct, given an arbitrary unit in the CNN (except for the first convolutional layer), its effective hypersurface in the input space that replicates that unit's decision surface conditioned on a particular input image. Our results show that the hypersurface reconstructed this way, when multiplied by the original input image, would give nearly the exact output value of that unit. We find that the CNN unit's decision surface is largely conditioned on the input, and this may explain why adversarial inputs can effectively deceive CNNs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
