Representation Understanding via Activation Maximization
Hongbo Zhu, Angelo Cangelosi

TL;DR
This paper introduces a unified framework for visualizing and understanding deep neural network features across CNNs and Vision Transformers, extending to intermediate layers and exploring adversarial example generation for interpretability.
Contribution
It proposes a comprehensive feature visualization method applicable to various network architectures and layers, enhancing interpretability and vulnerability analysis.
Findings
Effective visualization of intermediate and final layer features.
Demonstrated generalizability to CNNs and ViTs.
Revealed potential for adversarial example generation.
Abstract
Understanding internal feature representations of deep neural networks (DNNs) is a fundamental step toward model interpretability. Inspired by neuroscience methods that probe biological neurons using visual stimuli, recent deep learning studies have employed Activation Maximization (AM) to synthesize inputs that elicit strong responses from artificial neurons. In this work, we propose a unified feature visualization framework applicable to both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). Unlike prior efforts that predominantly focus on the last output-layer neurons in CNNs, we extend feature visualization to intermediate layers as well, offering deeper insights into the hierarchical structure of learned feature representations. Furthermore, we investigate how activation maximization can be leveraged to generate adversarial examples, revealing potential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
