Representation Understanding via Activation Maximization

Hongbo Zhu; Angelo Cangelosi

arXiv:2508.07281·cs.CV·August 12, 2025

Representation Understanding via Activation Maximization

Hongbo Zhu, Angelo Cangelosi

PDF

Open Access

TL;DR

This paper introduces a unified framework for visualizing and understanding deep neural network features across CNNs and Vision Transformers, extending to intermediate layers and exploring adversarial example generation for interpretability.

Contribution

It proposes a comprehensive feature visualization method applicable to various network architectures and layers, enhancing interpretability and vulnerability analysis.

Findings

01

Effective visualization of intermediate and final layer features.

02

Demonstrated generalizability to CNNs and ViTs.

03

Revealed potential for adversarial example generation.

Abstract

Understanding internal feature representations of deep neural networks (DNNs) is a fundamental step toward model interpretability. Inspired by neuroscience methods that probe biological neurons using visual stimuli, recent deep learning studies have employed Activation Maximization (AM) to synthesize inputs that elicit strong responses from artificial neurons. In this work, we propose a unified feature visualization framework applicable to both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). Unlike prior efforts that predominantly focus on the last output-layer neurons in CNNs, we extend feature visualization to intermediate layers as well, offering deeper insights into the hierarchical structure of learned feature representations. Furthermore, we investigate how activation maximization can be leveraged to generate adversarial examples, revealing potential…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis