Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders

Matthew Lyle Olson; Musashi Hinck; Neale Ratzlaff; Changbai Li; Phillip Howard; Vasudev Lal; Shao-Yen Tseng

arXiv:2505.15970·cs.CV·May 23, 2025

Analyzing Hierarchical Structure in Vision Models with Sparse Autoencoders

Matthew Lyle Olson, Musashi Hinck, Neale Ratzlaff, Changbai Li, Phillip Howard, Vasudev Lal, Shao-Yen Tseng

PDF

Open Access

TL;DR

This paper uses Sparse Autoencoders to analyze how deep vision models encode the hierarchical structure of ImageNet categories, revealing their implicit understanding of taxonomic relationships across layers.

Contribution

It extends the use of Sparse Autoencoders from language models to vision models, providing a systematic framework for analyzing hierarchical representations in deep vision networks.

Findings

01

SAEs uncover hierarchical relationships in model activations

02

Representations align with ImageNet taxonomy across layers

03

Deeper layers encode more semantic information

Abstract

The ImageNet hierarchy provides a structured taxonomy of object categories, offering a valuable lens through which to analyze the representations learned by deep vision models. In this work, we conduct a comprehensive analysis of how vision models encode the ImageNet hierarchy, leveraging Sparse Autoencoders (SAEs) to probe their internal representations. SAEs have been widely used as an explanation tool for large language models (LLMs), where they enable the discovery of semantically meaningful features. Here, we extend their use to vision models to investigate whether learned representations align with the ontological structure defined by the ImageNet taxonomy. Our results show that SAEs uncover hierarchical relationships in model activations, revealing an implicit encoding of taxonomic structure. We analyze the consistency of these representations across different layers of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Advanced Vision and Imaging · Image Processing and 3D Reconstruction

MethodsALIGN