Linking in Style: Understanding learned features in deep learning models
Maren H. Wehrheim, Pamela Osuna-Vargas, Matthias Kaschube

TL;DR
This paper introduces a novel, efficient method to visualize and analyze learned features in CNNs by linking them to a generative model's latent space, enabling better interpretability and understanding of model representations.
Contribution
We propose a linking network that maps CNN features to StyleGAN-XL's latent space, providing a human-friendly visualization and systematic analysis of learned features in deep models.
Findings
Semantic order is consistent between CNN features and StyleGAN-XL latent space.
The linking network is computationally inexpensive and decoupled from training the GAN and classifier.
Our pipeline enables systematic analysis of thousands of units and their semantic selectivity.
Abstract
Convolutional neural networks (CNNs) learn abstract features to perform object classification, but understanding these features remains challenging due to difficult-to-interpret results or high computational costs. We propose an automatic method to visualize and systematically analyze learned features in CNNs. Specifically, we introduce a linking network that maps the penultimate layer of a pre-trained classifier to the latent space of a generative model (StyleGAN-XL), thereby enabling an interpretable, human-friendly visualization of the classifier's representations. Our findings indicate a congruent semantic order in both spaces, enabling a direct linear mapping between them. Training the linking network is computationally inexpensive and decoupled from training both the GAN and the classifier. We introduce an automatic pipeline that utilizes such GAN-based visualizations to quantify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
