Group Crosscoders for Mechanistic Analysis of Symmetry

Liv Gorton

arXiv:2410.24184 · cs.LG · November 4, 2024

TL;DR

This paper introduces group crosscoders, a novel method for automatically discovering and analyzing symmetrical features in neural networks, enhancing mechanistic interpretability of emergent symmetries.

Contribution

The paper presents group crosscoders, an automated approach for identifying and analysing symmetries in neural network features. It replaces manual analysis and separates feature families more precisely than standard sparse autoencoders.

Findings

01 Clusters features into interpretable families

02 Reveals distinct symmetry patterns for different geometric features

03 Provides systematic insights into neural network symmetry representations

Abstract

We introduce group crosscoders, an extension of crosscoders that systematically discover and analyse symmetrical features in neural networks. While neural networks often develop equivariant representations without explicit architectural constraints, understanding these emergent symmetries has traditionally relied on manual analysis. Group crosscoders automate this process by performing dictionary learning across transformed versions of inputs under a symmetry group. Applied to InceptionV1's mixed3b layer using the dihedral group $D_{32}$, our method reveals several key insights: First, it naturally clusters features into interpretable families that correspond to previously hypothesised feature types, providing more precise separation than standard sparse autoencoders. Second, our transform block analysis enables the automatic characterisation of feature symmetries, revealing…
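
The idea of dictionary learning across transformed versions of an input can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a small dihedral group of 8 elements (rotations by multiples of 90 degrees, with and without a flip) in place of $D_{32}$, a hypothetical linear-plus-ReLU `toy_activations` function in place of InceptionV1's mixed3b layer, and random, untrained dictionary weights. Encoding from the concatenated activations is also an assumption, since the abstract does not specify the encoder's input. All function and variable names are illustrative.

```python
import numpy as np


def dihedral_orbit(image):
    """Return 8 copies of a square image: 4 rotations, then their mirror images."""
    rotations = [np.rot90(image, k) for k in range(4)]
    reflections = [np.fliplr(r) for r in rotations]
    return rotations + reflections


def toy_activations(image, weights):
    """Hypothetical stand-in for a network layer: linear map followed by ReLU."""
    return np.maximum(weights @ image.ravel(), 0.0)


def crosscoder_forward(stacked, w_enc, b_enc, w_dec, b_dec):
    """Encode stacked activations into one shared sparse code, then decode every block."""
    features = np.maximum(stacked @ w_enc + b_enc, 0.0)
    reconstruction = features @ w_dec + b_dec
    return features, reconstruction


rng = np.random.default_rng(0)
image = rng.normal(size=(6, 6))
d_act, n_features = 5, 16
layer = rng.normal(size=(d_act, image.size))

# One activation vector per group element, concatenated into a single input.
orbit = dihedral_orbit(image)
stacked = np.concatenate([toy_activations(t, layer) for t in orbit])

# Random (untrained) dictionary weights, for shape illustration only.
w_enc = rng.normal(size=(stacked.size, n_features)) * 0.1
w_dec = rng.normal(size=(n_features, stacked.size)) * 0.1
features, recon = crosscoder_forward(
    stacked, w_enc, np.zeros(n_features), w_dec, np.zeros(stacked.size)
)

# Typical dictionary-learning objective: reconstruction error plus an L1 sparsity penalty.
loss = np.mean((recon - stacked) ** 2) + 1e-3 * np.sum(np.abs(features))

# Each feature's decoder row splits into one block per group element.
blocks = w_dec.reshape(n_features, len(orbit), d_act)
```

In this sketch, `blocks[j, g]` is the decoder direction that feature `j` writes for transform `g`. Comparing those blocks across `g` is one way to read the "transform block analysis" mentioned in the abstract, though the paper's exact procedure may differ.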
