SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning   Dynamics and Interpretability

Maithra Raghu; Justin Gilmer; Jason Yosinski; Jascha Sohl-Dickstein

arXiv:1706.05806·stat.ML·November 9, 2017·227 cites

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability

Maithra Raghu, Justin Gilmer, Jason Yosinski, Jascha Sohl-Dickstein

PDF

Open Access 3 Repos

TL;DR

SVCCA is a novel technique for efficiently comparing neural network representations, invariant to affine transformations, enabling insights into layer dimensionality, learning dynamics, and class-specific information.

Contribution

Introduces SVCCA, a fast, affine-invariant method for comparing neural network representations, facilitating analysis of training dynamics and network interpretability.

Findings

01

Networks can be over-parameterized in some layers.

02

Networks converge from bottom to top during training.

03

Class-specific information is localized within certain network layers.

Abstract

We propose a new technique, Singular Vector Canonical Correlation Analysis (SVCCA), a tool for quickly comparing two representations in a way that is both invariant to affine transform (allowing comparison between different layers and networks) and fast to compute (allowing more comparisons to be calculated than with previous methods). We deploy this tool to measure the intrinsic dimensionality of layers, showing in some cases needless over-parameterization; to probe learning dynamics throughout training, finding that networks converge to final representations from the bottom up; to show where class-specific information in networks is formed; and to suggest new training regimes that simultaneously save computation and overfit less. Code: https://github.com/google/svcca/

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications