Insights on representational similarity in neural networks with   canonical correlation

Ari S. Morcos; Maithra Raghu; and Samy Bengio

arXiv:1806.05759·stat.ML·October 25, 2018·58 cites

Insights on representational similarity in neural networks with canonical correlation

Ari S. Morcos, Maithra Raghu, and Samy Bengio

PDF

Open Access 2 Repos

TL;DR

This paper introduces an improved projection weighted CCA method to compare neural network representations, revealing how network architecture, training dynamics, and task type influence representational similarity in CNNs and RNNs.

Contribution

The paper develops an enhanced CCA technique for neural representation comparison and applies it to analyze CNNs and RNNs, providing new insights into their training and functional dynamics.

Findings

01

Networks that generalize have more similar representations than those that memorize.

02

Wider networks converge to more similar solutions than narrower ones.

03

RNNs show bottom-up convergence during training and high variability in hidden states.

Abstract

Comparing different neural network representations and determining how representations evolve over time remain challenging open questions in our understanding of the function of neural networks. Comparing representations in neural networks is fundamentally difficult as the structure of representations varies greatly, even across groups of networks trained on identical tasks, and over the course of training. Here, we develop projection weighted CCA (Canonical Correlation Analysis) as a tool for understanding neural networks, building off of SVCCA, a recently proposed method (Raghu et al., 2017). We first improve the core method, showing how to differentiate between signal and noise, and then apply this technique to compare across a group of CNNs, demonstrating that networks which generalize converge to more similar representations than networks which memorize, that wider networks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science