The effect of task and training on intermediate representations in   convolutional neural networks revealed with modified RV similarity analysis

Jessica A.F. Thompson; Yoshua Bengio; Marc Schoenwiesner

arXiv:1912.02260·cs.LG·December 6, 2019

The effect of task and training on intermediate representations in convolutional neural networks revealed with modified RV similarity analysis

Jessica A.F. Thompson, Yoshua Bengio, Marc Schoenwiesner

PDF

TL;DR

This paper compares neural network layer representations using a modified RV-coefficient, revealing how different training methods influence internal representations and highlighting the importance of the penultimate layer.

Contribution

It introduces the use of RV2 as a less dataset-sensitive similarity metric and demonstrates its effectiveness in analyzing training effects on network representations.

Findings

01

RV2 recovers expected similarity patterns

02

Freeze training affects penultimate layer representations

03

Inputs and targets anchor representations in lowest and highest layers

Abstract

Centered Kernel Alignment (CKA) was recently proposed as a similarity metric for comparing activation patterns in deep networks. Here we experiment with the modified RV-coefficient (RV2), which has very similar properties as CKA while being less sensitive to dataset size. We compare the representations of networks that received varying amounts of training on different layers: a standard trained network (all parameters updated at every step), a freeze trained network (layers gradually frozen during training), random networks (only some layers trained), and a completely untrained network. We found that RV2 was able to recover expected similarity patterns and provide interpretable similarity matrices that suggested hypotheses about how representations are affected by different training recipes. We propose that the superior performance achieved by freeze training can be attributed to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.