The effect of task and training on intermediate representations in convolutional neural networks revealed with modified RV similarity analysis
Jessica A.F. Thompson, Yoshua Bengio, Marc Schoenwiesner

TL;DR
This paper compares neural network layer representations using a modified RV-coefficient, revealing how different training methods influence internal representations and highlighting the importance of the penultimate layer.
Contribution
It introduces the use of RV2 as a less dataset-sensitive similarity metric and demonstrates its effectiveness in analyzing training effects on network representations.
Findings
RV2 recovers expected similarity patterns
Freeze training affects penultimate layer representations
Inputs and targets anchor representations in lowest and highest layers
Abstract
Centered Kernel Alignment (CKA) was recently proposed as a similarity metric for comparing activation patterns in deep networks. Here we experiment with the modified RV-coefficient (RV2), which has very similar properties as CKA while being less sensitive to dataset size. We compare the representations of networks that received varying amounts of training on different layers: a standard trained network (all parameters updated at every step), a freeze trained network (layers gradually frozen during training), random networks (only some layers trained), and a completely untrained network. We found that RV2 was able to recover expected similarity patterns and provide interpretable similarity matrices that suggested hypotheses about how representations are affected by different training recipes. We propose that the superior performance achieved by freeze training can be attributed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
