Transferred Discrepancy: Quantifying the Difference Between Representations
Yunzhen Feng, Runtian Zhai, Di He, Liwei Wang, Bin Dong

TL;DR
This paper introduces the transferred discrepancy (TD), a new metric for quantifying differences between neural network representations based on their downstream-task performance, providing insights into model similarity and training strategies.
Contribution
The work proposes the transferred discrepancy (TD), a novel task-dependent metric for comparing neural representations, and demonstrates its effectiveness in analyzing model similarity and training methods.
Findings
TD correlates with downstream task performance.
Models trained with beneficial data augmentations have similar TD scores.
TD reveals differences in learned features across models and training strategies.
Abstract
Understanding what information neural networks capture is an essential problem in deep learning, and studying whether different models capture similar features is an initial step to achieve this goal. Previous works sought to define metrics over the feature matrices to measure the difference between two models. However, different metrics sometimes lead to contradictory conclusions, and there has been no consensus on which metric is suitable to use in practice. In this work, we propose a novel metric that goes beyond previous approaches. Recall that one of the most practical scenarios of using the learned representations is to apply them to downstream tasks. We argue that we should design the metric based on a similar principle. For that, we introduce the transferred discrepancy (TD), a new metric that defines the difference between two representations based on their downstream-task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection
