Transferred Discrepancy: Quantifying the Difference Between   Representations

Yunzhen Feng; Runtian Zhai; Di He; Liwei Wang; Bin Dong

arXiv:2007.12446·cs.LG·July 27, 2020·6 cites

Transferred Discrepancy: Quantifying the Difference Between Representations

Yunzhen Feng, Runtian Zhai, Di He, Liwei Wang, Bin Dong

PDF

Open Access

TL;DR

This paper introduces the transferred discrepancy (TD), a new metric for quantifying differences between neural network representations based on their downstream-task performance, providing insights into model similarity and training strategies.

Contribution

The work proposes the transferred discrepancy (TD), a novel task-dependent metric for comparing neural representations, and demonstrates its effectiveness in analyzing model similarity and training methods.

Findings

01

TD correlates with downstream task performance.

02

Models trained with beneficial data augmentations have similar TD scores.

03

TD reveals differences in learned features across models and training strategies.

Abstract

Understanding what information neural networks capture is an essential problem in deep learning, and studying whether different models capture similar features is an initial step to achieve this goal. Previous works sought to define metrics over the feature matrices to measure the difference between two models. However, different metrics sometimes lead to contradictory conclusions, and there has been no consensus on which metric is suitable to use in practice. In this work, we propose a novel metric that goes beyond previous approaches. Recall that one of the most practical scenarios of using the learned representations is to apply them to downstream tasks. We argue that we should design the metric based on a similar principle. For that, we introduce the transferred discrepancy (TD), a new metric that defines the difference between two representations based on their downstream-task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection