On the Versatile Uses of Partial Distance Correlation in Deep Learning
Xingjian Zhen, Zihang Meng, Rudrasis Chakraborty, Vikas Singh

TL;DR
This paper explores the use of partial distance correlation, a statistical measure, to compare neural network models' functions, offering a versatile tool for analysis, regularization, and improving model robustness.
Contribution
It introduces a practical method to deploy partial distance correlation for large-scale neural networks, enabling diverse applications like model comparison, disentangled representation learning, and robustness enhancement.
Findings
Partial distance correlation effectively compares neural network features.
The method serves as a versatile regularizer and constraint.
Applications include model conditioning, disentanglement, and adversarial robustness.
Abstract
Comparing the functional behavior of neural network models, whether it is a single network over time or two (or more networks) during or post-training, is an essential step in understanding what they are learning (and what they are not), and for identifying strategies for regularization or efficiency improvements. Despite recent progress, e.g., comparing vision transformers to CNNs, systematic comparison of function, especially across different networks, remains difficult and is often carried out layer by layer. Approaches such as canonical correlation analysis (CCA) are applicable in principle, but have been sparingly used so far. In this paper, we revisit a (less widely known) from statistics, called distance correlation (and its partial variant), designed to evaluate correlation between feature spaces of different dimensions. We describe the steps necessary to carry out its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
