Exploring the Common Principal Subspace of Deep Features in Neural Networks
Haoran Liu, Haoyi Xiong, Yaqing Wang, Haozhe An, Dongrui Wu, and, Dejing Dou

TL;DR
This paper reveals that diverse deep neural networks trained on the same dataset share a common principal subspace in their learned features, regardless of architecture or training method, and introduces a new metric to measure this shared structure.
Contribution
The authors propose a novel $ ext{P}$-vector metric to quantify the principal subspace of deep features and demonstrate its effectiveness across various architectures and training paradigms.
Findings
Deep networks share a common principal subspace in learned features.
The $ ext{P}$-vector metric effectively measures angles between subspaces.
Angles decrease during training, indicating convergence of feature spaces.
Abstract
We find that different Deep Neural Networks (DNNs) trained with the same dataset share a common principal subspace in latent spaces, no matter in which architectures (e.g., Convolutional Neural Networks (CNNs), Multi-Layer Preceptors (MLPs) and Autoencoders (AEs)) the DNNs were built or even whether labels have been used in training (e.g., supervised, unsupervised, and self-supervised learning). Specifically, we design a new metric -vector to represent the principal subspace of deep features learned in a DNN, and propose to measure angles between the principal subspaces using -vectors. Small angles (with cosine close to ) have been found in the comparisons between any two DNNs trained with different algorithms/architectures. Furthermore, during the training procedure from random scratch, the angle decrease from a larger one ( usually) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
