Get the Best of Both Worlds: Improving Accuracy and Transferability by Grassmann Class Representation
Haoqi Wang, Zhizhong Li, Wayne Zhang

TL;DR
This paper introduces Grassmann Class Representation (GCR), which generalizes class vectors to subspaces on the Grassmann manifold, improving neural network accuracy and transferability by jointly optimizing class subspaces with deep models.
Contribution
The paper proposes a novel GCR method that models classes as subspaces, enhancing representation power and transferability, and integrates Riemannian SGD into deep learning frameworks.
Findings
Significant error reduction on ImageNet-1K across multiple architectures.
Improved transfer accuracy on downstream tasks with GCR.
Increased intra-class feature variability with larger subspaces.
Abstract
We generalize the class vectors found in neural networks to linear subspaces (i.e.~points in the Grassmann manifold) and show that the Grassmann Class Representation (GCR) enables the simultaneous improvement in accuracy and feature transferability. In GCR, each class is a subspace and the logit is defined as the norm of the projection of a feature onto the class subspace. We integrate Riemannian SGD into deep learning frameworks such that class subspaces in a Grassmannian are jointly optimized with the rest model parameters. Compared to the vector form, the representative capability of subspaces is more powerful. We show that on ImageNet-1K, the top-1 error of ResNet50-D, ResNeXt50, Swin-T and Deit3-S are reduced by 5.6%, 4.5%, 3.0% and 3.5%, respectively. Subspaces also provide freedom for features to vary and we observed that the intra-class feature variability grows when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection
MethodsStochastic Gradient Descent
