Supervised Dimensionality Reduction Revisited: Why LDA on Frozen CNN Features Deserves a Second Look
Indar Kumar, Girish Karhana, Sai Krishna Jasti, Ankit Hemant Lade

TL;DR
This paper reevaluates the effectiveness of Linear Discriminant Analysis (LDA) for supervised dimensionality reduction on frozen CNN features, highlighting its benefits for coarse-grained tasks and limitations for fine-grained recognition.
Contribution
It systematically compares LDA with other reduction methods across multiple backbones and datasets, providing practical guidelines for their use in transfer learning pipelines.
Findings
LDA improves accuracy in 11 of 12 coarse-grained configurations.
LDA reduces feature dimensionality by 48-87%.
LDA underperforms on fine-grained CUB-200 dataset.
Abstract
Frozen pretrained image representations are widely used for transfer learning: a backbone is kept fixed, feature vectors are extracted, and a lightweight classifier is trained on top. This pipeline usually feeds the full feature vector to the classifier, even when the target task has far fewer classes than the pretraining task. We revisit a classical alternative: supervised dimensionality reduction with Linear Discriminant Analysis (LDA) before linear probing. We evaluate ten dimensionality-reduction strategies on frozen features from six backbones -- ResNet-18, ResNet-50, MobileNetV3-Small, EfficientNet-B0, ViT-B/16, and DINOv2-ViT-S/14 -- across CIFAR-100, Tiny ImageNet, and CUB-200-2011. Under a fixed logistic-regression protocol, LDA improves accuracy over full features in 11 of 12 coarse-grained configurations, with gains up to 4.5 percentage points while reducing feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
