TL;DR
This paper reveals that small eigenvalues in global covariance pooling are crucial for fine-grained visual recognition, and proposes a method to amplify them, leading to state-of-the-art results on multiple benchmarks.
Contribution
It introduces a novel approach to magnify small eigenvalues in covariance pooling, improving fine-grained recognition without extra parameters.
Findings
Small eigenvalues are vital for discriminative features.
Amplifying small eigenvalues improves recognition accuracy.
The method achieves state-of-the-art results on fine-grained benchmarks.
Abstract
The Fine-Grained Visual Categorization (FGVC) is challenging because the subtle inter-class variations are difficult to be captured. One notable research line uses the Global Covariance Pooling (GCP) layer to learn powerful representations with second-order statistics, which can effectively model inter-class differences. In our previous conference paper, we show that truncating small eigenvalues of the GCP covariance can attain smoother gradient and improve the performance on large-scale benchmarks. However, on fine-grained datasets, truncating the small eigenvalues would make the model fail to converge. This observation contradicts the common assumption that the small eigenvalues merely correspond to the noisy and unimportant information. Consequently, ignoring them should have little influence on the performance. To diagnose this peculiar behavior, we propose two attribution methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
