Knowledge Transfer Based Fine-grained Visual Classification
Siqing Zhang, Ruoyi Du, Dongliang Chang, Zhanyu Ma, Jun Guo

TL;DR
This paper introduces a knowledge transfer approach with orthogonal loss to improve fine-grained visual classification by encouraging diverse region discovery, achieving state-of-the-art results.
Contribution
It proposes a novel knowledge transfer framework with orthogonal loss to enhance region diversity in FGVC, surpassing existing methods.
Findings
Achieved state-of-the-art performance on three FGVC datasets.
Demonstrated the effectiveness of orthogonal loss in promoting diverse region learning.
Outperformed baseline models in fine-grained classification accuracy.
Abstract
Fine-grained visual classification (FGVC) aims to distinguish the sub-classes of the same category and its essential solution is to mine the subtle and discriminative regions. Convolution neural networks (CNNs), which employ the cross entropy loss (CE-loss) as the loss function, show poor performance since the model can only learn the most discriminative part and ignore other meaningful regions. Some existing works try to solve this problem by mining more discriminative regions by some detection techniques or attention mechanisms. However, most of them will meet the background noise problem when trying to find more discriminative regions. In this paper, we address it in a knowledge transfer learning manner. Multiple models are trained one by one, and all previously trained models are regarded as teacher models to supervise the training of the current one. Specifically, a orthogonal loss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsConvolution
