Explored An Effective Methodology for Fine-Grained Snake Recognition
Yong Huang, Aderon Huang, Wei Zhu, Yanming Fang, Jinghua Feng

TL;DR
This paper presents a comprehensive methodology combining multimodal backbones, novel loss functions, self-supervised pretraining, and data tricks to enhance fine-grained snake recognition, achieving top leaderboard performance.
Contribution
It introduces a new multimodal backbone, loss functions for long tail distribution, and joint self-supervised and supervised training for FGVC in snake recognition.
Findings
Achieved macro F1 score of 92.7% on private dataset
Secured 1st place on private leaderboard
Demonstrated effectiveness of combined techniques in FGVC
Abstract
Fine-Grained Visual Classification (FGVC) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. This paper describes our contribution at SnakeCLEF2022 with FGVC. Firstly, we design a strong multimodal backbone to utilize various meta-information to assist in fine-grained identification. Secondly, we provide new loss functions to solve the long tail distribution with dataset. Then, in order to take full advantage of unlabeled datasets, we use self-supervised learning and supervised learning joint training to provide pre-trained model. Moreover, some effective data process tricks also are considered in our experiments. Last but not least, fine-tuned in downstream task with hard mining, ensambled kinds of model performance. Extensive experiments demonstrate that our method can effectively improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Digital Imaging for Blood Diseases · Rabies epidemiology and control
