Bird Species Classification using Transfer Learning with Multistage Training
Sourya Dipta Das, Akash Kumar

TL;DR
This paper presents a transfer learning approach with multistage training for bird species classification, combining Mask-RCNN and ensemble Inception models to improve localization and identification accuracy.
Contribution
It introduces a novel multistage training framework utilizing pre-trained Mask-RCNN and ensemble Inception models for fine-grained bird species classification.
Findings
Achieved an F1 score of 55.67% on CVIP 2018 dataset.
Demonstrated effectiveness of combining localization and ensemble classification.
Improved bird species recognition accuracy over baseline methods.
Abstract
Bird species classification has received more and more attention in the field of computer vision, for its promising applications in biology and environmental studies. Recognizing bird species is difficult due to the challenges of discriminative region localization and fine-grained feature learning. In this paper, we have introduced a Transfer learning based method with multistage training. We have used both Pre-Trained Mask-RCNN and an ensemble model consisting of Inception Nets (InceptionV3 & InceptionResNetV2 ) to get localization and species of the bird from the images respectively. Our final model achieves an F1 score of 0.5567 or 55.67 % on the dataset provided in CVIP 2018 Challenge.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Wildlife Ecology and Conservation · Identification and Quantification in Food
