Long-tailed Species Recognition in the NACTI Wildlife Dataset
Zehua Liu, Tilo Burghardt

TL;DR
This paper systematically studies long-tail recognition methods for wildlife species identification in the NACTI dataset, achieving high accuracy and demonstrating improved generalization under domain shifts, while also discussing limitations.
Contribution
It introduces a comprehensive evaluation of LTR techniques on NACTI, achieving state-of-the-art accuracy and providing reproducible code and data splits for future research.
Findings
Achieved 99.40% Top-1 accuracy on NACTI test data.
Improved generalization to shifted domains with 52.55% accuracy.
LTR methods show consistent benefits but struggle with severe tail class shifts.
Abstract
As most ''in the wild'' data collections of the natural world, the North America Camera Trap Images (NACTI) dataset shows severe long-tailed class imbalance, noting that the largest 'Head' class alone covers >50% of the 3.7M images in the corpus. Building on the PyTorch Wildlife model, we present a systematic study of Long-Tail Recognition methodologies for species recognition on the NACTI dataset covering experiments on various LTR loss functions plus LTR-sensitive regularisation. Our best configuration achieves 99.40% Top-1 accuracy on our NACTI test data split, substantially improving over a 95.51% baseline using standard cross-entropy with Adam. This also improves on previously reported top performance in MLWIC2 at 96.8% albeit using partly unpublished (potentially different) partitioning, optimiser, and evaluation protocols. To evaluate domain shifts (e.g. night-time captures,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Wildlife Ecology and Conservation · Domain Adaptation and Few-Shot Learning
