Mastering Large Scale Multi-label Image Recognition with high efficiency   overCamera trap images

Miroslav Valan; Luk\'a\v{s} Picek

arXiv:2008.07828·cs.CV·August 19, 2020

Mastering Large Scale Multi-label Image Recognition with high efficiency overCamera trap images

Miroslav Valan, Luk\'a\v{s} Picek

PDF

Open Access

TL;DR

This paper presents a fast, lightweight machine learning approach for large-scale multi-label image recognition in camera trap images, achieving high accuracy and outperforming humans with minimal data augmentation.

Contribution

The authors introduce an efficient, simple baseline method that handles large datasets with limited hardware, avoiding overfitting and achieving top performance.

Findings

01

Achieved 97% accuracy on wildlife identification

02

Outperformed human-level performance

03

Trained on 6.7 million images using a single GPU

Abstract

Camera traps are crucial in biodiversity motivated studies, however dealing with large number of images while annotating these data sets is a tedious and time consuming task. To speed up this process, Machine Learning approaches are a reasonable asset. In this article we are proposing an easy, accessible, light-weight, fast and efficient approach based on our winning submission to the "Hakuna Ma-data - Serengeti Wildlife Identification challenge". Our system achieved an Accuracy of 97% and outperformed the human level performance. We show that, given relatively large data sets, it is effective to look at each image only once with little or no augmentation. By utilizing such a simple, yet effective baseline we were able to avoid over-fitting without extensive regularization techniques and to train a top scoring system on a very limited hardware featuring single GPU (1080Ti) despite the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Handwritten Text Recognition Techniques