Towards Automatic Identification of Elephants in the Wild
Matthias K\"orschens, Bj\"orn Barz, Joachim Denzler

TL;DR
This paper presents an automated system for identifying individual elephants in the wild using limited training data, combining object localization, CNN features, and SVMs to assist researchers in biodiversity monitoring.
Contribution
The novel system integrates object part localization, CNN features, and multi-image aggregation to improve elephant identification accuracy with few training images.
Findings
Achieved 56% top-1 accuracy on a dataset of 276 elephants.
Improved to 74% top-1 accuracy when using multiple images.
System effectively handles occlusion, viewpoints, and poses.
Abstract
Identifying animals from a large group of possible individuals is very important for biodiversity monitoring and especially for collecting data on a small number of particularly interesting individuals, as these have to be identified first before this can be done. Identifying them can be a very time-consuming task. This is especially true, if the animals look very similar and have only a small number of distinctive features, like elephants do. In most cases the animals stay at one place only for a short period of time during which the animal needs to be identified for knowing whether it is important to collect new data on it. For this reason, a system supporting the researchers in identifying elephants to speed up this process would be of great benefit. In this paper, we present such a system for identifying elephants in the face of a large number of individuals with only few training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Music and Audio Processing
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
