TL;DR
This paper compares animal recognition models for embedded wildlife camera traps, analyzing their accuracy and speed trade-offs, and proposes optimized models to efficiently filter empty images on resource-limited devices.
Contribution
It provides a comprehensive comparison of classifiers and detectors for empty image filtering, including optimization techniques and performance analysis on edge devices.
Findings
Detectors outperform classifiers in empty image removal with similar latency.
Classifiers can achieve comparable results to detectors with ten times more training data.
Optimized models using quantization and filter reduction improve inference speed and accuracy.
Abstract
Monitoring wildlife through camera traps produces a massive amount of images, whose a significant portion does not contain animals, being later discarded. Embedding deep learning models to identify animals and filter these images directly in those devices brings advantages such as savings in the storage and transmission of data, usually resource-constrained in this type of equipment. In this work, we present a comparative study on animal recognition models to analyze the trade-off between precision and inference latency on edge devices. To accomplish this objective, we investigate classifiers and object detectors of various input resolutions and optimize them using quantization and reducing the number of model filters. The confidence threshold of each model was adjusted to obtain 96% recall for the nonempty class, since instances from the empty class are expected to be discarded. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
