Recurrence over Video Frames (RoVF) for the Re-identification of Meerkats
Mitchell Rogers, Kobe Knowles, Ga\"el Gendron, Shahrokh Heidari, David, Arturo Soriano Valdez, Mihailo Azhar, Padriac O'Leary, Simon Eyre, Michael, Witbrock, Patrice Delmas

TL;DR
This paper introduces RoVF, a recurrent video-based embedding method for animal re-identification, demonstrating improved accuracy over existing transformer models on meerkat videos, aiding conservation efforts.
Contribution
The paper presents RoVF, a novel recurrent architecture using Perceiver for video-based animal re-identification, with training via triplet loss without individual IDs, outperforming transformer baselines.
Findings
RoVF achieves 49% top-1 accuracy, surpassing DINOv2's 42%.
RoVF can identify individuals humans cannot match.
Minimal fine-tuning yields better results than other models.
Abstract
Deep learning approaches for animal re-identification have had a major impact on conservation, significantly reducing the time required for many downstream tasks, such as well-being monitoring. We propose a method called Recurrence over Video Frames (RoVF), which uses a recurrent head based on the Perceiver architecture to iteratively construct an embedding from a video clip. RoVF is trained using triplet loss based on the co-occurrence of individuals in the video frames, where the individual IDs are unavailable. We tested this method and various models based on the DINOv2 transformer architecture on a dataset of meerkats collected at the Wellington Zoo. Our method achieves a top-1 re-identification accuracy of , which is higher than that of the best DINOv2 model (). We found that the model can match observations of individuals where humans cannot, and our model (RoVF)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection
MethodsTriplet Loss
