Categorical Keypoint Positional Embedding for Robust Animal Re-Identification
Yuhao Lin, Lingqiao Liu, Javen Shi

TL;DR
This paper introduces a novel animal re-identification method using keypoint propagation and enhanced Vision Transformer embeddings, significantly improving accuracy and reducing manual annotation costs across multiple wildlife datasets.
Contribution
It proposes a new keypoint propagation technique with a diffusion model and enhances ViT with Categorical Keypoint Positional Embedding for robust animal re-identification.
Findings
Outperforms state-of-the-art methods on four wildlife datasets
Reduces manual annotation effort significantly
Provides more detailed and semantically-aware keypoint representations
Abstract
Animal re-identification (ReID) has become an indispensable tool in ecological research, playing a critical role in tracking population dynamics, analyzing behavioral patterns, and assessing ecological impacts, all of which are vital for informed conservation strategies. Unlike human ReID, animal ReID faces significant challenges due to the high variability in animal poses, diverse environmental conditions, and the inability to directly apply pre-trained models to animal data, making the identification process across species more complex. This work introduces an innovative keypoint propagation mechanism, which utilizes a single annotated image and a pre-trained diffusion model to propagate keypoints across an entire dataset, significantly reducing the cost of manual annotation. Additionally, we enhance the Vision Transformer (ViT) by implementing Keypoint Positional Encoding (KPE) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIdentification and Quantification in Food · Food Supply Chain Traceability · Animal Behavior and Welfare Studies
MethodsAbsolute Position Encodings · Adam · Softmax · Label Smoothing · Dropout · Dense Connections · Layer Normalization · Diffusion · Linear Layer · Multi-Head Attention
