UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
Meiqi Sun, Zhonghan Zhao, Wenhao Chai, Hanjun Luo, Shidong Cao,, Yanting Zhang, Jenq-Neng Hwang, Gaoang Wang

TL;DR
UniAP is a universal animal perception model that uses few-shot learning and Transformer architecture to adapt to various animal species and perception tasks with minimal data, enabling cross-species and cross-task generalization.
Contribution
The paper introduces UniAP, a novel few-shot learning framework for universal animal perception across multiple visual tasks and species, addressing data scarcity and semantic inconsistency.
Findings
Effective in pose estimation, segmentation, and classification across diverse species.
Capable of generalizing to unseen species with minimal labeled data.
Outperforms existing methods in cross-species perception tasks.
Abstract
Animal visual perception is an important technique for automatically monitoring animal health, understanding animal behaviors, and assisting animal-related research. However, it is challenging to design a deep learning-based perception model that can freely adapt to different animals across various perception tasks, due to the varying poses of a large diversity of animals, lacking data on rare species, and the semantic inconsistency of different tasks. We introduce UniAP, a novel Universal Animal Perception model that leverages few-shot learning to enable cross-species perception among various visual tasks. Our proposed model takes support images and labels as prompt guidance for a query image. Images and labels are processed through a Transformer-based encoder and a lightweight label encoder, respectively. Then a matching module is designed for aggregating information between prompt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
