Temporal Flow Mask Attention for Open-Set Long-Tailed Recognition of Wild Animals in Camera-Trap Images
Jeongsoo Kim, Sangmin Woo, Byeongjun Park, Changick Kim

TL;DR
This paper introduces the Temporal Flow Mask Attention Network, a novel deep learning approach combining optical flow, attention residuals, and meta-embedding techniques to improve open-set long-tailed recognition of wildlife in camera-trap images, especially for unknown classes.
Contribution
The paper presents a new network architecture that integrates optical flow, attention mechanisms, and meta-embedding to address open-set long-tailed recognition in wildlife images.
Findings
Effective in recognizing known and unknown wildlife classes.
Robust to class imbalance and open-set scenarios.
Outperforms existing methods on DMZ dataset.
Abstract
Camera traps, unmanned observation devices, and deep learning-based image recognition systems have greatly reduced human effort in collecting and analyzing wildlife images. However, data collected via above apparatus exhibits 1) long-tailed and 2) open-ended distribution problems. To tackle the open-set long-tailed recognition problem, we propose the Temporal Flow Mask Attention Network that comprises three key building blocks: 1) an optical flow module, 2) an attention residual module, and 3) a meta-embedding classifier. We extract temporal features of sequential frames using the optical flow module and learn informative representation using attention residual blocks. Moreover, we show that applying the meta-embedding technique boosts the performance of the method in open-set long-tailed recognition. We apply this method on a Korean Demilitarized Zone (DMZ) dataset. We conduct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Cancer-related molecular mechanisms research
