Mutually-Aware Feature Learning for Few-Shot Object Counting
Yerim Jeon, Subeen Lee, Jihwan Kim, Jae-Pil Heo

TL;DR
This paper introduces MAFEA, a mutual-aware feature learning framework for few-shot object counting that enhances target recognition by enabling continuous interaction between query and exemplar features, leading to state-of-the-art results.
Contribution
The paper proposes a novel mutual-aware feature learning framework that improves few-shot object counting by fostering continuous interaction between query and exemplar features.
Findings
Achieves state-of-the-art performance on FSCD-LVIS and FSC-147 benchmarks.
Reduces target confusion significantly in multi-category scenarios.
Effectively associates target regions with exemplars using background tokens.
Abstract
Few-shot object counting has garnered significant attention for its practicality as it aims to count target objects in a query image based on given exemplars without additional training. However, the prevailing extract-and-match approach has a shortcoming: query and exemplar features lack interaction during feature extraction since they are extracted independently and later correlated based on similarity. This can lead to insufficient target awareness and confusion in identifying the actual target when multiple class objects coexist. To address this, we propose a novel framework, Mutually-Aware FEAture learning (MAFEA), which encodes query and exemplar features with mutual awareness from the outset. By encouraging interaction throughout the pipeline, we obtain target-aware features robust to a multi-category scenario. Furthermore, we introduce background token to effectively associate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsSoftmax · Attention Is All You Need · Attentive Walk-Aggregating Graph Neural Network
