Mutually-Aware Feature Learning for Few-Shot Object Counting

Yerim Jeon; Subeen Lee; Jihwan Kim; Jae-Pil Heo

arXiv:2408.09734·cs.CV·December 3, 2025

Mutually-Aware Feature Learning for Few-Shot Object Counting

Yerim Jeon, Subeen Lee, Jihwan Kim, Jae-Pil Heo

PDF

Open Access

TL;DR

This paper introduces MAFEA, a mutual-aware feature learning framework for few-shot object counting that enhances target recognition by enabling continuous interaction between query and exemplar features, leading to state-of-the-art results.

Contribution

The paper proposes a novel mutual-aware feature learning framework that improves few-shot object counting by fostering continuous interaction between query and exemplar features.

Findings

01

Achieves state-of-the-art performance on FSCD-LVIS and FSC-147 benchmarks.

02

Reduces target confusion significantly in multi-category scenarios.

03

Effectively associates target regions with exemplars using background tokens.

Abstract

Few-shot object counting has garnered significant attention for its practicality as it aims to count target objects in a query image based on given exemplars without additional training. However, the prevailing extract-and-match approach has a shortcoming: query and exemplar features lack interaction during feature extraction since they are extracted independently and later correlated based on similarity. This can lead to insufficient target awareness and confusion in identifying the actual target when multiple class objects coexist. To address this, we propose a novel framework, Mutually-Aware FEAture learning (MAFEA), which encodes query and exemplar features with mutual awareness from the outset. By encouraging interaction throughout the pipeline, we obtain target-aware features robust to a multi-category scenario. Furthermore, we introduce background token to effectively associate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsSoftmax · Attention Is All You Need · Attentive Walk-Aggregating Graph Neural Network