Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection
Ashutosh Agarwal, Anay Majee, Anbumani Subramanian, Chetan, Arora

TL;DR
This paper introduces Attention Guided Cosine Margin (AGCM), a novel method for few-shot object detection that improves class separation and reduces confusion, especially in imbalanced, real-world datasets.
Contribution
The paper proposes AGCM with Attentive Proposal Fusion and Cosine Margin Cross-Entropy loss to enhance feature clustering and class distinction in few-shot object detection.
Findings
Outperforms state-of-the-art on IDD dataset by up to 6.4 mAP points.
Achieves up to 4.9 mAP points improvement on PASCAL-VOC.
Effectively reduces catastrophic forgetting and class confusion.
Abstract
Few-shot object detection (FSOD) localizes and classifies objects in an image given only a few data samples. Recent trends in FSOD research show the adoption of metric and meta-learning techniques, which are prone to catastrophic forgetting and class confusion. To overcome these pitfalls in metric learning based FSOD techniques, we introduce Attention Guided Cosine Margin (AGCM) that facilitates the creation of tighter and well separated class-specific feature clusters in the classification head of the object detector. Our novel Attentive Proposal Fusion (APF) module minimizes catastrophic forgetting by reducing the intra-class variance among co-occurring classes. At the same time, the proposed Cosine Margin Cross-Entropy loss increases the angular margin between confusing classes to overcome the challenge of class confusion between already learned (base) and newly added (novel)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
