Pursuing Better Decision Boundaries for Long-Tailed Object Detection via   Category Information Amount

Yanbiao Ma; Wei Dai; Jiayi Chen

arXiv:2502.03852·cs.CV·February 7, 2025·2 cites

Pursuing Better Decision Boundaries for Long-Tailed Object Detection via Category Information Amount

Yanbiao Ma, Wei Dai, Jiayi Chen

PDF

Open Access

TL;DR

This paper introduces the concept of category information amount to better understand category bias in object detection, proposing a new loss function that dynamically adjusts decision boundaries based on this measure, improving performance on long-tail and balanced datasets.

Contribution

It defines and measures category information amount, revealing its correlation with detection difficulty, and proposes IGAM Loss to adapt decision boundaries accordingly, reducing category bias.

Findings

01

IGAM Loss improves detection on long-tail datasets like LVIS and COCO-LT.

02

Category information amount correlates negatively with detection accuracy.

03

Method enhances underrepresented categories in balanced datasets like Pascal VOC.

Abstract

In object detection, the instance count is typically used to define whether a dataset exhibits a long-tail distribution, implicitly assuming that models will underperform on categories with fewer instances. This assumption has led to extensive research on category bias in datasets with imbalanced instance counts. However, models still exhibit category bias even in datasets where instance counts are relatively balanced, clearly indicating that instance count alone cannot explain this phenomenon. In this work, we first introduce the concept and measurement of category information amount. We observe a significant negative correlation between category information amount and accuracy, suggesting that category information amount more accurately reflects the learning difficulty of a category. Based on this observation, we propose Information Amount-Guided Angular Margin (IGAM) Loss. The core…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Face and Expression Recognition