TL;DR
This paper introduces a unified approach for few-shot object detection that addresses challenges in both low-shot and high-shot regimes by leveraging rich knowledge transfer and distribution alignment techniques, leading to improved performance on VOC and COCO benchmarks.
Contribution
It proposes a novel framework combining distribution calibration, shift compensation, and knowledge guidance from ImageNet to enhance FSOD across different shot regimes.
Findings
Significant performance improvements on VOC and COCO benchmarks.
Effective handling of both low-shot and high-shot regimes.
Outperforms baseline methods in various shot settings.
Abstract
Recently, few-shot object detection~(FSOD) has received much attention from the community, and many methods are proposed to address this problem from a knowledge transfer perspective. Though promising results have been achieved, these methods fail to achieve shot-stable:~methods that excel in low-shot regimes are likely to struggle in high-shot regimes, and vice versa. We believe this is because the primary challenge of FSOD changes when the number of shots varies. In the low-shot regime, the primary challenge is the lack of inner-class variation. In the high-shot regime, as the variance approaches the real one, the main hindrance to the performance comes from misalignment between learned and true distributions. However, these two distinct issues remain unsolved in most existing FSOD methods. In this paper, we propose to overcome these challenges by exploiting rich knowledge the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
