Fractal Calibration for long-tailed object detection

Konstantinos Panagiotis Alexandridis; Ismail Elezi; Jiankang Deng; Anh; Nguyen; Shan Luo

arXiv:2410.11774·cs.CV·March 25, 2025

Fractal Calibration for long-tailed object detection

Konstantinos Panagiotis Alexandridis, Ismail Elezi, Jiankang Deng, Anh, Nguyen, Shan Luo

PDF

Open Access 1 Repo 4 Reviews

TL;DR

FRACAL is a post-calibration method that uses fractal dimension to improve long-tailed object detection by balancing class probabilities without retraining, enhancing rare class detection across multiple datasets.

Contribution

It introduces a novel fractal dimension-based logit adjustment for post-calibration in long-tailed detection, outperforming previous methods without additional training.

Findings

01

Boosts rare class detection by up to 8.6%.

02

Surpasses previous methods on LVIS dataset.

03

Generalizes well to COCO, V3Det, and OpenImages.

Abstract

Real-world datasets follow an imbalanced distribution, which poses significant challenges in rare-category object detection. Recent studies tackle this problem by developing re-weighting and re-sampling methods, that utilise the class frequencies of the dataset. However, these techniques focus solely on the frequency statistics and ignore the distribution of the classes in image space, missing important information. In contrast to them, we propose FRActal CALibration (FRACAL): a novel post-calibration method for long-tailed object detection. FRACAL devises a logit adjustment method that utilises the fractal dimension to estimate how uniformly classes are distributed in image space. During inference, it uses the fractal dimension to inversely downweight the probabilities of uniformly spaced class predictions achieving balance in two axes: between frequent and rare categories, and between…

Peer Reviews

Decision·ICLR 2025 Conference Withdrawn Submission

Reviewer 01Rating 3Confidence 5

Strengths

The paper tackles a relevant and important problem. Post-training calibration of object detectors is of practical importance. Integrating spatial location awareness into logit adjustment is a novel and interesting idea. The authors report improved results on many datasets.

Weaknesses

There are three major weaknesses: 1. Clarity: The paper is not well-written. Important pieces of information are not clear. 2. Substantiation of the main claim: I am not sure whether the main claim of the paper, which is incorporating spatial location distributions, is substantiated. 3. Performance comparisons: Experimental comparisons do not provide a clear indication whether we should use this technique with modern object detectors. I explain them in detail below. ## Clarity: - Equat

Reviewer 02Rating 6Confidence 4

Strengths

1. This paper addresses an underexplored aspect in long-tailed object detection by introducing spatial information into post-calibration using fractal dimensions. This is innovative and may inspire new directions in handling data imbalance in object detection. 2. The proposed FRACAL method achieves significant improvements in performance for rare classes without retraining or additional inference costs, making it a practical tool for applications needing efficient solutions. 3. The authors con

Weaknesses

1. While the fractal dimension is a core component of the proposed method, the paper could benefit from a more detailed analysis on how different fractal dimension values influence performance across categories. This would add insight into the method's adaptability to datasets with varying spatial distributions. 2. The paper briefly discusses the limitations of grid-based approaches but does not thoroughly compare FRACAL against alternative spatial calibration methods. Including this would help

Reviewer 03Rating 6Confidence 3

Strengths

1) This paper attempts to introduce the class-location dependency to post-calibration, which enhances the performance of rare categories. 2) This work fuses the space information by integrating the fractal dimension and optimizes the detection performance. 3) This is a plug-in-plug-out module, so it does not need extra training.

Weaknesses

1) I feel confused about the description of Fig. 1. Why class prior ps(y) is used to align ps(y, u|x) with pt(y|x) rather than align ps(y|x) with pt(y|x)? Do you mean that space information has been considered by previous work just for training data? 2) Although the fractal dimension is introduced, the theoretical explanation of how the fractal dimension specifically affects the detection of rare categories can be more detailed. It is possible to further discuss how the fractal dimension improv

Reviewer 04Rating 5Confidence 4

Strengths

+ Calibration is a viable method for reducing the impact of class/data imbalance. + The gap identified by the authors (i.e., taking location distribution into account during calibration) is definitely promising. + The proposed calibration method taking into account prior distribution of samples across space as well as the fractal dimension is novel.

Weaknesses

1. Although the proposed approach is novel, some of the individual steps are not sufficiently motivated. For example: 1.1. "Eq. 9 downweighs the classes that appear most uniformly and it upweighs the classes that appear less uniformly. In practice, this scheme enforces a centre bias for frequent classes and no bias for rare classes, as shown in Fig. 2-bottom-right. Intuitively, removing the bias from the rare classes is better than rectifying it because it produces balanced detectors and aligns

Code & Models

Repositories

kostas1515/FRACAL
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCurrency Recognition and Detection · Industrial Vision Systems and Defect Detection · Image Retrieval and Classification Techniques

MethodsFocus