Exploring Aleatoric Uncertainty in Object Detection via Vision   Foundation Models

Peng Cui; Guande He; Dan Zhang; Zhijie Deng; Yinpeng Dong; Jun Zhu

arXiv:2411.17767·cs.CV·November 28, 2024

Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models

Peng Cui, Guande He, Dan Zhang, Zhijie Deng, Yinpeng Dong, Jun Zhu

PDF

Open Access

TL;DR

This paper introduces a method to quantify aleatoric uncertainty in object detection using vision foundation models, enabling more robust training by filtering noisy data and balancing samples, validated across multiple models and benchmarks.

Contribution

It proposes a novel uncertainty estimation approach based on feature space analysis with mixture-of-Gaussian models, enhancing object detection robustness.

Findings

01

Effective uncertainty estimation improves detection accuracy.

02

Uncertainty-based filtering reduces overfitting on noisy data.

03

Adaptive training with uncertainty balances model learning.

Abstract

Datasets collected from the open world unavoidably suffer from various forms of randomness or noiseness, leading to the ubiquity of aleatoric (data) uncertainty. Quantifying such uncertainty is particularly pivotal for object detection, where images contain multi-scale objects with occlusion, obscureness, and even noisy annotations, in contrast to images with centric and similar-scale objects in classification. This paper suggests modeling and exploiting the uncertainty inherent in object detection data with vision foundation models and develops a data-centric reliable training paradigm. Technically, we propose to estimate the data uncertainty of each object instance based on the feature space of vision foundation models, which are trained on ultra-large-scale datasets and able to exhibit universal data representation. In particular, we assume a mixture-of-Gaussian structure of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning