Robust Object Detection With Inaccurate Bounding Boxes
Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, and Ziming Zhang

TL;DR
This paper introduces OA-MIL, a novel approach that improves object detection robustness against inaccurate bounding boxes by leveraging classification guidance and instance selection, validated on noisy datasets.
Contribution
Proposes OA-MIL, an object-aware multiple instance learning method that refines localization using classification signals and instance selection to handle noisy bounding boxes.
Findings
Effective on synthetic noisy datasets like PASCAL VOC and MS-COCO.
Improves localization accuracy despite inaccurate bounding box annotations.
Demonstrates robustness on real noisy wheat head dataset.
Abstract
Learning accurate object detectors often requires large-scale training data with precise object bounding boxes. However, labeling such data is expensive and time-consuming. As the crowd-sourcing labeling process and the ambiguities of the objects may raise noisy bounding box annotations, the object detectors will suffer from the degenerated training data. In this work, we aim to address the challenge of learning robust object detectors with inaccurate bounding boxes. Inspired by the fact that localization precision suffers significantly from inaccurate bounding boxes while classification accuracy is less affected, we propose leveraging classification as a guidance signal for refining localization results. Specifically, by treating an object as a bag of instances, we introduce an Object-Aware Multiple Instance Learning approach (OA-MIL), featured with object-aware instance selection and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
