Non-iterative optimization of pseudo-labeling thresholds for training object detection models from multiple datasets
Yuki Tanaka, Shuhei M. Yoshida, Makoto Terao

TL;DR
This paper introduces a non-iterative method to optimize pseudo-labeling thresholds for object detection models trained on multiple datasets, reducing training time while maintaining high accuracy.
Contribution
The authors propose a novel threshold optimization technique that avoids iterative search, directly maximizing the pseudo-label quality metric without retraining models.
Findings
Achieves mAP comparable to grid search on COCO and VOC datasets.
Reduces training time by eliminating iterative threshold tuning.
Maintains high pseudo-label quality without multiple training cycles.
Abstract
We propose a non-iterative method to optimize pseudo-labeling thresholds for learning object detection from a collection of low-cost datasets, each of which is annotated for only a subset of all the object classes. A popular approach to this problem is first to train teacher models and then to use their confident predictions as pseudo ground-truth labels when training a student model. To obtain the best result, however, thresholds for prediction confidence must be adjusted. This process typically involves iterative search and repeated training of student models and is time-consuming. Therefore, we develop a method to optimize the thresholds without iterative optimization by maximizing the -score on a validation dataset, which measures the quality of pseudo labels and can be measured without training a student model. We experimentally demonstrate that our proposed method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
