Weakly Supervised Dataset Collection for Robust Person Detection
Munetaka Minoguchi, Ken Okayama, Yutaka Satoh, Hirokatsu Kataoka

TL;DR
This paper introduces a large-scale weakly supervised person detection dataset with over 8 million images, demonstrating its effectiveness for pre-training models that outperform those trained on smaller, fully supervised datasets.
Contribution
The creation of the 8.7 million image Weakly Supervised Person Dataset (WSPD) using a two-step collection process, enabling effective pre-training for person detection.
Findings
WSPD pre-trained model outperforms ImageNet-based models by 13.38% accuracy.
WSPD pre-trained model outperforms EuroCity Persons-based models by 6.38% accuracy.
WSPD is effective for pre-training in person detection tasks.
Abstract
To construct an algorithm that can provide robust person detection, we present a dataset with over 8 million images that was produced in a weakly supervised manner. Through labor-intensive human annotation, the person detection research community has produced relatively small datasets containing on the order of 100,000 images, such as the EuroCity Persons dataset, which includes 240,000 bounding boxes. Therefore, we have collected 8.7 million images of persons based on a two-step collection process, namely person detection with an existing detector and data refinement for false positive suppression. According to the experimental results, the Weakly Supervised Person Dataset (WSPD) is simple yet effective for person detection pre-training. In the context of pre-trained person detection algorithms, our WSPD pre-trained model has 13.38 and 6.38% better accuracy than the same model trained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Human Pose and Action Recognition
