TL;DR
The paper introduces TJU-DHD, a large, diverse, high-resolution dataset for object detection, especially targeting small objects like vehicles and pedestrians in various conditions, to advance perception systems for autonomous vehicles and surveillance.
Contribution
It provides a new high-resolution, diverse dataset with over 115,000 images and 700,000 objects, addressing limitations of existing datasets in size, resolution, and scenario diversity.
Findings
High-resolution images improve detection accuracy for small objects.
Diverse conditions enhance robustness of detection models.
Experiments show the dataset's effectiveness in training better detectors.
Abstract
Vehicles, pedestrians, and riders are the most important and interesting objects for the perception modules of self-driving vehicles and video surveillance. However, the state-of-the-art performance of detecting such important objects (esp. small objects) is far from satisfying the demand of practical systems. Large-scale, rich-diversity, and high-resolution datasets play an important role in developing better object detection methods to satisfy the demand. Existing public large-scale datasets such as MS COCO collected from websites do not focus on the specific scenarios. Moreover, the popular datasets (e.g., KITTI and Citypersons) collected from the specific scenarios are limited in the number of images and instances, the resolution, and the diversity. To attempt to solve the problem, we build a diverse high-resolution dataset (called TJU-DHD). The dataset contains 115,354…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsNon Maximum Suppression · Focal Loss · Convolution · FCOS · 1x1 Convolution · RetinaNet · Feature Pyramid Network
