TJU-DHD: A Diverse High-Resolution Dataset for Object Detection

Yanwei Pang; Jiale Cao; Yazhao Li; Jin Xie; Hanqing Sun; and Jinfeng Gong

arXiv:2011.09170·cs.CV·November 19, 2020

TJU-DHD: A Diverse High-Resolution Dataset for Object Detection

Yanwei Pang, Jiale Cao, Yazhao Li, Jin Xie, Hanqing Sun, and Jinfeng Gong

PDF

1 Repo

TL;DR

The paper introduces TJU-DHD, a large, diverse, high-resolution dataset for object detection, especially targeting small objects like vehicles and pedestrians in various conditions, to advance perception systems for autonomous vehicles and surveillance.

Contribution

It provides a new high-resolution, diverse dataset with over 115,000 images and 700,000 objects, addressing limitations of existing datasets in size, resolution, and scenario diversity.

Findings

01

High-resolution images improve detection accuracy for small objects.

02

Diverse conditions enhance robustness of detection models.

03

Experiments show the dataset's effectiveness in training better detectors.

Abstract

Vehicles, pedestrians, and riders are the most important and interesting objects for the perception modules of self-driving vehicles and video surveillance. However, the state-of-the-art performance of detecting such important objects (esp. small objects) is far from satisfying the demand of practical systems. Large-scale, rich-diversity, and high-resolution datasets play an important role in developing better object detection methods to satisfy the demand. Existing public large-scale datasets such as MS COCO collected from websites do not focus on the specific scenarios. Moreover, the popular datasets (e.g., KITTI and Citypersons) collected from the specific scenarios are limited in the number of images and instances, the resolution, and the diversity. To attempt to solve the problem, we build a diverse high-resolution dataset (called TJU-DHD). The dataset contains 115,354…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tjubiit/TJU-DHD
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsNon Maximum Suppression · Focal Loss · Convolution · FCOS · 1x1 Convolution · RetinaNet · Feature Pyramid Network