The EuroCity Persons Dataset: A Novel Benchmark for Object Detection
Markus Braun, Sebastian Krebs, Fabian Flohr, and Dariu M. Gavrila

TL;DR
The paper introduces the EuroCity Persons dataset, a large, diverse, and detailed benchmark for pedestrian and cyclist detection in urban traffic scenes, enabling improved deep learning model evaluation.
Contribution
It presents a new extensive dataset with high-quality annotations collected across Europe, and evaluates multiple state-of-the-art detectors to establish baseline performances.
Findings
EuroCity Persons dataset contains over 238,200 person instances in 47,300 images.
Training on this dataset improves generalization of detectors across different conditions.
Dataset diversity and annotation quality significantly affect detection performance.
Abstract
Big data has had a great share in the success of deep learning in computer vision. Recent works suggest that there is significant further potential to increase object detection performance by utilizing even bigger datasets. In this paper, we introduce the EuroCity Persons dataset, which provides a large number of highly diverse, accurate and detailed annotations of pedestrians, cyclists and other riders in urban traffic scenes. The images for this dataset were collected on-board a moving vehicle in 31 cities of 12 European countries. With over 238200 person instances manually labeled in over 47300 images, EuroCity Persons is nearly one order of magnitude larger than person datasets used previously for benchmarking. The dataset furthermore contains a large number of person orientation annotations (over 211200). We optimize four state-of-the-art deep learning approaches (Faster R-CNN,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution · Non Maximum Suppression · 1x1 Convolution · Position-Sensitive RoI Pooling · SSD · Region-based Fully Convolutional Network
