Towards pedestrian head tracking: A benchmark dataset and a multi-source data fusion network
Kailai Sun, Xinwei Wang, Shaobo Liu, Qianchuan Zhao, Gao Huang, Chang Liu

TL;DR
This paper introduces a large-scale pedestrian head tracking dataset and a novel multi-source data fusion neural network that significantly improves head detection and tracking in crowded scenes.
Contribution
The paper presents the first comprehensive head tracking dataset with diverse scenes and a new CNN-based network that fuses multiple data sources for enhanced accuracy.
Findings
MDFN outperforms existing methods on multiple datasets.
The dataset contains over 50,000 frames with extensive pedestrian head annotations.
Multi-source data fusion significantly improves head detection and tracking performance.
Abstract
Pedestrian detection and tracking in crowded video sequences have many applications, including autonomous driving, robot navigation and pedestrian flow analysis. However, detecting and tracking pedestrians in high-density crowds face many challenges, including intra-class occlusions, complex motions, and diverse poses. Although artificial intelligence (AI) models have achieved great progress in head detection, head tracking datasets and methods are extremely lacking. Existing head datasets have limited coverage of complex pedestrian flows and scenes (e.g., pedestrian interactions, occlusions, and object interference). It is of great importance to develop new head tracking datasets and methods. To address these challenges, we present a Chinese Large-scale Cross-scene Pedestrian Head Tracking dataset (Cchead) and a Multi-source Data Fusion Network (MDFN). The dataset has features that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Automated Road and Building Extraction · Gait Recognition and Analysis
