SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization
Qi Zhang, Daijie Chen, Yunfei Gong, Hui Huang

TL;DR
SynMVCrowd introduces a large synthetic benchmark dataset for multi-view crowd counting and localization, enabling more practical evaluation and advancing research in large-scale crowd analysis.
Contribution
The paper presents a new large-scale synthetic dataset, SynMVCrowd, and strong baseline methods, improving evaluation and domain transfer for multi-view crowd counting and localization.
Findings
The benchmark contains 50 synthetic scenes with up to 1000 crowds.
Proposed baselines outperform existing methods on SynMVCrowd.
Using SynMVCrowd improves domain transfer to real scenes.
Abstract
Existing multi-view crowd counting and localization methods are evaluated under relatively small scenes with limited crowd numbers, camera views, and frames. This makes the evaluation and comparison of existing methods impractical, as small datasets are easily overfit by these methods. To avoid these issues, 3DROM proposes a data augmentation method. Instead, in this paper, we propose a large synthetic benchmark, SynMVCrowd, for more practical evaluation and comparison of multi-view crowd counting and localization tasks. The SynMVCrowd benchmark consists of 50 synthetic scenes with a large number of multi-view frames and camera views and a much larger crowd number (up to 1000), which is more suitable for large-scene multi-view crowd vision tasks. Besides, we propose strong multi-view crowd localization and counting baselines that outperform all comparison methods on the new SynMVCrowd…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications · Human Pose and Action Recognition
