Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization
Qi Zhang, Kaiyi Zhang, Antoni B. Chan, and Hui Huang

TL;DR
This paper introduces a novel Mahalanobis distance-based multi-view optimal transport loss for multi-view crowd localization, improving accuracy by addressing density map ambiguities and incorporating view-specific information.
Contribution
It proposes a new Mahalanobis distance-based optimal transport loss tailored for multi-view crowd localization, enhancing performance over existing density map and Euclidean-based methods.
Findings
Outperforms density map-based methods on multiple datasets.
Effectively incorporates view ray directions and camera distances.
Demonstrates robustness in crowded scenes.
Abstract
Multi-view crowd localization predicts the ground locations of all people in the scene. Typical methods usually estimate the crowd density maps on the ground plane first, and then obtain the crowd locations. However, the performance of existing methods is limited by the ambiguity of the density maps in crowded areas, where local peaks can be smoothed away. To mitigate the weakness of density map supervision, optimal transport-based point supervision methods have been proposed in the single-image crowd localization tasks, but have not been explored for multi-view crowd localization yet. Thus, in this paper, we propose a novel Mahalanobis distance-based multi-view optimal transport (M-MVOT) loss specifically designed for multi-view crowd localization. First, we replace the Euclidean-based transport cost with the Mahalanobis distance, which defines elliptical iso-contours in the cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Video Surveillance and Tracking Methods · Speech and Audio Processing
