Metrics for Evaluating Social Conformity of Crowd Navigation Algorithms
Junxian Wang, Wesley P. Chan, Pamela Carreno-Medrano, Akansel Cosgun,, Elizabeth Croft

TL;DR
This paper proposes a comprehensive set of metrics and a systematic evaluation protocol for assessing social conformity and efficiency in crowd navigation algorithms, enabling better comparison and generalization.
Contribution
It introduces a consistent evaluation protocol and metrics that account for social behavior and efficiency, addressing inconsistencies in current assessments.
Findings
Some algorithms struggle to generalize across scenarios.
The proposed protocol improves algorithm performance.
Metrics effectively differentiate social conformity and efficiency.
Abstract
Recent protocols and metrics for training and evaluating autonomous robot navigation through crowds are inconsistent due to diversified definitions of "social behavior". This makes it difficult, if not impossible, to effectively compare published navigation algorithms. Furthermore, with the lack of a good evaluation protocol, resulting algorithms may fail to generalize, due to lack of diversity in training. To address these gaps, this paper facilitates a more comprehensive evaluation and objective comparison of crowd navigation algorithms by proposing a consistent set of metrics that accounts for both efficiency and social conformity, and a systematic protocol comprising multiple crowd navigation scenarios of varying complexity for evaluation. We tested four state-of-the-art algorithms under this protocol. Results revealed that some state-of-the-art algorithms have much challenge in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Evacuation and Crowd Dynamics · Human Mobility and Location-Based Analysis
