Multi-Context Fusion Transformer for Pedestrian Crossing Intention Prediction in Urban Environments
Yuanzhe Li, Hang Zhong, Steffen M\"uller

TL;DR
This paper introduces a Multi-Context Fusion Transformer that integrates diverse contextual information to significantly improve pedestrian crossing intention prediction accuracy in urban environments for autonomous vehicles.
Contribution
The paper presents a novel multi-context fusion Transformer architecture that effectively combines behavior, environment, localization, and vehicle motion data for better prediction accuracy.
Findings
Achieves 73%, 93%, and 90% accuracy on JAADbeh, JAADall, and PIE datasets.
Outperforms state-of-the-art methods in pedestrian intention prediction.
Extensive ablation studies confirm the effectiveness of the multi-context fusion approach.
Abstract
Pedestrian crossing intention prediction is essential for autonomous vehicles to improve pedestrian safety and reduce traffic accidents. However, accurate pedestrian intention prediction in urban environments remains challenging due to the multitude of factors affecting pedestrian behavior. In this paper, we propose a multi-context fusion Transformer (MFT) that leverages diverse numerical contextual attributes across four key dimensions, encompassing pedestrian behavior context, environmental context, pedestrian localization context and vehicle motion context, to enable accurate pedestrian intention prediction. MFT employs a progressive fusion strategy, where mutual intra-context attention enables reciprocal interactions within each context, thereby facilitating feature sequence fusion and yielding a context token as a context-specific representation. This is followed by mutual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Traffic and Road Safety
