Multi-view Phase-aware Pedestrian-Vehicle Incident Reasoning Framework with Vision-Language Models
Hao Zhen, Yunxiang Yang, Jidong J. Yang

TL;DR
This paper presents MP-PVIR, a comprehensive multi-view, phase-aware framework utilizing vision-language models to analyze pedestrian-vehicle incidents, providing structured insights and prevention strategies from multi-view video data.
Contribution
The paper introduces a novel multi-view, phase-aware incident reasoning framework that integrates behavioral theory with vision-language models for detailed incident analysis.
Findings
TG-VLM achieves 0.4881 mIoU in phase segmentation
PhaVR-VLM attains 33.063 captioning score
Up to 64.70% accuracy in question answering
Abstract
Pedestrian-vehicle incidents remain a critical urban safety challenge, with pedestrians accounting for over 20% of global traffic fatalities. Although existing video-based systems can detect when incidents occur, they provide little insight into how these events unfold across the distinct cognitive phases of pedestrian behavior. Recent vision-language models (VLMs) have shown strong potential for video understanding, but they remain limited in that they typically process videos in isolation, without explicit temporal structuring or multi-view integration. This paper introduces Multi-view Phase-aware Pedestrian-Vehicle Incident Reasoning (MP-PVIR), a unified framework that systematically processes multi-view video streams into structured diagnostic reports through four stages: (1) event-triggered multi-view video acquisition, (2) pedestrian behavior phase segmentation, (3) phase-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Autonomous Vehicle Technology and Safety · Human Pose and Action Recognition
