Video-to-Text Pedestrian Monitoring (VTPM): Leveraging Computer Vision and Large Language Models for Privacy-Preserve Pedestrian Activity Monitoring at Intersections
Ahmed S. Abdelrahman, Mohamed Abdel-Aty, Dongdong Wang

TL;DR
The paper introduces VTPM, a privacy-preserving pedestrian monitoring system at intersections that generates real-time textual reports from video data, detecting violations and analyzing safety patterns efficiently.
Contribution
It presents a novel framework combining computer vision and large language models to monitor pedestrians while preserving privacy and enabling detailed historical analysis.
Findings
Achieves 0.05s latency per frame for detection and tracking.
Detects crossing violations with 90.2% accuracy.
Reduces memory usage by up to 253 million percent.
Abstract
Computer vision has advanced research methodologies, enhancing system services across various fields. It is a core component in traffic monitoring systems for improving road safety; however, these monitoring systems don't preserve the privacy of pedestrians who appear in the videos, potentially revealing their identities. Addressing this issue, our paper introduces Video-to-Text Pedestrian Monitoring (VTPM), which monitors pedestrian movements at intersections and generates real-time textual reports, including traffic signal and weather information. VTPM uses computer vision models for pedestrian detection and tracking, achieving a latency of 0.05 seconds per video frame. Additionally, it detects crossing violations with 90.2% accuracy by incorporating traffic signal data. The proposed framework is equipped with Phi-3 mini-4k to generate real-time textual reports of pedestrian activity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Traffic and Road Safety
