TL;DR
This paper introduces a three-step framework for improved multiple object detection and tracking in panoramic videos, enhancing cycling safety analysis by addressing distortions and boundary issues.
Contribution
It proposes novel methods for enhancing detection and tracking accuracy in panoramic imagery, validated through real-world cycling safety applications.
Findings
Higher average precision across various image resolutions.
10.0% reduction in identification switches.
F-score of 0.82 in vehicle overtaking detection.
Abstract
Cyclists face a disproportionate risk of injury, yet conventional crash records are too sparse to identify risk factors at fine spatial and temporal scales. Recently, naturalistic studies have used video data to capture the complex behavioural and infrastructural risk factors. A promising format is panoramic video, which can record 360 views around a rider. However, its use is limited by distortions, large numbers of small objects, and boundary continuity, which cannot be handled using existing computer vision models. This research proposes a novel three-step framework: (1) enhancing object detection accuracy on panoramic imagery by segmenting and projecting the original 360 images into sub-images; (2) modifying multi-object tracking models to incorporate boundary continuity and object category information; and (3) validating through a real-world application of vehicle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Vehicle License Plate Recognition
MethodsSparse Evolutionary Training
