Cyclist Trajectory Forecasts by Incorporation of Multi-View Video Information
Stefan Zernetsch, Oliver Trupp, Viktor Kress, Konrad Doll and, Bernhard Sick

TL;DR
This paper introduces a new method that uses multi-view video data and deep learning to improve cyclist trajectory predictions at urban intersections, achieving significant accuracy gains over trajectory-only models.
Contribution
It presents a novel approach combining visual cues from stereo camera footage with trajectory data using 3D CNNs, enhancing prediction accuracy in real-world traffic scenarios.
Findings
Positional accuracy improved by up to 22% with visual cues.
Optical flow sequences alone significantly enhance accuracy.
Method is applicable in real-time urban traffic environments.
Abstract
This article presents a novel approach to incorporate visual cues from video-data from a wide-angle stereo camera system mounted at an urban intersection into the forecast of cyclist trajectories. We extract features from image and optical flow (OF) sequences using 3D convolutional neural networks (3D-ConvNet) and combine them with features extracted from the cyclist's past trajectory to forecast future cyclist positions. By the use of additional information, we are able to improve positional accuracy by about 7.5 % for our test dataset and by up to 22 % for specific motion types compared to a method solely based on past trajectories. Furthermore, we compare the use of image sequences to the use of OF sequences as additional information, showing that OF alone leads to significant improvements in positional accuracy. By training and testing our methods using a real-world dataset recorded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
