Towards view-invariant vehicle speed detection from driving simulator images
Antonio Hern\'andez Mart\'inez, David Fernandez Llorca, Iv\'an, Garc\'ia Daza

TL;DR
This paper investigates using multiple virtual camera views in driving simulators to develop a view-invariant vehicle speed detection system with deep learning, demonstrating that a single model can outperform view-specific models.
Contribution
It introduces a multi-view synthetic dataset and evaluates 3D-CNN architectures for view-invariant vehicle speed estimation, showing the effectiveness of a unified model.
Findings
Single model with multi-view data outperforms camera-specific models
Deep learning can implicitly learn view-invariant speed features
Simulated data can replace costly real-world data collection
Abstract
The use of cameras for vehicle speed measurement is much more cost effective compared to other technologies such as inductive loops, radar or laser. However, accurate speed measurement remains a challenge due to the inherent limitations of cameras to provide accurate range estimates. In addition, classical vision-based methods are very sensitive to extrinsic calibration between the camera and the road. In this context, the use of data-driven approaches appears as an interesting alternative. However, data collection requires a complex and costly setup to record videos under real traffic conditions from the camera synchronized with a high-precision speed sensor to generate the ground truth speed values. It has recently been demonstrated that the use of driving simulators (e.g., CARLA) can serve as a robust alternative for generating large synthetic datasets to enable the application of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
