Real Time Monocular Vehicle Velocity Estimation using Synthetic Data
Robert McCraith, Lukas Neumann, Andrea Vedaldi

TL;DR
This paper presents a two-step monocular vehicle velocity estimation method that combines off-the-shelf tracking with a neural network, leveraging synthetic data to enhance accuracy and interpretability in autonomous driving.
Contribution
It introduces a novel two-step approach separating perception and dynamics estimation, and demonstrates the effective use of synthetic bounding box data for training.
Findings
Achieves state-of-the-art velocity estimation performance.
Synthetic data improves model accuracy.
Separation of perception and dynamics enhances interpretability.
Abstract
Vision is one of the primary sensing modalities in autonomous driving. In this paper we look at the problem of estimating the velocity of road vehicles from a camera mounted on a moving car. Contrary to prior methods that train end-to-end deep networks that estimate the vehicles' velocity from the video pixels, we propose a two-step approach where first an off-the-shelf tracker is used to extract vehicle bounding boxes and then a small neural network is used to regress the vehicle velocity from the tracked bounding boxes. Surprisingly, we find that this still achieves state-of-the-art estimation performance with the significant benefit of separating perception from dynamics estimation via a clean, interpretable and verifiable interface which allows us distill the statistics which are crucial for velocity estimation. We show that the latter can be used to easily generate synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
