EgoSpeed-Net: Forecasting Speed-Control in Driver Behavior from Egocentric Video Data
Yichen Ding, Ziming Zhang, Yanhua Li, Xun Zhou

TL;DR
EgoSpeed-Net is a novel graph convolutional network that predicts driver speed-control actions solely from egocentric video data by modeling object relations over time.
Contribution
The paper introduces EgoSpeed-Net, a GCN-based model that forecasts driver speed actions using only egocentric video, unlike prior methods relying on third-person or sensor data.
Findings
EgoSpeed-Net outperforms existing methods on the Honda Research Institute Driving Dataset.
Object relation modeling over time improves speed-control forecasting accuracy.
The approach demonstrates the effectiveness of egocentric video for driver behavior prediction.
Abstract
Speed-control forecasting, a challenging problem in driver behavior analysis, aims to predict the future actions of a driver in controlling vehicle speed such as braking or acceleration. In this paper, we try to address this challenge solely using egocentric video data, in contrast to the majority of works in the literature using either third-person view data or extra vehicle sensor data such as GPS, or both. To this end, we propose a novel graph convolutional network (GCN) based network, namely, EgoSpeed-Net. We are motivated by the fact that the position changes of objects over time can provide us very useful clues for forecasting the speed change in future. We first model the spatial relations among the objects from each class, frame by frame, using fully-connected graphs, on top of which GCNs are applied for feature extraction. Then we utilize a long short-term memory network to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Memory Network · Greedy Policy Search
