A Multi-Modal States based Vehicle Descriptor and Dilated Convolutional Social Pooling for Vehicle Trajectory Prediction
Huimin Zhang, Yafei Wang, Junjia Liu, Chengwei Li, Taiyuan Ma,, Chengliang Yin

TL;DR
This paper introduces a novel vehicle trajectory prediction model that leverages multi-modal vehicle states and dilated convolutional social pooling to improve accuracy in autonomous driving scenarios.
Contribution
It proposes a vehicle descriptor encoding multi-modal states and a dilated convolutional social pooling to better model vehicle interactions, enhancing trajectory prediction accuracy.
Findings
Outperforms recent benchmarks on NGSIM datasets.
Effectively encodes multi-modal vehicle states.
Improves modeling of spatial vehicle interactions.
Abstract
Precise trajectory prediction of surrounding vehicles is critical for decision-making of autonomous vehicles and learning-based approaches are well recognized for the robustness. However, state-of-the-art learning-based methods ignore 1) the feasibility of the vehicle's multi-modal state information for prediction and 2) the mutual exclusive relationship between the global traffic scene receptive fields and the local position resolution when modeling vehicles' interactions, which may influence prediction accuracy. Therefore, we propose a vehicle-descriptor based LSTM model with the dilated convolutional social pooling (VD+DCS-LSTM) to cope with the above issues. First, each vehicle's multi-modal state information is employed as our model's input and a new vehicle descriptor encoded by stacked sparse auto-encoders is proposed to reflect the deep interactive relationships between various…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
