Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature   Learning in Trajectory Prediction

Chandra Raskoti; Weizi Li

arXiv:2412.13419·cs.RO·December 19, 2024

Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction

Chandra Raskoti, Weizi Li

PDF

Open Access

TL;DR

This paper investigates a hybrid model combining LSTM and Transformer architectures to improve vehicle trajectory prediction by capturing complex spatial and temporal interactions, aiming for more interpretable and robust autonomous driving systems.

Contribution

It introduces a novel hybrid LSTM-Transformer model for trajectory prediction, integrating spatial and temporal features in a unified framework.

Findings

01

The hybrid model effectively captures vehicle interactions.

02

It demonstrates potential for improved interpretability.

03

Benchmark results show competitive performance.

Abstract

Accurate vehicle trajectory prediction is crucial for ensuring safe and efficient autonomous driving. This work explores the integration of Transformer based model with Long Short-Term Memory (LSTM) based technique to enhance spatial and temporal feature learning in vehicle trajectory prediction. Here, a hybrid model that combines LSTMs for temporal encoding with a Transformer encoder for capturing complex interactions between vehicles is proposed. Spatial trajectory features of the neighboring vehicles are processed and goes through a masked scatter mechanism in a grid based environment, which is then combined with temporal trajectory of the vehicles. This combined trajectory data are learned by sequential LSTM encoding and Transformer based attention layers. The proposed model is benchmarked against predecessor LSTM based methods, including STA-LSTM, SA-LSTM, CS-LSTM, and NaiveLSTM.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques

MethodsAttention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Adam · Layer Normalization · Sigmoid Activation · Position-Wise Feed-Forward Layer · Label Smoothing · Tanh Activation