Pedestrian Trajectory Prediction using Context-Augmented Transformer Networks
Khaled Saleh

TL;DR
This paper introduces a transformer-based framework for pedestrian trajectory prediction in urban traffic, integrating positional, interaction, and scene semantics data, outperforming RNN-based methods on real datasets.
Contribution
The work presents a novel transformer-based approach that effectively fuses multiple data types for improved pedestrian trajectory prediction.
Findings
Outperforms baseline methods in short-term predictions
Achieves better long-term trajectory accuracy
Validated on real urban traffic datasets
Abstract
Forecasting the trajectory of pedestrians in shared urban traffic environments is still considered one of the challenging problems facing the development of autonomous vehicles (AVs). In the literature, this problem is often tackled using recurrent neural networks (RNNs). Despite the powerful capabilities of RNNs in capturing the temporal dependency in the pedestrians' motion trajectories, they were argued to be challenged when dealing with longer sequential data. Thus, in this work, we are introducing a framework based on the transformer networks that were shown recently to be more efficient and outperformed RNNs in many sequential-based tasks. We relied on a fusion of the past positional information, agent interactions information and scene physical semantics information as an input to our framework in order to provide a robust trajectory prediction of pedestrians. We have evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods · Traffic Prediction and Management Techniques
