# Social-aware trajectory prediction using goal-directed attention networks with egocentric vision

**Authors:** Lia Astuti, Chui-Hong Chiu, Yu-Chen Lin, Ming-Chih Lin

PMC · DOI: 10.7717/peerj-cs.2842 · PeerJ Computer Science · 2025-04-25

## TL;DR

This paper introduces a new model for predicting future movement paths of road users by combining vision data and social interactions into a unified framework.

## Contribution

The novel SGANet model integrates social interactions, temporal dynamics, and destination prediction into a unified multimodal trajectory prediction framework.

## Key findings

- SGANet outperforms previous benchmarks on both homogeneous and heterogeneous road user datasets.
- Socially-aware interaction weighting significantly improves prediction accuracy for ego-vehicle maneuvers.
- Each network component in SGANet is validated as effective through extensive experiments.

## Abstract

This study presents a novel social-goal attention networks (SGANet) model that employs a vision-based multi-stacked neural network framework to predict multiple future trajectories for both homogeneous and heterogeneous road users. Unlike existing methods that focus solely on one dataset type and treat social interactions, temporal dynamics, destination point, and uncertainty behaviors independently, SGANet integrates these components into a unified multimodal prediction framework. A graph attention network (GAT) captures socially-aware interaction correlation, a long short-term memory (LSTM) network encodes temporal dependencies, a goal-directed forecaster (GDF) estimates coarse future goals, and a conditional variational autoencoder (CVAE) generates multiple plausible trajectories, with multi-head attention (MHA) and feed-forward networks (FFN) refining the final multimodal trajectory prediction. Evaluations on homogeneous datasets (JAAD and PIE) and the heterogeneous TITAN dataset demonstrate that SGANet consistently outperforms previous benchmarks across varying prediction horizons. Extensive experiments highlight the critical role of socially-aware interaction weighting in capturing road users’ influence on ego-vehicle maneuvers while validating the effectiveness of each network component, thereby demonstrating the advantages of multi-stacked neural network integration for trajectory prediction. The dataset is available at https://usa.honda-ri.com/titan.

## Full-text entities

- **Genes:** PAEP (progestagen associated endometrial protein) [NCBI Gene 5047] {aka GD, GdA, GdF, GdS, PAEG, PEP}, GLYAT (glycine-N-acyltransferase) [NCBI Gene 10249] {aka ACGNAT, GAT}
- **Chemicals:** FIOU (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** LSTM — Homo sapiens (Human), Transformed cell line (CVCL_VJ00)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12192891/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12192891/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC12192891/full.md

---
Source: https://tomesphere.com/paper/PMC12192891