TR-LLM: Integrating Trajectory Data for Scene-Aware LLM-Based Human Action Prediction
Kojiro Takeyama, Yimeng Liu, Misha Sra

TL;DR
This paper introduces TR-LLM, a multimodal framework that combines large language models with trajectory data to improve human action prediction in complex, occluded environments, addressing limitations of traditional video-based methods.
Contribution
The paper proposes a novel integration of trajectory data with LLMs to incorporate physical constraints, enhancing prediction accuracy in spatially complex scenarios.
Findings
Combining LLMs with trajectory data improves prediction accuracy.
The approach is especially effective with limited scene information.
Physical constraints from trajectories enhance LLM understanding.
Abstract
Accurate prediction of human behavior is crucial for AI systems to effectively support real-world applications, such as autonomous robots anticipating and assisting with human tasks. Real-world scenarios frequently present challenges such as occlusions and incomplete scene observations, which can compromise predictive accuracy. Thus, traditional video-based methods often struggle due to limited temporal and spatial perspectives. Large Language Models (LLMs) offer a promising alternative. Having been trained on a large text corpus describing human behaviors, LLMs likely encode plausible sequences of human actions in a home environment. However, LLMs, trained primarily on text data, lack inherent spatial awareness and real-time environmental perception. They struggle with understanding physical constraints and spatial geometry. Therefore, to be effective in a real-world spatial scenario,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting
