TrajPrism: A Multi-Task Benchmark for Language-Grounded Urban Trajectory Understanding
Lihuan Li, Wilson Wongso, Baiyu Chen, Hao Xue, Ruiyi Yang, Yifan Duan, Xiachong Lin, Yang Song, Flora Salim

TL;DR
TrajPrism is a comprehensive benchmark that evaluates the alignment between urban trajectories and natural language descriptions across multiple tasks, promoting integrated research in language-grounded mobility understanding.
Contribution
It introduces a novel multi-task benchmark with a large, diverse dataset and baseline models for trajectory generation, retrieval, and captioning, unifying geometry and language modalities.
Findings
Geometry-only baselines perform poorly on language-grounded tasks.
The benchmark contains 300K trajectories with 2.1M task instances across three cities.
Baseline models show significant room for improvement in language-trajectory alignment.
Abstract
Urban mobility is naturally expressed both as trajectories in space and as natural-language descriptions of travel intent, constraints, and preferences. However, prior work rarely evaluates these two modalities together on the same real-world trajectories: trajectory modeling often stays geometry-centric, while language-centric mobility benchmarks frequently target route planning and tool use rather than fine-grained, verifiable alignment between text and the underlying route. We introduce TrajPrism, a multi-task benchmark for language-trajectory alignment that unifies (i) instruction-conditioned trajectory generation, (ii) language-driven semantic trajectory retrieval, and (iii) trajectory captioning, together with an evaluation protocol that measures trajectory fidelity, retrieval quality, and language groundedness. We construct TrajPrism by pairing real urban trajectories with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
