InsTraj: Instructing Diffusion Models with Travel Intentions to Generate Real-world Trajectories
Yuanshao Zhu, Yuxuan Liang, Xiangyu Zhao, Liang Han, Xinwei Fang, Xun Zhou, Xuetao Wei, James Jianqiao Yu

TL;DR
InsTraj is a novel framework that uses diffusion models guided by natural language instructions to generate realistic, diverse, and semantically faithful GPS trajectories for urban applications.
Contribution
It introduces a multimodal diffusion transformer guided by large language models to interpret natural language travel intentions and generate high-fidelity trajectories.
Findings
Outperforms state-of-the-art methods in realism and diversity.
Generates trajectories faithful to user instructions.
Effectively interprets complex natural language travel intents.
Abstract
The generation of realistic and controllable GPS trajectories is a fundamental task for applications in urban planning, mobility simulation, and privacy-preserving data sharing. However, existing methods face a two-fold challenge: they lack the deep semantic understanding to interpret complex user travel intent, and struggle to handle complex constraints while maintaining the realistic diversity inherent in human behavior. To resolve this, we introduce InsTraj, a novel framework that instructs diffusion models to generate high-fidelity trajectories directly from natural language descriptions. Specifically, InsTraj first utilizes a powerful large language model to decipher unstructured travel intentions formed in natural language, thereby creating rich semantic blueprints and bridging the representation gap between intentions and trajectories. Subsequently, we proposed a multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
