Valuing Time in Silicon: Can Large Language Models Replicate Human Value of Travel Time
Yingnan Yan, Tianming Liu, Yafeng Yin

TL;DR
This study evaluates whether large language models can accurately simulate human travel time valuation, finding they often exhibit human-like behavior but with some differences in sensitivity and magnitude, serving as a benchmark for future AI-human transportation modeling.
Contribution
It provides a comprehensive analysis of LLMs' ability to replicate human travel time valuation across diverse contexts, establishing a foundational benchmark for their use as proxies in transportation research.
Findings
LLMs show high behavioral similarity to humans in travel choices.
Some LLMs have aggregate VOT comparable to humans.
Behavioral patterns are consistent across different contexts.
Abstract
As a key advancement in artificial intelligence, large language models (LLMs) are set to transform transportation systems. While LLMs offer the potential to simulate human travelers in future mixed-autonomy transportation systems, their behavioral fidelity in complex scenarios remains largely unconfirmed by existing research. This study addresses this gap by conducting a comprehensive analysis of the value of travel time (VOT) of three popular LLMs. We employ a full factorial experimental design to systematically examine LLMs' sensitivities to various transportation contexts, including the choice setting, travel purpose, and socio-demographic factors. Our results reveal a high degree of behavioral similarity between LLMs and humans. Some LLMs exhibit an aggregate VOT similar to that of humans, and all tested models demonstrate human-like sensitivity to travel purpose, income, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
