Can large language models assist choice modelling? Insights into prompting strategies and current models capabilities
Georges Sfeir, Gabriel Nova, Stephane Hess, Sander van Cranenburgh

TL;DR
This study explores the potential of large language models to assist in discrete choice modelling, evaluating their ability to suggest and estimate models using various prompting strategies and model configurations.
Contribution
It systematically assesses multiple LLMs' capabilities in specifying and estimating Multinomial Logit models, highlighting the effectiveness of structured prompts and identifying current limitations.
Findings
Proprietary LLMs can generate valid utility specifications.
Structured prompts improve model specification quality.
Open-weight models like Llama struggle with meaningful specifications.
Abstract
Large Language Models (LLMs) are becoming widely used to support various workflows across different disciplines, yet their potential in discrete choice modelling remains relatively unexplored. This work examines the potential of LLMs as assistive agents in the specification and, where technically feasible, estimation of Multinomial Logit models. We implement a systematic experimental framework involving twelve versions of seven leading LLMs (ChatGPT, Claude, DeepSeek, Gemini, Gemma, Llama, and Mistral) evaluated under five experimental configurations. These configurations vary along three dimensions: (i) modelling goal (suggesting vs. suggesting and estimating MNL models); (ii) prompting strategy (Zero-Shot vs. Chain-of-Thoughts (CoT)); and (iii) information availability (full dataset vs. data dictionary summarising variable names and types). Each specification suggested by the LLMs is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
