Can large language models assist choice modelling? Insights into prompting strategies and current models capabilities

Georges Sfeir; Gabriel Nova; Stephane Hess; Sander van Cranenburgh

arXiv:2507.21790·econ.EM·March 18, 2026

Can large language models assist choice modelling? Insights into prompting strategies and current models capabilities

Georges Sfeir, Gabriel Nova, Stephane Hess, Sander van Cranenburgh

PDF

TL;DR

This study explores the potential of large language models to assist in discrete choice modelling, evaluating their ability to suggest and estimate models using various prompting strategies and model configurations.

Contribution

It systematically assesses multiple LLMs' capabilities in specifying and estimating Multinomial Logit models, highlighting the effectiveness of structured prompts and identifying current limitations.

Findings

01

Proprietary LLMs can generate valid utility specifications.

02

Structured prompts improve model specification quality.

03

Open-weight models like Llama struggle with meaningful specifications.

Abstract

Large Language Models (LLMs) are becoming widely used to support various workflows across different disciplines, yet their potential in discrete choice modelling remains relatively unexplored. This work examines the potential of LLMs as assistive agents in the specification and, where technically feasible, estimation of Multinomial Logit models. We implement a systematic experimental framework involving twelve versions of seven leading LLMs (ChatGPT, Claude, DeepSeek, Gemini, Gemma, Llama, and Mistral) evaluated under five experimental configurations. These configurations vary along three dimensions: (i) modelling goal (suggesting vs. suggesting and estimating MNL models); (ii) prompting strategy (Zero-Shot vs. Chain-of-Thoughts (CoT)); and (iii) information availability (full dataset vs. data dictionary summarising variable names and types). Each specification suggested by the LLMs is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.