When Words Change the Model: Sensitivity of LLMs for Constraint Programming Modelling
Alessio Pellegrino, Jacopo Mauro

TL;DR
This paper investigates how large language models' ability to generate constraint programming models is sensitive to wording changes, revealing limitations in their understanding and robustness.
Contribution
It systematically evaluates LLMs' robustness to rephrased problem descriptions, highlighting their sensitivity and shallow understanding in constraint programming modeling.
Findings
LLMs produce plausible models but are sensitive to wording changes
Performance drops significantly with rephrased or misleading problem descriptions
Models show shallow understanding rather than deep reasoning
Abstract
One of the long-standing goals in optimisation and constraint programming is to describe a problem in natural language and automatically obtain an executable, efficient model. Large language models appear to bring this vision closer, showing impressive results in automatically generating models for classical benchmarks. However, much of this apparent success may derive from data contamination rather than genuine reasoning: many standard CP problems are likely included in the training data of these models. To examine this hypothesis, we systematically rephrased and perturbed a set of well-known CSPLib problems to preserve their structure while modifying their context and introducing misleading elements. We then compared the models produced by three representative LLMs across original and modified descriptions. Our qualitative analysis shows that while LLMs can produce syntactically valid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · Topic Modeling · Model-Driven Software Engineering Techniques
