Behavior and Representation in Large Language Models for Combinatorial Optimization: From Feature Extraction to Algorithm Selection

Francesca Da Ros; Luca Di Gaspero; Kevin Roitero

arXiv:2512.13374·cs.AI·December 16, 2025

Behavior and Representation in Large Language Models for Combinatorial Optimization: From Feature Extraction to Algorithm Selection

Francesca Da Ros, Luca Di Gaspero, Kevin Roitero

PDF

Open Access

TL;DR

This paper explores how large language models understand and represent combinatorial optimization problems, revealing their potential to support feature extraction and algorithm selection through internal representations.

Contribution

It introduces a probing framework to analyze LLMs' internal encoding of problem features and demonstrates their capability to predict optimal solvers for various benchmark problems.

Findings

01

LLMs can moderately recover problem features via direct querying.

02

Hidden-layer representations of LLMs are as effective as traditional features for prediction.

03

LLMs encode meaningful structural information relevant to optimization tasks.

Abstract

Recent advances in Large Language Models (LLMs) have opened new perspectives for automation in optimization. While several studies have explored how LLMs can generate or solve optimization models, far less is understood about what these models actually learn regarding problem structure or algorithmic behavior. This study investigates how LLMs internally represent combinatorial optimization problems and whether such representations can support downstream decision tasks. We adopt a twofold methodology combining direct querying, which assesses LLM capacity to explicitly extract instance features, with probing analyses that examine whether such information is implicitly encoded within their hidden layers. The probing framework is further extended to a per-instance algorithm selection task, evaluating whether LLM-derived representations can predict the best-performing solver. Experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Natural Language Processing Techniques · Topic Modeling