Latent Guided Sampling for Combinatorial Optimization
Sobihan Surendran (LPSM (UMR\_8001), SU), Adeline Fermanian, Sylvain Le Corff (LPSM (UMR\_8001), SU)

TL;DR
This paper introduces LGS-Net, a latent space model with an efficient sampling method for combinatorial optimization, achieving state-of-the-art results in routing tasks with theoretical convergence guarantees.
Contribution
The work presents a novel latent space model and a sampling method that improves solution quality and robustness in combinatorial optimization, with theoretical and empirical validation.
Findings
Achieves state-of-the-art performance on routing benchmarks.
Provides rigorous convergence guarantees for the sampling method.
Outperforms existing RL-based approaches in out-of-distribution instances.
Abstract
Combinatorial Optimization problems are widespread in domains such as logistics, manufacturing, and drug discovery, yet their NP-hard nature makes them computationally challenging. Recent Neural Combinatorial Optimization methods leverage deep learning to learn solution strategies, trained via Supervised or Reinforcement Learning (RL). While promising, these approaches often rely on task-specific augmentations, perform poorly on out-of-distribution instances, and lack robust inference mechanisms. Moreover, existing latent space models either require labeled data or rely on pre-trained policies. In this work, we propose LGS-Net, a novel latent space model that conditions on problem instances, and introduce an efficient inference method, Latent Guided Sampling (LGS), based on Markov Chain Monte Carlo and Stochastic Approximation. We show that the iterations of our method form a…
Peer Reviews
Decision·Submitted to ICLR 2026
The instance-conditioned latent model eliminates the need for pre-computed solutions or pre-trained policies unlike CVAE-Opt and COMPASS, and demonstrates clear performance improvements. LGS inference combines interacting MCMC with real-time parameter updates, consistently outperforming all alternatives including DE, CMA-ES, active search, and gradient-based finetuning. The convergence proof for time-inhomogeneous Markov chains is a non-trivial theoretical contribution for this problem class. Th
The adaptive chain convergence relies on Assumption 4, but the paper only provides high-level justification. There is a lack of sufficient conditions linking Algorithm 1's specific step-size choices and gradient-variance conditions, creating a gap between theory and implementation. The empirical study is limited to Euclidean routing (TSP/CVRP), restricting the demonstration of generality. Experiments on non-Euclidean problems or other combinatorial domains would help establish the versatility o
Significance: The use of inference-time strategies for combinatorial optimization (CO) problems is highly relevant. The proposed method achieves state-of-the-art performance with statistically significant improvements on reference benchmarks, evaluated on two well-studied problems (TSP and CVRP), both in- and out-of-distribution. I appreciate that the paper provides not only SOTA results but also mathematical justification for the proposed approach. Originality: Most underlying concepts already
**W1. Weak motivation and unclear justification of contributions.** The main motivation of the work remains somewhat weak. Lines 51–54 and 57–60 present the limitations of prior work (EAS and COMPASS) that LGS is supposed to address (lines 60–62). This section is crucial, as it motivates the entire paper, yet it is not fully convincing. (i) The authors claim that EAS “fine-tunes” the policy, which poses computational challenges, but LGS also performs fine-tuning. (ii) The statement that “COMPA
The paper is overall well-written, gives good credit to related works, and performs decent experiments and ablations. - Strong theoretical grounding regarding convergence. - Standard experiments on TSP and CVRP: n in [100, 125, 150]. - SOTA results on these benchmarks, even beaten the solver for CVRP 100-125.
The paper and method suffer from a few weaknesses. - Additional complexity of the inference scheme: MCMC and SA must bring a significant overhead. - Limited experimental scope: I assume this would transfer well to other combinatorial problems like job-shop scheduling or graph problems, but the paper only tests on TSP and CVRP.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Text and Document Classification Technologies
