Learning with Local Search MCMC Layers
Germain Vivier-Ardisson, Mathieu Blondel, Axel Parmentier

TL;DR
This paper introduces a theoretically grounded method for integrating local search heuristics into neural networks using MCMC, enabling differentiable combinatorial layers and efficient learning for complex NP-hard problems.
Contribution
It proposes transforming local search heuristics into proposal distributions for MCMC, providing a principled way to learn with inexact combinatorial solvers.
Findings
Reduces computational burden in learning for combinatorial problems
Demonstrates effectiveness on large-scale vehicle routing problem
Provides a differentiable layer for combinatorial optimization
Abstract
Integrating combinatorial optimization layers into neural networks has recently attracted significant research interest. However, many existing approaches lack theoretical guarantees or fail to perform adequately when relying on inexact solvers. This is a critical limitation, as many operations research problems are NP-hard, often necessitating the use of neighborhood-based local search heuristics. These heuristics iteratively generate and evaluate candidate solutions based on an acceptance rule. In this paper, we introduce a theoretically-principled approach for learning with such inexact combinatorial solvers. Inspired by the connection between simulated annealing and Metropolis-Hastings, we propose to transform problem-specific neighborhood systems used in local search heuristics into proposal distributions, implementing MCMC on the combinatorial space of feasible solutions. This…
Peer Reviews
Decision·Submitted to ICLR 2026
- The core idea of the paper is strong, as it successfully combines two well-established concepts, local search and Markov Chain Monte Carlo, into a coherent framework for differentiable combinatorial optimisation. As well, it is both general and flexible as it supports multiple neighborhood systems and extends beyond prior works. - The work offers rigorous analysis and solid theoretical guarantees for the proposed framework. - Despite the technical depth, the paper is clear, well structured, an
My main concerns are with the empirical section: - The authors provide empirical results on two cases (dynamic vehicle routing and binary vector prediction), which was effective to show the strength of the claims but yet lack extra experiments to prove the generalisation of the framework to other combinatorial problems aside from routing. - The evaluation comparison on dynamic vehicles tasks relies solely on the EURO Meets NeurIPS 2022 PC-HGS–based baseline, which, though strong, does limit the
1. Well-motivated problem formulation: the paper clearly identifies the key limitation of existing differentiable combinatorial optimization approaches and formulates the challenge of learning with inexact local-search-based oracles in a principled manner. 2. Theoretical guarantees: the proposed method is supported by rigorous theoretical guarantees, including convergence analyses and connections to Fenchel–Young losses. This provides confidence in the soundness and reliability of the approach,
1. Limited coverage of application tasks: although the proposed approach is claimed to be general, experiments are restricted to DVRPTW and a toy binary vector prediction task. As a result, the empirical validation of generality remains narrow. 2. Sparse comparison with existing methods: Table 1 summarizes a broad landscape of differentiable combinatorial optimization methods, yet the experiments compare only against perturbed optimizers (Berthet et al., 2020). While the authors may argue that o
- The paper proposes a novel approach to improving local search performance using a differentiable MCMC layer, which is technically interesting. - It demonstrates the ability to solve problems with complex constraints, such as VRPTW, within a short amount of time.
- Similar to traditional heuristic-based local search methods, the performance of the proposed approach heavily depends on how well the local search component is designed. In this sense, the paper does not improve prior NCO methods to better handle complex constraints; rather, it proposes a new type of meta-heuristic that utilizes a differentiable MCMC layer. Considering the ongoing trend of incorporating neural networks more actively in combinatorial optimization, the long-term impact of this w
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Algorithms
