Learning Hard Optimization Problems: A Data Generation Perspective
James Kotary, Ferdinando Fioretto, Pascal Van Hentenryck

TL;DR
This paper explores how data generation strategies impact the ability of machine learning models to effectively learn solutions for hard optimization problems, especially when solutions are approximate or data is volatile.
Contribution
It introduces a novel data generation approach that produces more learnable solutions for complex optimization problems, addressing challenges of solution volatility and approximation.
Findings
Improved solution quality for non-linear nonconvex problems
Enhanced model learning with the proposed data generation method
Demonstrated effectiveness on discrete combinatorial problems
Abstract
Optimization problems are ubiquitous in our societies and are present in almost every segment of the economy. Most of these optimization problems are NP-hard and computationally demanding, often requiring approximate solutions for large-scale instances. Machine learning frameworks that learn to approximate solutions to such hard optimization problems are a potentially promising avenue to address these difficulties, particularly when many closely related problem instances must be solved repeatedly. Supervised learning frameworks can train a model using the outputs of pre-solved instances. However, when the outputs are themselves approximations, when the optimization problem has symmetric solutions, and/or when the solver uses randomization, solutions to closely related instances may exhibit large differences and the learning task can become inherently more difficult. This paper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Metaheuristic Optimization Algorithms Research
