Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods
Constantine Caramanis, Dimitris Fotakis, Alkis Kalavasis, Vasilis, Kontonis, Christos Tzamos

TL;DR
This paper develops a theoretical framework analyzing policy-gradient methods for combinatorial problems, demonstrating the existence of expressive, tractable generative models with benign optimization landscapes, and introduces a regularization technique to improve training.
Contribution
It provides the first theoretical analysis showing the existence of effective generative models for combinatorial problems and proposes a novel regularization method to enhance gradient-based training.
Findings
Existence of expressive, tractable generative models for combinatorial problems.
Benign optimization landscapes without sub-optimal stationary points.
Regularization improves gradient descent effectiveness and escapes bad stationary points.
Abstract
Deep Neural Networks and Reinforcement Learning methods have empirically shown great promise in tackling challenging combinatorial problems. In those methods a deep neural network is used as a solution generator which is then trained by gradient-based methods (e.g., policy gradient) to successively obtain better solution distributions. In this work we introduce a novel theoretical framework for analyzing the effectiveness of such methods. We ask whether there exist generative models that (i) are expressive enough to generate approximately optimal solutions; (ii) have a tractable, i.e, polynomial in the size of the input, number of parameters; (iii) their optimization landscape is benign in the sense that it does not contain sub-optimal stationary points. Our main contribution is a positive answer to this question. Our result holds for a broad class of combinatorial problems including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Advanced Bandit Algorithms Research
