On the Difficulty of Generalizing Reinforcement Learning Framework for Combinatorial Optimization

Mostafa Pashazadeh; Kui Wu

arXiv:2108.03713·cs.LG·August 10, 2021


Open Access

TL;DR

This paper investigates the generalization capabilities of reinforcement learning models for combinatorial optimization, specifically applying them to the quadratic assignment problem, and finds that current models may not generalize well.

Contribution

It provides an empirical evaluation of RL-based models on a classical quadratic assignment problem, highlighting limitations in their generalization across problem classes.
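For readers unfamiliar with the quadratic assignment problem, the following is a minimal sketch of its standard objective, not code from the paper. It assumes the usual formulation in which facility `i` is placed at location `assignment[i]` and the cost sums flow times distance over all facility pairs. The matrices `flow` and `dist` and the helper names `qap_cost` and `brute_force_qap` are illustrative assumptions, and the exhaustive search is only feasible for very small instances.

```python
from itertools import permutations


def qap_cost(flow, dist, assignment):
    """Cost of placing facility i at location assignment[i].

    Sums flow[i][j] * dist[assignment[i]][assignment[j]] over all ordered pairs.
    """
    n = len(assignment)
    return sum(
        flow[i][j] * dist[assignment[i]][assignment[j]]
        for i in range(n)
        for j in range(n)
    )


def brute_force_qap(flow, dist):
    """Try every permutation; only practical for tiny n (illustration only)."""
    n = len(flow)
    return min(permutations(range(n)), key=lambda p: qap_cost(flow, dist, p))


# Hypothetical 3-facility instance (values made up for illustration).
flow = [[0, 5, 2], [5, 0, 3], [2, 3, 0]]
dist = [[0, 1, 4], [1, 0, 2], [4, 2, 0]]

best = brute_force_qap(flow, dist)
print(best, qap_cost(flow, dist, best))
```

Unlike a tour length, this cost couples every pair of placements through two matrices at once, which is one plausible reason a model tuned for routing problems may not transfer directly.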

Findings

01

RL models struggle to generalize to quadratic assignment problems

02

Existing RL approaches perform well on specific problems like TSP

03

Generalization of RL models remains a significant challenge in combinatorial optimization

Abstract

Combinatorial optimization problems (COPs) on graphs with real-life applications are canonical challenges in computer science. The difficulty of finding quality labels for problem instances holds back the use of supervised learning across combinatorial problems. Reinforcement learning (RL) algorithms have recently been adopted to address this challenge automatically. The underlying principle of this approach is to deploy a graph neural network (GNN) that encodes both the local information of the nodes and the graph-structured data in order to capture the current state of the environment. An actor then learns problem-specific heuristics on its own and makes an informed decision at each state, eventually reaching a good solution. Recent studies on this subject mainly focus on a family of combinatorial problems on graphs, such as the traveling salesman problem,…
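The step-by-step decision process described in the abstract can be sketched as a constructive rollout on a traveling salesman instance. This is a simplified illustration under stated assumptions, not the authors' model: the learned GNN encoder and actor are replaced by a hand-written nearest-neighbor rule (`nearest_neighbor_policy`, a hypothetical stand-in), the state is just the partial tour plus the set of unvisited nodes, and the reward is taken to be the negative tour length.

```python
import math


def tour_length(coords, tour):
    """Length of the closed tour that visits nodes in the given order."""
    n = len(tour)
    return sum(
        math.dist(coords[tour[k]], coords[tour[(k + 1) % n]]) for k in range(n)
    )


def nearest_neighbor_policy(coords, current, unvisited):
    """Hypothetical stand-in for a learned actor: pick the closest unvisited node."""
    return min(unvisited, key=lambda j: math.dist(coords[current], coords[j]))


def rollout(coords, policy, start=0):
    """Build a tour one decision at a time, masking already-visited nodes."""
    tour = [start]
    unvisited = set(range(len(coords))) - {start}
    while unvisited:
        nxt = policy(coords, tour[-1], unvisited)  # action chosen at this state
        tour.append(nxt)
        unvisited.remove(nxt)
    # Assumed reward convention: negative tour length, so shorter is better.
    return tour, -tour_length(coords, tour)


# Four corners of a unit square (made-up instance for illustration).
coords = [(0.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, 0.0)]
tour, reward = rollout(coords, nearest_neighbor_policy)
print(tour, reward)
```

In an RL setting the `policy` argument would be a trained network scoring the unvisited nodes; keeping it as a plain function here makes the loop structure visible without committing to any particular architecture.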

Taxonomy

Topics: Scheduling and Optimization Algorithms · Metaheuristic Optimization Algorithms Research · Auction Theory and Applications

Methods: Graph Neural Network