Solving the Quadratic Assignment Problem using Deep Reinforcement   Learning

Puneet S. Bagga; Arthur Delarue

arXiv:2310.01604·cs.LG·October 4, 2023

Solving the Quadratic Assignment Problem using Deep Reinforcement Learning

Puneet S. Bagga, Arthur Delarue

PDF

Open Access

TL;DR

This paper introduces a deep reinforcement learning approach with a novel double pointer network to solve the NP-hard Quadratic Assignment Problem, achieving near-optimal solutions without instance-specific retraining.

Contribution

It presents a new deep RL method with a double pointer network for QAP, capable of solving large instances efficiently and accurately.

Findings

01

Solutions are on average within 7.5% of a high-quality baseline.

02

The method outperforms the baseline on 1.2% of instances.

03

No instance-specific retraining is required for out-of-sample solutions.

Abstract

The Quadratic Assignment Problem (QAP) is an NP-hard problem which has proven particularly challenging to solve: unlike other combinatorial problems like the traveling salesman problem (TSP), which can be solved to optimality for instances with hundreds or even thousands of locations using advanced integer programming techniques, no methods are known to exactly solve QAP instances of size greater than 30. Solving the QAP is nevertheless important because of its many critical applications, such as electronic wiring design and facility layout selection. We propose a method to solve the original Koopmans-Beckman formulation of the QAP using deep reinforcement learning. Our approach relies on a novel double pointer network, which alternates between selecting a location in which to place the next facility and a facility to place in the previous location. We train our model using A2C on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Routing Optimization Methods · Metaheuristic Optimization Algorithms Research · Auction Theory and Applications

MethodsA2C