Reinforcement Learning with Combinatorial Actions: An Application to   Vehicle Routing

Arthur Delarue; Ross Anderson; Christian Tjandraatmadja

arXiv:2010.12001·cs.LG·October 26, 2020·46 cites

Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing

Arthur Delarue, Ross Anderson, Christian Tjandraatmadja

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a deep reinforcement learning framework for combinatorial action spaces, applying it to vehicle routing and achieving competitive results with traditional optimization methods.

Contribution

It formulates action selection as a mixed-integer optimization problem within deep RL, specifically addressing large combinatorial action spaces in vehicle routing.

Findings

01

Achieves an average gap of 1.7% on standard CVRP instances.

02

Framework is competitive with state-of-the-art optimization methods.

03

Demonstrates effectiveness of combining RL with combinatorial optimization.

Abstract

Value-function-based methods have long played an important role in reinforcement learning. However, finding the best next action given a value function of arbitrary complexity is nontrivial when the action space is too large for enumeration. We develop a framework for value-function-based deep reinforcement learning with a combinatorial action space, in which the action selection problem is explicitly formulated as a mixed-integer optimization problem. As a motivating example, we present an application of this framework to the capacitated vehicle routing problem (CVRP), a combinatorial optimization problem in which a set of locations must be covered by a single vehicle with limited capacity. On each instance, we model an action as the construction of a single route, and consider a deterministic policy which is improved through a simple policy iteration algorithm. Our approach is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/tf-opt
tfOfficial

Videos

Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Modular Robots and Swarm Intelligence · Traffic control and management