OR-Gym: A Reinforcement Learning Library for Operations Research Problems

Christian D. Hubbs; Hector D. Perez; Owais Sarwar; Nikolaos V. Sahinidis; Ignacio E. Grossmann; John M. Wassick

arXiv:2008.06319·cs.AI·October 20, 2020·26 cites

TL;DR

This paper introduces OR-Gym, an open-source reinforcement learning library tailored for operations research problems, demonstrating its application and benchmarking against traditional optimization methods.

Contribution

The paper presents OR-Gym, a novel library that adapts classic OR problems into RL environments, enabling new approaches and benchmarking in operations research.

Findings

01. RL solutions outperform heuristics in certain problems

02. Benchmarking shows RL can be competitive with MILP methods

03. OR-Gym facilitates cross-disciplinary research in RL and OR

Abstract

Reinforcement learning (RL) has been widely applied to game-playing and has surpassed the best human-level performance in many domains, yet there are few use-cases in industrial or commercial settings. We introduce OR-Gym, an open-source library for developing reinforcement learning algorithms to address operations research problems. In this paper, we apply reinforcement learning to the knapsack, multi-dimensional bin packing, multi-echelon supply chain, and multi-period asset allocation model problems, and benchmark the RL solutions against MILP and heuristic models. These problems are used in logistics, finance, and engineering, and are common in many business operation settings. We develop environments based on prototypical models in the literature and implement various optimization and heuristic models in order to benchmark the RL results. By re-framing a series of classic…
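
As a rough illustration of what re-framing a classic OR problem as an RL environment might look like, the sketch below casts an unbounded knapsack problem as an episodic environment and runs a value-to-weight greedy heuristic against it as a baseline. This is not OR-Gym's API: the class `ToyKnapsackEnv`, the helpers `greedy_policy`, `random_policy`, and `run_episode`, the reward scheme, and the item data are all hypothetical and chosen only for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class ToyKnapsackEnv:
    """Hypothetical unbounded-knapsack environment (not OR-Gym's API)."""
    weights: list
    values: list
    capacity: int
    load: int = field(default=0, init=False)

    def reset(self):
        # Start each episode with an empty knapsack.
        self.load = 0
        return self.load

    def step(self, action):
        # The action is the index of the item to add.
        weight = self.weights[action]
        if self.load + weight > self.capacity:
            # Assumed rule: an overfilling action ends the episode with no reward.
            return self.load, 0, True
        self.load += weight
        # The episode ends once no further item can fit.
        done = all(self.load + w > self.capacity for w in self.weights)
        return self.load, self.values[action], done


def greedy_policy(env):
    # Heuristic baseline: best value-to-weight ratio among items that still fit.
    fitting = [i for i, w in enumerate(env.weights) if env.load + w <= env.capacity]
    if not fitting:
        return None
    return max(fitting, key=lambda i: env.values[i] / env.weights[i])


def random_policy(env):
    # Stand-in for an untrained agent: pick any item uniformly at random.
    return random.randrange(len(env.weights))


def run_episode(env, policy):
    # Roll out one episode and return the total value packed.
    env.reset()
    total, done = 0, False
    while not done:
        action = policy(env)
        if action is None:
            break
        _, reward, done = env.step(action)
        total += reward
    return total


env = ToyKnapsackEnv(weights=[2, 3, 5], values=[3, 4, 8], capacity=10)
print("greedy:", run_episode(env, greedy_policy))
print("random:", run_episode(env, random_policy))
```

A trained RL agent would take the place of `random_policy`, and its episode return could then be compared with the heuristic's return and with an exact MILP solution, in the spirit of the benchmarking the abstract describes.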

Taxonomy

Topics: Scheduling and Optimization Algorithms · Supply Chain and Inventory Management · Assembly Line Balancing Optimization