Learning to control inexact Benders decomposition via reinforcement learning

Zhe Li; Bernard T. Agyeman; Ilias Mitrai; Prodromos Daoutidis

arXiv:2508.06700·math.OC·August 12, 2025·Comput. Chem. Eng.

Learning to control inexact Benders decomposition via reinforcement learning

Zhe Li, Bernard T. Agyeman, Ilias Mitrai, Prodromos Daoutidis

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach to adaptively control the optimality gap in inexact generalized Benders decomposition, significantly reducing solution times for large-scale mixed-integer problems.

Contribution

It develops an RL-based policy to dynamically determine optimality gaps in inexact GBD, improving efficiency over traditional fixed-gap methods.

Findings

01

RL-iGBD reduces solution time substantially.

02

The learned policy adapts to problem features and solution progress.

03

In experiments, RL-iGBD outperforms standard approaches.

Abstract

Benders decomposition (BD), along with its generalized version (GBD), is a widely used algorithm for solving large-scale mixed-integer optimization problems that arise in the operation of process systems. However, the off-the-shelf application to online settings can be computationally inefficient due to the repeated solution of the master problem. An approach to reduce the solution time is to solve the master problem to local optimality. However, identifying the level of suboptimality at each iteration that minimizes the total solution time is nontrivial. In this paper, we propose the application of reinforcement learning to determine the best optimality gap at each GBD iteration. First, we show that the inexact GBD can converge to the optimal solution given a properly designed optimality gap schedule. Next, leveraging reinforcement learning, we learn a policy that minimizes the total…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Reinforcement Learning in Robotics · Process Optimization and Integration