Accelerating Quadratic Optimization with Reinforcement Learning

Jeffrey Ichnowski; Paras Jain; Bartolomeo Stellato; Goran Banjac,; Michael Luo; Francesco Borrelli; Joseph E. Gonzalez; Ion Stoica; Ken Goldberg

arXiv:2107.10847·cs.LG·July 23, 2021·20 cites

Accelerating Quadratic Optimization with Reinforcement Learning

Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac,, Michael Luo, Francesco Borrelli, Joseph E. Gonzalez, Ion Stoica, Ken Goldberg

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces RLQP, a reinforcement learning-based method that learns to tune parameters for quadratic optimization, significantly accelerating convergence and outperforming existing solvers on various benchmarks.

Contribution

The paper presents RLQP, a novel reinforcement learning approach that automates parameter tuning in quadratic optimization, improving speed and generalization over traditional methods.

Findings

01

RLQP outperforms state-of-the-art solvers by up to 3x in benchmarks.

02

RLQP generalizes well to unseen problems with different structures.

03

Reinforcement learning effectively automates hyperparameter tuning for quadratic programming.

Abstract

First-order methods for quadratic optimization such as OSQP are widely used for large-scale machine learning and embedded optimal control, where many related problems must be rapidly solved. These methods face two persistent challenges: manual hyperparameter tuning and convergence time to high-accuracy solutions. To address these, we explore how Reinforcement Learning (RL) can learn a policy to tune parameters to accelerate convergence. In experiments with well-known QP benchmarks we find that our RL policy, RLQP, significantly outperforms state-of-the-art QP solvers by up to 3x. RLQP generalizes surprisingly well to previously unseen problems with varying dimension and structure from different applications, including the QPLIB, Netlib LP and Maros-Meszaros problems. Code for RLQP is available at https://github.com/berkeleyautomation/rlqp.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

berkeleyautomation/rlqp
noneOfficial

Videos

Accelerating Quadratic Optimization with Reinforcement Learning· slideslive

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Data Classification