Accelerating Quadratic Optimization with Reinforcement Learning
Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato, Goran Banjac,, Michael Luo, Francesco Borrelli, Joseph E. Gonzalez, Ion Stoica, Ken Goldberg

TL;DR
This paper introduces RLQP, a reinforcement learning-based method that learns to tune parameters for quadratic optimization, significantly accelerating convergence and outperforming existing solvers on various benchmarks.
Contribution
The paper presents RLQP, a novel reinforcement learning approach that automates parameter tuning in quadratic optimization, improving speed and generalization over traditional methods.
Findings
RLQP outperforms state-of-the-art solvers by up to 3x in benchmarks.
RLQP generalizes well to unseen problems with different structures.
Reinforcement learning effectively automates hyperparameter tuning for quadratic programming.
Abstract
First-order methods for quadratic optimization such as OSQP are widely used for large-scale machine learning and embedded optimal control, where many related problems must be rapidly solved. These methods face two persistent challenges: manual hyperparameter tuning and convergence time to high-accuracy solutions. To address these, we explore how Reinforcement Learning (RL) can learn a policy to tune parameters to accelerate convergence. In experiments with well-known QP benchmarks we find that our RL policy, RLQP, significantly outperforms state-of-the-art QP solvers by up to 3x. RLQP generalizes surprisingly well to previously unseen problems with varying dimension and structure from different applications, including the QPLIB, Netlib LP and Maros-Meszaros problems. Code for RLQP is available at https://github.com/berkeleyautomation/rlqp.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Machine Learning and Data Classification
