Deep Distributed Optimization for Large-Scale Quadratic Programming
Augustinos D. Saravanos, Hunter Kuperman, Alex Oshin, Arshiya Taj, Abdul, Vincent Pacelli, Evangelos A. Theodorou

TL;DR
This paper presents a deep learning-enhanced distributed optimization framework for large-scale quadratic programming, combining theoretical guarantees with practical improvements in speed and solution quality across diverse applications.
Contribution
It introduces DeepDistributedQP, a novel deep learning-based method that accelerates large-scale QP solving with proven convergence and strong generalization capabilities.
Findings
Outperforms standard QP solvers in speed and accuracy
Successfully scales to problems with 50K variables and 150K constraints
Provides theoretical generalization bounds and performance guarantees
Abstract
Quadratic programming (QP) forms a crucial foundation in optimization, encompassing a broad spectrum of domains and serving as the basis for more advanced algorithms. Consequently, as the scale and complexity of modern applications continue to grow, the development of efficient and reliable QP algorithms is becoming increasingly vital. In this context, this paper introduces a novel deep learning-aided distributed optimization architecture designed for tackling large-scale QP problems. First, we combine the state-of-the-art Operator Splitting QP (OSQP) method with a consensus approach to derive DistributedQP, a new method tailored for network-structured problems, with convergence guarantees to optimality. Subsequently, we unfold this optimizer into a deep learning framework, leading to DeepDistributedQP, which leverages learned policies to accelerate reaching to desired accuracy within a…
Peer Reviews
Decision·ICLR 2025 Poster
Using solved QP problems to train the deep learning model, the authors are able to select optimal hyperparameters that greatly improve the convergence for solving new distributed QP problems.
A limitation of this approach seems to be the amount of training data required to train the neural network.
The mathematical aspects of the paper are technically correct.
The primary concern revolves around the novelty of the unfolded network. A similar approach can be found in [1], which also predicts parameters for ADMM in conventional OSQP using reinforcement learning, where this work seems to be a straightforward substitution of tuning efforts with basic MLPs. This also raises questions about the fairness and completeness of the experiments, as the authors only compare their method against OSQP without considering other learning-based frameworks that could
1. The organization of the paper is clear -- the authors first introduce distributed version of operator splitting, and then combine explain how to incorporate deep learning techniques into the proposed framework to solve optimization problems more effectively. 2. The theoretical proof is well-structured and easy to follow.
Major: 1. (Lack of Motivation) In DEEPDISTRQP framework, the authors aimed to combine distributed optimization, in particular distributed quadratic programming, with deep learning. However, it lacks motivation of why one should study this type of problems. It is true that distributed QP part has the advantage of interpretability, while the deep learning part has stronger generalization capabilities. However, the experiments are only limited to relatively small-scale problems like optimal contro
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Distributed and Parallel Computing Systems · Advanced Control Systems Optimization
