Parallel Interior-Point Solver for Block-Structured Nonlinear Programs   on SIMD/GPU Architectures

Fran\c{c}ois Pacaud; Michel Schanen; Sungho Shin; Daniel Adrian; Maldonado; Mihai Anitescu

arXiv:2301.04869·math.OC·January 13, 2023·Optim. Methods Softw.

Parallel Interior-Point Solver for Block-Structured Nonlinear Programs on SIMD/GPU Architectures

Fran\c{c}ois Pacaud, Michel Schanen, Sungho Shin, Daniel Adrian, Maldonado, Mihai Anitescu

PDF

Open Access

TL;DR

This paper presents a parallel interior-point method optimized for SIMD/GPU architectures to efficiently solve block-structured nonlinear programs, demonstrating significant speed-ups on exascale hardware.

Contribution

The paper introduces a two-level parallelization approach for interior-point methods tailored for SIMD/GPU architectures, reducing memory use and accelerating computations for large-scale nonlinear programs.

Findings

01

Achieves 50x speed-up over existing methods.

02

Effectively utilizes multi-GPU supercomputers for complex problems.

03

Demonstrates scalability on exascale architectures.

Abstract

We investigate how to port the standard interior-point method to new exascale architectures for block-structured nonlinear programs with state equations. Computationally, we decompose the interior-point algorithm into two successive operations: the evaluation of the derivatives and the solution of the associated Karush-Kuhn-Tucker (KKT) linear system. Our method accelerates both operations using two levels of parallelism. First, we distribute the computations on multiple processes using coarse parallelism. Second, each process uses a SIMD/GPU accelerator locally to accelerate the operations using fine-grained parallelism. The KKT system is reduced by eliminating the inequalities and the state variables from the corresponding equations, to a dense matrix encoding the sensitivities of the problem's degrees of freedom, drastically minimizing the memory exchange. We demonstrate the method's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Optimization Algorithms Research · Matrix Theory and Algorithms · Parallel Computing and Optimization Techniques