Temporal Parallelisation of Dynamic Programming and Linear Quadratic   Control

Simo S\"arkk\"a; \'Angel F. Garc\'ia-Fern\'andez

arXiv:2104.03186·math.OC·January 25, 2022

Temporal Parallelisation of Dynamic Programming and Linear Quadratic Control

Simo S\"arkk\"a, \'Angel F. Garc\'ia-Fern\'andez

PDF

Open Access

TL;DR

This paper introduces a general framework for parallelizing dynamic programming in optimal control, significantly reducing computation time from linear to logarithmic using parallel scans, with applications to various control problems.

Contribution

It develops a novel formulation enabling parallel computation of dynamic programming, applicable to finite, linear quadratic, and certain nonlinear control problems, demonstrating substantial computational speedups.

Findings

01

Parallel algorithms achieve logarithmic complexity.

02

Numerical simulations confirm efficiency on GPU.

03

Applicable to diverse control problem classes.

Abstract

This paper proposes a general formulation for temporal parallelisation of dynamic programming for optimal control problems. We derive the elements and associative operators to be able to use parallel scans to solve these problems with logarithmic time complexity rather than linear time complexity. We apply this methodology to problems with finite state and control spaces, linear quadratic tracking control problems, and to a class of nonlinear control problems. The computational benefits of the parallel methods are demonstrated via numerical simulations run on a graphics processing unit.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsConstraint Satisfaction and Optimization · Reinforcement Learning in Robotics · Advanced Control Systems Optimization