Coordinating Planning and Tracking in Layered Control Policies via   Actor-Critic Learning

Fengjun Yang; Nikolai Matni

arXiv:2408.01639·eess.SY·December 18, 2024

Coordinating Planning and Tracking in Layered Control Policies via Actor-Critic Learning

Fengjun Yang, Nikolai Matni

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning algorithm that jointly trains a trajectory planner and a tracking controller in layered control systems, improving coordination and interpretability.

Contribution

It presents a novel actor-critic RL approach with a dual network for coordinating planning and tracking layers, including theoretical convergence proof in LQR and empirical validation on nonlinear systems.

Findings

01

Converges to optimal dual network in LQR setting

02

Effective coordination between planning and tracking layers

03

Validated on nonlinear unicycle model simulations

Abstract

We propose a reinforcement learning (RL)-based algorithm to jointly train (1) a trajectory planner and (2) a tracking controller in a layered control architecture. Our algorithm arises naturally from a rewrite of the underlying optimal control problem that lends itself to an actor-critic learning approach. By explicitly learning a \textit{dual} network to coordinate the interaction between the planning and tracking layers, we demonstrate the ability to achieve an effective consensus between the two components, leading to an interpretable policy. We theoretically prove that our algorithm converges to the optimal dual network in the Linear Quadratic Regulator (LQR) setting and empirically validate its applicability to nonlinear systems through simulation experiments on a unicycle model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

unstable-zeros/layered-ac
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making