Learning Stabilizing Policies via an Unstable Subspace Representation

Leonardo F. Toso, Lintao Ye, and James Anderson

arXiv:2505.01348 · cs.LG · May 7, 2025

TL;DR

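This paper introduces a two-phase method for stabilizing an unknown linear system: it first learns the system's unstable subspace and then stabilizes the dynamics on that subspace. Working in this lower-dimensional subspace reduces the data required relative to approaches whose data requirements scale with the full state dimension, and it speeds up stabilization when the system has few unstable modes.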

Contribution

It proposes a two-phase approach that first learns the left unstable subspace of an unknown linear system and then stabilizes only the dynamics on that subspace, improving sample efficiency for control of unknown linear systems.
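
To make the role of the left unstable subspace concrete, here is a minimal sketch, not the paper's data-driven estimator. It assumes a known, hypothetical 2×2 matrix `A` with a single real unstable eigenvalue, and uses plain power iteration on the transpose of `A` as a stand-in for the learning step. If `w` is a left eigenvector (`w^T A = lam * w^T`), then the scalar `z = w^T x` evolves as `z_next = lam * z + (w^T B) u`, so only a one-dimensional system has to be stabilized. The matrices, the function name `left_unstable_direction`, and the iteration count are all illustrative assumptions.

```python
import math

def left_unstable_direction(A, iters=200):
    """Power iteration on A^T; returns (w, lam) with w^T A ~= lam * w^T.

    Assumes A has one real eigenvalue that strictly dominates in magnitude.
    """
    n = len(A)
    w = [1.0] * n
    for _ in range(iters):
        # v = A^T w, i.e. v[j] = sum_i A[i][j] * w[i]
        v = [sum(A[i][j] * w[i] for i in range(n)) for j in range(n)]
        norm = math.sqrt(sum(x * x for x in v))
        w = [x / norm for x in v]
    # Estimate the eigenvalue from the converged unit vector.
    v = [sum(A[i][j] * w[i] for i in range(n)) for j in range(n)]
    lam = sum(w[j] * v[j] for j in range(n))
    return w, lam

# Hypothetical system: eigenvalues 2.0 (unstable) and 0.5 (stable).
A = [[2.0, 1.0],
     [0.0, 0.5]]
B = [[0.0],
     [1.0]]

w, lam = left_unstable_direction(A)

# Input gain seen by the projected coordinate z = w^T x.
b_reduced = sum(w[i] * B[i][0] for i in range(len(B)))

print("left unstable direction:", w)
print("unstable eigenvalue:", lam)
print("reduced dynamics: z_next = lam * z + b_reduced * u, b_reduced =", b_reduced)
```

In this toy case the control problem shrinks from two states to one. With several unstable modes the same idea would use a basis for the whole left unstable subspace rather than a single vector.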

Findings

1. Learning the unstable subspace reduces sample complexity.
2. Faster stabilization when unstable modes are few.
3. Numerical experiments confirm theoretical advantages.

Abstract

We study the problem of learning to stabilize (LTS) a linear time-invariant (LTI) system. Policy gradient (PG) methods for control assume access to an initial stabilizing policy. However, designing such a policy for an unknown system is one of the most fundamental problems in control, and it may be as hard as learning the optimal policy itself. Existing work on the LTS problem requires large amounts of data, scaling quadratically with the ambient dimension. We propose a two-phase approach that first learns the left unstable subspace of the system and then solves a series of discounted linear quadratic regulator (LQR) problems on the learned unstable subspace, aiming to stabilize only the system's unstable dynamics and to reduce the effective dimension of the control space. We provide non-asymptotic guarantees for both phases and demonstrate that operating on the unstable subspace reduces…
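
The "series of discounted LQR problems" can be illustrated with a small scalar sketch. This is not the paper's algorithm: the paper uses policy gradient on data from an unknown system, whereas the sketch below assumes a known, hypothetical scalar system `x_next = a*x + b*u` and swaps in the closed-form scalar Riccati solution. It relies on the standard fact that LQR with discount factor `gamma` is equivalent to undiscounted LQR on the damped system `(sqrt(gamma)*a, sqrt(gamma)*b)`. The discount update rule, the `margin` parameter, and the function names are assumptions made for illustration.

```python
import math

def discounted_lqr_gain(a, b, q, r, gamma):
    """Optimal gain k (u = -k x) for scalar LQR with discount gamma.

    Solves the scalar discounted Riccati equation, which reduces to the
    quadratic c2*p^2 + c1*p + c0 = 0, and takes its positive root.
    Requires b != 0.
    """
    c2 = gamma * b * b
    c1 = r - gamma * (q * b * b + a * a * r)
    c0 = -q * r
    p = (-c1 + math.sqrt(c1 * c1 - 4.0 * c2 * c0)) / (2.0 * c2)
    return gamma * a * b * p / (r + gamma * b * b * p)

def stabilize_by_discounting(a, b, q=1.0, r=1.0, margin=0.9, max_stages=50):
    """Raise gamma stage by stage until the undiscounted problem is reached."""
    k = 0.0  # zero gain: stabilizing only for a sufficiently small gamma
    history = []
    for _ in range(max_stages):
        closed_loop = a - b * k
        # Largest gamma (capped at 1) for which the current gain still
        # stabilizes the damped system: gamma * closed_loop^2 = margin < 1.
        gamma = 1.0 if closed_loop == 0.0 else min(1.0, margin / closed_loop ** 2)
        k = discounted_lqr_gain(a, b, q, r, gamma)
        history.append((gamma, k))
        if gamma == 1.0:
            break
    return k, history

# Hypothetical unstable scalar system (a > 1).
a, b = 3.0, 1.0
k, history = stabilize_by_discounting(a, b)

for gamma, gain in history:
    print(f"gamma = {gamma:.4f}, gain = {gain:.4f}, closed loop = {a - b * gain:.4f}")
```

Each stage starts from a gain that is already stabilizing for the current damped system, which is the property a policy gradient method would need as its initialization. In the scalar case the Riccati solution makes the stages trivial to solve, and the schedule ends at `gamma = 1` with a gain that stabilizes the original system.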

Code & Models

Repositories

jd-anderson/LTS-unstable-representation (official)

Taxonomy

Topics: Model Reduction and Neural Networks