Beyond Quadratic Costs in LQR: Bregman Divergence Control

Babak Hassibi; Joudi Hajar; Reza Ghane

arXiv:2505.00317·eess.SY·May 2, 2025

Beyond Quadratic Costs in LQR: Bregman Divergence Control

Babak Hassibi, Joudi Hajar, Reza Ghane

PDF

TL;DR

This paper extends LQR control to a broader class of convex costs derived from Bregman divergence, enabling nonlinear feedback laws with stability guarantees for diverse control scenarios.

Contribution

It introduces a novel framework for control with Bregman divergence-based convex costs, generalizing quadratic cost methods to nonlinear, stable controllers.

Findings

01

Controllers are nonlinear and stable.

02

Applicable to safety, sparse, and bang-bang control.

03

Provides a full extension of quadratic LQR framework.

Abstract

In the past couple of decades, the use of ``non-quadratic" convex cost functions has revolutionized signal processing, machine learning, and statistics, allowing one to customize solutions to have desired structures and properties. However, the situation is not the same in control where the use of quadratic costs still dominates, ostensibly because determining the ``value function", i.e., the optimal expected cost-to-go, which is critical to the construction of the optimal controller, becomes computationally intractable as soon as one considers general convex costs. As a result, practitioners often resort to heuristics and approximations, such as model predictive control that only looks a few steps into the future. In the quadratic case, the value function is easily determined by solving Riccati equations. In this work, we consider a special class of convex cost functions constructed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.