Model Identification Adaptive Control with $\rho$-POMDP Planning

Michelle Ho; Arec Jamgochian; Mykel J. Kochenderfer

arXiv:2505.09119·cs.RO·May 26, 2025

Model Identification Adaptive Control with $\rho$-POMDP Planning

Michelle Ho, Arec Jamgochian, Mykel J. Kochenderfer

PDF

Open Access

TL;DR

This paper introduces a novel belief-space planning approach using $ ho$-POMDPs and BiLQR for adaptive control and system identification under partial observability, improving accuracy and safety.

Contribution

It formulates system identification adaptive control as a belief space planning problem with $ ho$-POMDPs and solves it using an adapted BiLQR, demonstrating superior performance.

Findings

01

Outperforms baseline methods in system identification accuracy.

02

Effective under partial observability and disturbances.

03

Applicable to cart-pole and aircraft flight domains.

Abstract

Accurate system modeling is crucial for safe, effective control, as misidentification can lead to accumulated errors, especially under partial observability. We address this problem by formulating informative input design and model identification adaptive control (MIAC) as belief space planning problems, modeled as partially observable Markov decision processes with belief-dependent rewards ( $ρ$ -POMDPs). We treat system parameters as hidden state variables that must be localized while simultaneously controlling the system. We solve this problem with an adapted belief-space iterative Linear Quadratic Regulator (BiLQR). We demonstrate it on fully and partially observable tasks for cart-pole and steady aircraft flight domains. Our method outperforms baselines such as regression, filtering, and local optimal control methods, even under instantaneous disturbances to system parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Control Systems and Identification · Iterative Learning Control Systems