Equipping Black-Box Policies with Model-Based Advice for Stable Nonlinear Control
Tongxin Li, Ruixiao Yang, Guannan Qu, Yiheng Lin, Steven Low, Adam, Wierman

TL;DR
This paper introduces an adaptive confidence-based method to combine black-box policies with model-based advice for nonlinear control, ensuring stability and improved performance on real-world and simulated problems.
Contribution
It proposes an adaptive λ-confident policy that guarantees stability and bounded competitiveness, along with an online learning approach for practical implementation.
Findings
The naive convex combination can cause instability.
The λ-confident policy guarantees stability under certain conditions.
The approach performs well in CartPole and EV charging case studies.
Abstract
Machine-learned black-box policies are ubiquitous for nonlinear control problems. Meanwhile, crude model information is often available for these problems from, e.g., linear approximations of nonlinear dynamics. We study the problem of equipping a black-box control policy with model-based advice for nonlinear control on a single trajectory. We first show a general negative result that a naive convex combination of a black-box policy and a linear model-based policy can lead to instability, even if the two policies are both stabilizing. We then propose an adaptive -confident policy, with a coefficient indicating the confidence in a black-box policy, and prove its stability. With bounded nonlinearity, in addition, we show that the adaptive -confident policy achieves a bounded competitive ratio when a black-box policy is near-optimal. Finally, we propose an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Advanced Bandit Algorithms Research · Electric Vehicles and Infrastructure
