Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation
Shiying Dong, Haoyang Yang, Qiwei Liu, Wen-Hua Chen

TL;DR
This paper introduces a structure-exploiting dual control method for real-time auto-optimization in unknown environments, significantly reducing computation time and improving control performance.
Contribution
It develops a novel numerical approach that leverages the inherent convex-over-nonlinear structure of DCEE reward functions for faster, reliable online optimization.
Findings
Achieves approximately tenfold speedup over existing methods.
Demonstrates improved control performance in vehicle auto-optimization.
Attains microsecond-level computation times on embedded hardware.
Abstract
This paper develops a fast numerical dual control for exploration and exploitation (DCEE) method to address auto-optimization problems in unknown environments. In auto-optimization problems, the optimal operating condition is unknown a priori and may vary with the environment. As in classical dual control techniques, computational burden remains a major concern in DCEE for active learning. Existing DCEE methods provide a principled exploration-exploitation objective, but mainly realized through standard optimization packages or explicit gradient-type update laws, where the numerical structure of the DCEE has not been fully exploited. This paper shows that the reward function in DCEE has an inherent convex-over-nonlinear structure, where the exploitation and exploration terms form a unified nonlinear residual map equipped with a convex outer loss. Benefiting from this structure, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
