Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

Shiying Dong; Haoyang Yang; Qiwei Liu; Wen-Hua Chen

arXiv:2605.22431·cs.RO·May 22, 2026

Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation

Shiying Dong, Haoyang Yang, Qiwei Liu, Wen-Hua Chen

PDF

TL;DR

This paper introduces a structure-exploiting dual control method for real-time auto-optimization in unknown environments, significantly reducing computation time and improving control performance.

Contribution

It develops a novel numerical approach that leverages the inherent convex-over-nonlinear structure of DCEE reward functions for faster, reliable online optimization.

Findings

01

Achieves approximately tenfold speedup over existing methods.

02

Demonstrates improved control performance in vehicle auto-optimization.

03

Attains microsecond-level computation times on embedded hardware.

Abstract

This paper develops a fast numerical dual control for exploration and exploitation (DCEE) method to address auto-optimization problems in unknown environments. In auto-optimization problems, the optimal operating condition is unknown a priori and may vary with the environment. As in classical dual control techniques, computational burden remains a major concern in DCEE for active learning. Existing DCEE methods provide a principled exploration-exploitation objective, but mainly realized through standard optimization packages or explicit gradient-type update laws, where the numerical structure of the DCEE has not been fully exploited. This paper shows that the reward function in DCEE has an inherent convex-over-nonlinear structure, where the exploitation and exploration terms form a unified nonlinear residual map equipped with a convex outer loss. Benefiting from this structure, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.