Safety-Critical Contextual Control via Online Riemannian Optimization with World Models
Tongxin Li

TL;DR
This paper introduces a novel online Riemannian optimization framework for safety-critical control using world models, enabling better safety and performance in complex, black-box environments.
Contribution
It develops a sample-based Penalized Predictive Control method that leverages Riemannian geometry of feasibility manifolds for improved safety and convergence guarantees.
Findings
Simulations show the approach outperforms marginal and frozen density models.
The safety bound depends on the curvature and score estimation error, improving with richer context.
The method effectively adapts after environment shifts.
Abstract
Modern world models are becoming too complex to admit explicit dynamical descriptions. We study safety-critical contextual control, where a Planner must optimize a task objective using only feasibility samples from a black-box Simulator, conditioned on a context signal . We develop a sample-based Penalized Predictive Control (PPC) framework grounded in online Riemannian optimization, in which the Simulator compresses the feasibility manifold into a score-based density that endows the action space with a Riemannian geometry guiding the Planner's gradient descent. The barrier curvature , the minimum curvature of the conditional log-density , governs both convergence rate and safety margin, replacing the Lipschitz constant of the unknown dynamics. Our main result is a contextual safety bound showing that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
