Distributionally Robust Path Integral Control

Hyuk Park; Duo Zhou; Grani A. Hanasusanto; Takashi Tanaka

arXiv:2310.01633·eess.SY·October 4, 2023

Distributionally Robust Path Integral Control

Hyuk Park, Duo Zhou, Grani A. Hanasusanto, Takashi Tanaka

PDF

Open Access

TL;DR

This paper introduces Distributionally Robust Path Integral (DRPI), a novel control method that enhances policy robustness in stochastic control problems with limited data by leveraging distributionally robust optimization within the path integral framework.

Contribution

The paper develops DRPI, integrating distributionally robust optimization with path integral control, providing theoretical guarantees and improved performance over risk-neutral policies.

Findings

01

DRPI outperforms risk-neutral PIC in uncertain diffusion scenarios.

02

Theoretical guarantees align with risk-sensitive control parameters.

03

DRPI offers a robust alternative when diffusion process knowledge is limited.

Abstract

We consider a continuous-time continuous-space stochastic optimal control problem, where the controller lacks exact knowledge of the underlying diffusion process, relying instead on a finite set of historical disturbance trajectories. In situations where data collection is limited, the controller synthesized from empirical data may exhibit poor performance. To address this issue, we introduce a novel approach named Distributionally Robust Path Integral (DRPI). The proposed method employs distributionally robust optimization (DRO) to robustify the resulting policy against the unknown diffusion process. Notably, the DRPI scheme shows similarities with risk-sensitive control, which enables us to utilize the path integral control (PIC) framework as an efficient solution scheme. We derive theoretical performance guarantees for the DRPI scheme, which closely aligns with selecting a risk…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization

MethodsDiffusion