Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control   with Scarce Data and Side Information

Franck Djeumou; Ufuk Topcu

arXiv:2106.10533·eess.SY·December 30, 2021

Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information

Franck Djeumou, Ufuk Topcu

PDF

Open Access

TL;DR

This paper introduces a data-efficient control algorithm that leverages minimal data and side information to achieve high performance in unknown dynamical systems, demonstrated through simulations and outperforming existing methods.

Contribution

The paper presents a novel learning-based control method that effectively uses side information and a single trial to control unknown systems with minimal data.

Findings

01

Achieves comparable performance to large-data reinforcement learning methods.

02

Outperforms existing system identification and model predictive control techniques.

03

Provides theoretical bounds on suboptimality related to data and side information.

Abstract

We develop a learning-based control algorithm for unknown dynamical systems under very severe data limitations. Specifically, the algorithm has access to streaming and noisy data only from a single and ongoing trial. It accomplishes such performance by effectively leveraging various forms of side information on the dynamics to reduce the sample complexity. Such side information typically comes from elementary laws of physics and qualitative properties of the system. More precisely, the algorithm approximately solves an optimal control problem encoding the system's desired behavior. To this end, it constructs and iteratively refines a data-driven differential inclusion that contains the unknown vector field of the dynamics. The differential inclusion, used in an interval Taylor-based method, enables to over-approximate the set of states the system may reach. Theoretically, we establish a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Advanced Control Systems Optimization