Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information
Franck Djeumou, Ufuk Topcu

TL;DR
This paper introduces a data-efficient control algorithm that leverages minimal data and side information to achieve high performance in unknown dynamical systems, demonstrated through simulations and outperforming existing methods.
Contribution
The paper presents a novel learning-based control method that effectively uses side information and a single trial to control unknown systems with minimal data.
Findings
Achieves comparable performance to large-data reinforcement learning methods.
Outperforms existing system identification and model predictive control techniques.
Provides theoretical bounds on suboptimality related to data and side information.
Abstract
We develop a learning-based control algorithm for unknown dynamical systems under very severe data limitations. Specifically, the algorithm has access to streaming and noisy data only from a single and ongoing trial. It accomplishes such performance by effectively leveraging various forms of side information on the dynamics to reduce the sample complexity. Such side information typically comes from elementary laws of physics and qualitative properties of the system. More precisely, the algorithm approximately solves an optimal control problem encoding the system's desired behavior. To this end, it constructs and iteratively refines a data-driven differential inclusion that contains the unknown vector field of the dynamics. The differential inclusion, used in an interval Taylor-based method, enables to over-approximate the set of states the system may reach. Theoretically, we establish a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Advanced Control Systems Optimization
