Output Feedback Tube MPC-Guided Data Augmentation for Robust, Efficient   Sensorimotor Policy Learning

Andrea Tagliabue; Jonathan P. How

arXiv:2210.10127·cs.RO·October 20, 2022

Output Feedback Tube MPC-Guided Data Augmentation for Robust, Efficient Sensorimotor Policy Learning

Andrea Tagliabue, Jonathan P. How

PDF

Open Access

TL;DR

This paper introduces a novel data augmentation method combining imitation learning with a robust tube MPC controller, enabling efficient and robust sensorimotor policy learning from minimal demonstrations, specifically demonstrated on aerial robot trajectory tracking.

Contribution

It presents a new approach that integrates RTMPC-based data augmentation with imitation learning to significantly reduce demonstration requirements and enhance robustness.

Findings

01

Learned a robust visuomotor policy from a single demonstration.

02

Reduced demonstration collection and computation time by two orders of magnitude.

03

Enhanced robustness to sensing and process uncertainties.

Abstract

Imitation learning (IL) can generate computationally efficient sensorimotor policies from demonstrations provided by computationally expensive model-based sensing and control algorithms. However, commonly employed IL methods are often data-inefficient, requiring the collection of a large number of demonstrations and producing policies with limited robustness to uncertainties. In this work, we combine IL with an output feedback robust tube model predictive controller (RTMPC) to co-generate demonstrations and a data augmentation strategy to efficiently learn neural network-based sensorimotor policies. Thanks to the augmented data, we reduce the computation time and the number of demonstrations needed by IL, while providing robustness to sensing and process uncertainty. We tailor our approach to the task of learning a trajectory tracking visuomotor policy for an aerial robot, leveraging a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Control Systems Optimization · Robot Manipulation and Learning · Reinforcement Learning in Robotics