Safe MPC Alignment with Human Directional Feedback

Zhixian Xie; Wenlong Zhang; Yi Ren; Zhaoran Wang; George J. Pappas; Wanxin Jin

arXiv:2407.04216 · cs.RO · December 9, 2025

TL;DR

This paper introduces a novel certifiable method for robots to learn safety constraints in MPC from human directional feedback, ensuring safety and efficiency in real-world tasks with minimal feedback.

Contribution

To the authors' knowledge, it is the first approach to learn safety constraints from human feedback. It is certifiable and feedback-efficient for safety-critical robot control.

Findings

01. Successfully learned safety constraints with tens of human corrections.

02. Validated on simulation games and real-world robot tasks.

03. Demonstrated efficacy and efficiency of the method.

Abstract

In safety-critical robot planning or control, manually specifying safety constraints or learning them from demonstrations can be challenging. In this article, we propose a certifiable alignment method for a robot to learn a safety constraint in its model predictive control (MPC) policy from human online directional feedback. To our knowledge, it is the first method to learn safety constraints from human feedback. The proposed method is based on an empirical observation: human directional feedback, when available, tends to guide the robot toward safer regions. The method requires only the direction of human feedback to update the learning hypothesis space. It is certifiable: it either provides an upper bound on the total number of human feedback instances in the case of successful learning, or declares hypothesis misspecification, i.e., the true safety constraint cannot be found within the specified…
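
The idea of shrinking a hypothesis space using only the direction of feedback can be illustrated with a toy sketch. This is not the paper's algorithm: it assumes a finite grid of candidate constraint parameters, a simulated human who points from the current estimate toward a hidden true parameter, and a trivial feedback budget equal to the number of candidates. The names `make_grid`, `cut`, `align`, and `simulated_human` are hypothetical and introduced only for illustration.

```python
import numpy as np

def make_grid(low, high, n):
    """Toy hypothesis set: an n-per-axis grid over the box [low, high]."""
    axes = [np.linspace(l, h, n) for l, h in zip(low, high)]
    mesh = np.meshgrid(*axes, indexing="ij")
    return np.stack([m.ravel() for m in mesh], axis=1)

def cut(candidates, center, direction):
    """Keep only hypotheses on the side of `center` that `direction` points to."""
    return candidates[(candidates - center) @ direction >= 0.0]

def align(candidates, human_direction):
    """Cut the hypothesis set once per correction.

    Returns (estimate, number_of_corrections). If the budget is used up
    while the human is still unsatisfied, returns (None, budget), which
    this toy treats as hypothesis misspecification.
    """
    # Assumed toy bound: if the true parameter is in the set, every cut
    # removes at least one wrong candidate, so fewer than len(candidates)
    # corrections are ever needed.
    budget = len(candidates)
    for n_corrections in range(budget):
        center = candidates.mean(axis=0)      # current estimate
        direction = human_direction(center)   # only a direction is used
        if direction is None:                 # human gives no correction
            return center, n_corrections
        candidates = cut(candidates, center, direction)
    return None, budget

def simulated_human(theta_true, tol):
    """Hypothetical stand-in for a person: points toward the true parameter."""
    def human_direction(center):
        gap = theta_true - center
        return None if np.linalg.norm(gap) < tol else gap
    return human_direction

if __name__ == "__main__":
    grid = make_grid([0.0, 0.0], [1.0, 1.0], 11)
    theta_true = grid[37].copy()  # hidden parameter, chosen from the grid
    estimate, used = align(grid, simulated_human(theta_true, tol=0.03))
    print("estimate:", estimate, "corrections used:", used)
```

Cutting through the mean of the remaining candidates is a design choice for this toy only; it keeps the set non-empty and guarantees progress when the true parameter is among the candidates. A true parameter outside the grid exhausts the budget and is reported as misspecification.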

Taxonomy

Topics: Advanced Control Systems Optimization