Constrained Style Learning from Imperfect Demonstrations under Task Optimality

Kehan Wen; Chenhao Li; Junzhe He; Marco Hutter

arXiv:2507.09371·cs.RO·September 24, 2025

Constrained Style Learning from Imperfect Demonstrations under Task Optimality

Kehan Wen, Chenhao Li, Junzhe He, Marco Hutter

PDF

Open Access

TL;DR

This paper introduces a constrained learning framework that enables robots to imitate style from imperfect demonstrations while maintaining near-optimal task performance, validated on real robotic hardware.

Contribution

It formulates style learning as a constrained Markov Decision Process with an adaptive Lagrangian, allowing selective imitation without sacrificing task effectiveness.

Findings

01

Achieved high-fidelity style imitation on multiple robots.

02

Maintained near-optimal task performance despite imperfect demonstrations.

03

Reduced mechanical energy by 14.5% on real hardware.

Abstract

Learning from demonstration has proven effective in robotics for acquiring natural behaviors, such as stylistic motions and lifelike agility, particularly when explicitly defining style-oriented reward functions is challenging. Synthesizing stylistic motions for real-world tasks usually requires balancing task performance and imitation quality. Existing methods generally depend on expert demonstrations closely aligned with task objectives. However, practical demonstrations are often incomplete or unrealistic, causing current methods to boost style at the expense of task performance. To address this issue, we propose formulating the problem as a constrained Markov Decision Process (CMDP). Specifically, we optimize a style-imitation objective with constraints to maintain near-optimal task performance. We introduce an adaptively adjustable Lagrangian multiplier to guide the agent to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI