Beyond Success: Refining Elegant Robot Manipulation from Mixed-Quality Data via Just-in-Time Intervention

Yanbo Mao; Jianlong Fu; Ruoxuan Zhang; Hongxia Xie; Meibao Yao

arXiv:2511.22555·cs.RO·December 1, 2025

Beyond Success: Refining Elegant Robot Manipulation from Mixed-Quality Data via Just-in-Time Intervention

Yanbo Mao, Jianlong Fu, Ruoxuan Zhang, Hongxia Xie, Meibao Yao

PDF

Open Access

TL;DR

This paper introduces a framework for improving robotic manipulation execution quality by using an Elegance Critic and Just-in-Time Intervention, which refine actions based on implicit task constraints without retraining the base policy.

Contribution

It proposes a decoupled refinement framework with an Elegance Critic trained via offline Calibrated Q-Learning and a JITI mechanism for on-demand intervention, enhancing execution quality in VLA models.

Findings

01

Significant improvement in execution quality on LIBERO-Elegant benchmark.

02

Effective refinement on unseen manipulation tasks.

03

JITI mechanism reduces unnecessary interventions.

Abstract

Vision-Language-Action (VLA) models have enabled notable progress in general-purpose robotic manipulation, yet their learned policies often exhibit variable execution quality. We attribute this variability to the mixed-quality nature of human demonstrations, where the implicit principles that govern how actions should be carried out are only partially satisfied. To address this challenge, we introduce the LIBERO-Elegant benchmark with explicit criteria for evaluating execution quality. Using these criteria, we develop a decoupled refinement framework that improves execution quality without modifying or retraining the base VLA policy. We formalize Elegant Execution as the satisfaction of Implicit Task Constraints (ITCs) and train an Elegance Critic via offline Calibrated Q-Learning to estimate the expected quality of candidate actions. At inference time, a Just-in-Time Intervention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics