How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference
Toru Lin, Shuying Deng, Zhao-Heng Yin, Pieter Abbeel, Jitendra Malik

TL;DR
This paper introduces a two-stage learning framework for fine-grained manipulation tasks like peeling with a knife, combining force-aware imitation learning and human preference-based fine-tuning to achieve high success rates and generalization.
Contribution
It presents a novel approach that integrates quantitative and qualitative feedback for policy refinement in contact-rich, subjective tasks, demonstrating effective generalization and high success rates.
Findings
Achieves over 90% success rate on various produce
Improves performance by up to 40% through preference-based finetuning
Strong zero-shot generalization to unseen and out-of-distribution produce
Abstract
Many essential manipulation tasks - such as food preparation, surgery, and craftsmanship - remain intractable for autonomous robots. These tasks are characterized not only by contact-rich, force-sensitive dynamics, but also by their "implicit" success criteria: unlike pick-and-place, task quality in these domains is continuous and subjective (e.g. how well a potato is peeled), making quantitative evaluation and reward engineering difficult. We present a learning framework for such tasks, using peeling with a knife as a representative example. Our approach follows a two-stage pipeline: first, we learn a robust initial policy via force-aware data collection and imitation learning, enabling generalization across object variations; second, we refine the policy through preference-based finetuning using a learned reward model that combines quantitative task metrics with qualitative human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Soft Robotics and Applications · Motor Control and Adaptation
