How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference

Toru Lin; Shuying Deng; Zhao-Heng Yin; Pieter Abbeel; Jitendra Malik

arXiv:2603.03280·cs.RO·March 4, 2026

How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference

Toru Lin, Shuying Deng, Zhao-Heng Yin, Pieter Abbeel, Jitendra Malik

PDF

Open Access

TL;DR

This paper introduces a two-stage learning framework for fine-grained manipulation tasks like peeling with a knife, combining force-aware imitation learning and human preference-based fine-tuning to achieve high success rates and generalization.

Contribution

It presents a novel approach that integrates quantitative and qualitative feedback for policy refinement in contact-rich, subjective tasks, demonstrating effective generalization and high success rates.

Findings

01

Achieves over 90% success rate on various produce

02

Improves performance by up to 40% through preference-based finetuning

03

Strong zero-shot generalization to unseen and out-of-distribution produce

Abstract

Many essential manipulation tasks - such as food preparation, surgery, and craftsmanship - remain intractable for autonomous robots. These tasks are characterized not only by contact-rich, force-sensitive dynamics, but also by their "implicit" success criteria: unlike pick-and-place, task quality in these domains is continuous and subjective (e.g. how well a potato is peeled), making quantitative evaluation and reward engineering difficult. We present a learning framework for such tasks, using peeling with a knife as a representative example. Our approach follows a two-stage pipeline: first, we learn a robust initial policy via force-aware data collection and imitation learning, enabling generalization across object variations; second, we refine the policy through preference-based finetuning using a learned reward model that combines quantitative task metrics with qualitative human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Soft Robotics and Applications · Motor Control and Adaptation