Keyframe-Guided Structured Rewards for Reinforcement Learning in Long-Horizon Laboratory Robotics

Yibo Qiu, Shu'ang Sun, Haoliang Ye, Ronald X Xu, and Mingzhai Sun

arXiv:2603.00719·cs.RO·March 3, 2026

Open Access

TL;DR

This paper introduces a keyframe-guided reward framework for reinforcement learning on long-horizon laboratory robotic tasks. It automatically extracts keyframes from demonstrations and generates structured, stage-wise rewards, reaching an 82% success rate after 40-60 minutes of fine-tuning.

Contribution

The paper presents a reward generation framework that extracts kinematics-aware keyframes from demonstrations and uses a diffusion-based predictor to generate stage-wise targets in latent space, which then guide reinforcement learning in long-horizon laboratory automation tasks.

Findings

01

Achieved an 82% success rate after 40-60 minutes of fine-tuning.

02

Outperformed existing methods HG-DAgger and Hil-ConRFT.

03

Effective on high-precision pipette tip attachment and liquid transfer tasks.

Abstract

Long-horizon precision manipulation in laboratory automation, such as pipette tip attachment and liquid transfer, requires policies that respect strict procedural logic while operating in continuous, high-dimensional state spaces. However, existing approaches struggle with reward sparsity, multi-stage structural constraints, and noisy or imperfect demonstrations, leading to inefficient exploration and unstable convergence. We propose a Keyframe-Guided Reward Generation Framework that automatically extracts kinematics-aware keyframes from demonstrations, generates stage-wise targets via a diffusion-based predictor in latent space, and constructs a geometric progress-based reward to guide online reinforcement learning. The framework integrates multi-view visual encoding, latent similarity-based progress tracking, and human-in-the-loop reinforcement fine-tuning on a Vision-Language-Action…
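To make the idea of a stage-wise, similarity-based progress reward concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not give the reward formula, so the use of cosine similarity, the `reach_threshold` and `stage_bonus` parameters, and the class name `StageProgressReward` are all assumptions made for illustration. The per-stage targets, which the paper obtains from a diffusion-based predictor, are replaced here by fixed latent vectors.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two latent vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps))


class StageProgressReward:
    """Hypothetical stage-wise progress reward (illustrative only).

    stage_targets: one latent target per stage, in procedural order.
    In the paper these come from a diffusion-based predictor; here
    they are simply given as fixed vectors.
    """

    def __init__(self, stage_targets, reach_threshold=0.95, stage_bonus=1.0):
        self.targets = [np.asarray(t, dtype=float) for t in stage_targets]
        self.reach_threshold = reach_threshold  # assumed similarity cutoff
        self.stage_bonus = stage_bonus          # assumed bonus per stage
        self.stage = 0
        self.prev_sim = None

    def reset(self):
        """Call at the start of every episode."""
        self.stage = 0
        self.prev_sim = None

    def __call__(self, z):
        """Return the reward for the current latent observation z."""
        if self.stage >= len(self.targets):
            return 0.0  # all stages already completed

        sim = cosine_similarity(z, self.targets[self.stage])
        # Dense term: change in similarity to the current stage target.
        reward = 0.0 if self.prev_sim is None else sim - self.prev_sim
        self.prev_sim = sim

        # Advance to the next stage once the target is reached, so
        # stages can only be completed in order.
        if sim >= self.reach_threshold:
            reward += self.stage_bonus
            self.stage += 1
            self.prev_sim = None
        return reward


if __name__ == "__main__":
    reward_fn = StageProgressReward([[1.0, 0.0], [0.0, 1.0]])
    for z in ([1.0, 1.0], [1.0, 0.1], [1.0, 0.0], [0.0, 1.0]):
        print(reward_fn.stage, round(reward_fn(z), 3))
```

Tracking only the current stage's target is one simple way to respect procedural order: similarity to a later keyframe earns nothing until the earlier stages are done. A real system would compute `z` from the multi-view visual encoder rather than from hand-written vectors.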

Taxonomy

Topics: Robot Manipulation and Learning · Reinforcement Learning in Robotics · Soft Robotics and Applications