Learning a High-quality Robotic Wiping Policy Using Systematic Reward   Analysis and Visual-Language Model Based Curriculum

Yihong Liu; Dongyeop Kang; Sehoon Ha

arXiv:2502.12599·cs.RO·February 19, 2025

Learning a High-quality Robotic Wiping Policy Using Systematic Reward Analysis and Visual-Language Model Based Curriculum

Yihong Liu, Dongyeop Kang, Sehoon Ha

PDF

Open Access

TL;DR

This paper introduces a systematic reward analysis and a visual-language model based curriculum to improve deep reinforcement learning for robotic wiping, enabling high-quality cleaning policies on complex surfaces.

Contribution

It proposes a new bounded reward formulation and a VLM-based curriculum to enhance learning efficiency and policy quality in robotic wiping tasks.

Findings

01

Successfully learned wiping policies on various surface types

02

Improved convergence compared to baseline reward formulations

03

Demonstrated adaptability to different surface geometries

Abstract

Autonomous robotic wiping is an important task in various industries, ranging from industrial manufacturing to sanitization in healthcare. Deep reinforcement learning (Deep RL) has emerged as a promising algorithm, however, it often suffers from a high demand for repetitive reward engineering. Instead of relying on manual tuning, we first analyze the convergence of quality-critical robotic wiping, which requires both high-quality wiping and fast task completion, to show the poor convergence of the problem and propose a new bounded reward formulation to make the problem feasible. Then, we further improve the learning process by proposing a novel visual-language model (VLM) based curriculum, which actively monitors the progress and suggests hyperparameter tuning. We demonstrate that the combined method can find a desirable wiping policy on surfaces with various curvatures, frictions, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPharmacy and Medical Practices · Online Learning and Analytics