Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation

Lanpei Li; Enrico Donato; Vincenzo Lomonaco; Egidio Falotico

arXiv:2404.04219 · cs.RO · February 10, 2025 · 1 citation

TL;DR

This paper presents a Continual Policy Distillation framework that enables soft robotic hands to learn versatile, adaptive in-hand manipulation skills across different objects by transferring knowledge from multiple expert policies while mitigating forgetting.

Contribution

The novel CPD framework combines policy distillation with exemplar-based rehearsal to improve adaptability and generalization in soft robotic in-hand manipulation tasks.

Findings

01 Effective knowledge transfer from multiple experts

02 Enhanced generalization and adaptability in manipulation

03 Mitigated catastrophic forgetting during learning

Abstract

Dexterous manipulation, often facilitated by multi-fingered robotic hands, has significant impact on real-world applications. Soft robotic hands, thanks to their compliant nature, offer flexibility and adaptability during object grasping and manipulation. Yet these benefits come with challenges, particularly in developing controllers for finger coordination. Reinforcement Learning (RL) can be employed to train object-specific in-hand manipulation policies, but such policies are limited in adaptability and generalizability. We introduce a Continual Policy Distillation (CPD) framework to acquire a versatile controller for in-hand manipulation that rotates objects of different shapes and sizes within a four-fingered soft gripper. The framework leverages Policy Distillation (PD) to transfer knowledge from expert policies to a continually evolving student policy network. Exemplar-based rehearsal methods are then integrated…
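
The two ingredients named in the abstract, distilling expert actions into a student and rehearsing stored exemplars, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the paper or its repository: the names `RehearsalBuffer`, `distillation_loss`, and `build_batch` are hypothetical, reservoir sampling stands in for whatever exemplar-selection rule the authors use, and mean squared error between actions stands in for their distillation objective.

```python
import random


class RehearsalBuffer:
    """Fixed-size exemplar memory (hypothetical helper).

    Reservoir sampling is an assumed selection rule; it keeps a
    uniform random subset of all (state, teacher_action) pairs seen.
    """

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, state, teacher_action):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append((state, teacher_action))
        else:
            # Replace a stored exemplar with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.items[j] = (state, teacher_action)

    def sample(self, k):
        # Draw up to k stored exemplars without replacement.
        return self.rng.sample(self.items, min(k, len(self.items)))


def distillation_loss(student_action, teacher_action):
    """Mean squared error between student and teacher actions (assumed loss)."""
    n = len(teacher_action)
    return sum((s - t) ** 2 for s, t in zip(student_action, teacher_action)) / n


def build_batch(new_pairs, buffer, replay_k):
    """Mix pairs from the current expert with replayed exemplars from earlier ones."""
    return list(new_pairs) + buffer.sample(replay_k)
```

In a loop over experts, the student would be trained on batches from `build_batch` and the buffer updated with pairs from the expert just distilled, so that earlier objects keep contributing to the loss while new ones are learned.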

Code & Models

Repositories

lilanpei/Continual-Policy-Distillation-for-Soft-Robotic
Official

Taxonomy

Topics: Robot Manipulation and Learning · Soft Robotics and Applications