Continual Policy Distillation of Reinforcement Learning-based Controllers for Soft Robotic In-Hand Manipulation
Lanpei Li, Enrico Donato, Vincenzo Lomonaco, Egidio Falotico

TL;DR
This paper presents a Continual Policy Distillation framework that enables soft robotic hands to learn versatile, adaptive in-hand manipulation skills across different objects by transferring knowledge from multiple expert policies while mitigating forgetting.
Contribution
The novel CPD framework combines policy distillation with exemplar-based rehearsal to improve adaptability and generalization in soft robotic in-hand manipulation tasks.
Findings
Effective knowledge transfer from multiple experts
Enhanced generalization and adaptability in manipulation
Mitigated catastrophic forgetting during learning
Abstract
Dexterous manipulation, often facilitated by multi-fingered robotic hands, holds solid impact for real-world applications. Soft robotic hands, due to their compliant nature, offer flexibility and adaptability during object grasping and manipulation. Yet, benefits come with challenges, particularly in the control development for finger coordination. Reinforcement Learning (RL) can be employed to train object-specific in-hand manipulation policies, but limiting adaptability and generalizability. We introduce a Continual Policy Distillation (CPD) framework to acquire a versatile controller for in-hand manipulation, to rotate different objects in shape and size within a four-fingered soft gripper. The framework leverages Policy Distillation (PD) to transfer knowledge from expert policies to a continually evolving student policy network. Exemplar-based rehearsal methods are then integrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Soft Robotics and Applications
