Tolerance-Guided Policy Learning for Adaptable and Transferrable Delicate Industrial Insertion
Boshen Niu, Chenxi Wang, Changliu Liu

TL;DR
This paper introduces a tolerance-guided policy learning approach for delicate industrial insertion tasks, enhancing transferability and defect robustness through task embedding, generative adversarial imitation learning, and probabilistic inference.
Contribution
It proposes a novel tolerance-guided policy learning framework combining task embedding, RS-GAIL, and probabilistic inference for improved adaptability and transferability in industrial insertion tasks.
Findings
RS-GAIL efficiently learns policies with sparse rewards
Tolerance embedding improves transferability across workpieces
Probabilistic inference enhances robustness to workpiece defects
Abstract
Policy learning for delicate industrial insertion tasks (e.g., PC board assembly) is challenging. This paper considers two major problems: how to learn a diversified policy (instead of just one average policy) that can efficiently handle different workpieces with minimum amount of training data, and how to handle defects of workpieces during insertion. To address the problems, we propose tolerance-guided policy learning. To encourage transferability of the learned policy to different workpieces, we add a task embedding to the policy's input space using the insertion tolerance. Then we train the policy using generative adversarial imitation learning with reward shaping (RS-GAIL) on a variety of representative situations. To encourage adaptability of the learned policy to handle defects, we build a probabilistic inference model that can output the best inserting pose based on failed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Manufacturing Process and Optimization · Reinforcement Learning in Robotics
