Mastering the working sequence in human-robot collaborative assembly based on reinforcement learning
Tian Yu, Jing Huang, Qing Chang

TL;DR
This paper applies reinforcement learning, inspired by game-playing AI, to optimize human-robot collaborative assembly sequences, significantly improving efficiency through self-learning without domain-specific guidance.
Contribution
It introduces a novel reinforcement learning approach to optimize HRC assembly sequences, modeling the process as a game and training a neural network for decision prediction.
Findings
The RL-trained robot achieves higher assembly efficiency.
The neural network accurately predicts optimal move priorities.
The method outperforms traditional sequence planning approaches.
Abstract
A long-standing goal of the Human-Robot Collaboration (HRC) in manufacturing systems is to increase the collaborative working efficiency. In line with the trend of Industry 4.0 to build up the smart manufacturing system, the Co-robot in the HRC system deserves better designing to be more self-organized and to find the superhuman proficiency by self-learning. Inspired by the impressive machine learning algorithms developed by Google Deep Mind like Alphago Zero, in this paper, the human-robot collaborative assembly working process is formatted into a chessboard and the selection of moves in the chessboard is used to analogize the decision making by both human and robot in the HRC assembly working process. To obtain the optimal policy of working sequence to maximize the working efficiency, the robot is trained with a self-play algorithm based on reinforcement learning, without guidance or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Digital Transformation in Industry · Manufacturing Process and Optimization
