# Visuo-tactile feedback policies for terminal assembly facilitated by reinforcement learning

**Authors:** Yuchao Li, Ziqi Jin, Jin Liu, Daolin Ma

PMC · DOI: 10.3389/frobt.2025.1660244 · 2025-10-22

## TL;DR

This paper introduces a reinforcement learning method that uses vision and touch feedback to safely assemble industrial terminals, achieving high success rates.

## Contribution

A novel safe reinforcement learning approach using visuo-tactile feedback for robust terminal assembly.

## Key findings

- The proposed method achieved 100% successful insertions in PLC terminal assembly.
- The method outperformed imitation learning and online-RL policies significantly.
- Decomposing the task into three phases improved robustness to grasp pose variations.

## Abstract

Industrial terminal assembly tasks are often repetitive and involve handling components with tight tolerances that are susceptible to damage. Learning an effective terminal assembly policy in real-world is challenging, as collisions between parts and the environment can lead to slippage or part breakage. In this paper, we propose a safe reinforcement learning approach to develop a visuo-tactile assembly policy that is robust to variations in grasp poses. Our method minimizes collisions between the terminal head and terminal base by decomposing the assembly task into three distinct phases. In the first grasp phase,a vision-guided model is trained to pick the terminal head from an initial bin. In the second align phase, a tactile-based grasp pose estimation model is employed to align the terminal head with the terminal base. In the final assembly phase, a visuo-tactile policy is learned to precisely insert the terminal head into the terminal base. To ensure safe training, the robot leverages human demonstrations and interventions. Experimental results on PLC terminal assembly demonstrate that the proposed method achieves 100% successful insertions across 100 different initial end-effector and grasp poses, while imitation learning and online-RL policy yield only 9% and 0%.

## Full-text entities

- **Genes:** HSPG2 (heparan sulfate proteoglycan 2) [NCBI Gene 3339] {aka HSPG, PLC, PRCAN, SJA, SJS, SJS1}
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12586048/full.md

---
Source: https://tomesphere.com/paper/PMC12586048