SMR-Net:Robot Snap Detection Based on Multi-Scale Features and Self-Attention Network

Kuanxu Hou

arXiv:2603.01036·cs.CV·March 3, 2026

SMR-Net:Robot Snap Detection Based on Multi-Scale Features and Self-Attention Network

Kuanxu Hou

PDF

Open Access

TL;DR

This paper introduces SMR-Net, a self-attention-based multi-scale detection algorithm designed to improve snap detection and localization accuracy in robotic assembly, especially under complex visual conditions.

Contribution

The paper proposes a novel attention-enhanced multi-scale feature fusion architecture for snap detection, significantly improving robustness and precision over traditional methods.

Findings

01

IoU improved by 6.52% and 5.8% on two datasets

02

mAP increased by 2.8% and 1.5%

03

Superior performance in complex snap detection tasks

Abstract

In robot automated assembly, snap assembly precision and efficiency directly determine overall production quality. As a core prerequisite, snap detection and localization critically affect subsequent assembly success. Traditional visual methods suffer from poor robustness and large localization errors when handling complex scenarios (e.g., transparent or low-contrast snaps), failing to meet high-precision assembly demands. To address this, this paper designs a dedicated sensor and proposes SMR-Net, an self-attention-based multi-scale object detection algorithm, to synergistically enhance detection and localization performance. SMR-Net adopts an attention-enhanced multi-scale feature fusion architecture: raw sensor data is encoded via an attention-embedded feature extractor to strengthen key snap features and suppress noise; three multi-scale feature maps are processed in parallel with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection