Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading

Fang Gao; Xuetao Li; Jiabao Wang; Shengheng Ma; Jun Yu

arXiv:2410.05762·cs.CV·February 12, 2026

Guided Self-attention: Find the Generalized Necessarily Distinct Vectors for Grain Size Grading

Fang Gao, Xuetao Li, Jiabao Wang, Shengheng Ma, Jun Yu

PDF

TL;DR

This paper introduces GSNets, a deep learning classification method with guided self-attention for automated steel grain size analysis, achieving high accuracy and potential for broader image analysis tasks.

Contribution

The paper presents a novel guided self-attention module and triple-stream merging technique that enhance feature representation and model generalization in grain size classification.

Findings

01

Achieved 90.1% classification accuracy, surpassing state-of-the-art methods.

02

Introduced a guided self-attention module for better relational feature capture.

03

Demonstrated potential applicability to object detection and semantic segmentation.

Abstract

With the development of steel materials, metallographic analysis has become increasingly important. Unfortunately, grain size analysis is a manual process that requires experts to evaluate metallographic photographs, which is unreliable and time-consuming. To resolve this problem, we propose a novel classifi-cation method based on deep learning, namely GSNets, a family of hybrid models which can effectively introduce guided self-attention for classifying grain size. Concretely, we build our models from three insights:(1) Introducing our novel guided self-attention module can assist the model in finding the generalized necessarily distinct vectors capable of retaining intricate rela-tional connections and rich local feature information; (2) By improving the pixel-wise linear independence of the feature map, the highly condensed semantic representation will be captured by the model; (3)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Stochastic Depth · Dropout · Byte Pair Encoding · Absolute Position Encodings