DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake Convolution and SegFormer for Crack Segmentation
Kaiwei Yu, I-Ming Chen, Jing Wu

TL;DR
DSCformer is a hybrid neural network combining enhanced Dynamic Snake Convolution and Transformers, designed to improve crack segmentation in concrete structures by capturing fine details and global context.
Contribution
The paper introduces DSCformer, a novel hybrid model with enhanced DSConv and WCAM modules, advancing crack segmentation accuracy over existing methods.
Findings
Achieved IoU of 59.22% on Crack3238 dataset.
Achieved IoU of 87.24% on FIND dataset.
Outperforms state-of-the-art methods in crack segmentation.
Abstract
In construction quality monitoring, accurately detecting and segmenting cracks in concrete structures is paramount for safety and maintenance. Current convolutional neural networks (CNNs) have demonstrated strong performance in crack segmentation tasks, yet they often struggle with complex backgrounds and fail to capture fine-grained tubular structures fully. In contrast, Transformers excel at capturing global context but lack precision in detailed feature extraction. We introduce DSCformer, a novel hybrid model that integrates an enhanced Dynamic Snake Convolution (DSConv) with a Transformer architecture for crack segmentation to address these challenges. Our key contributions include the enhanced DSConv through a pyramid kernel for adaptive offset computation and a simultaneous bi-directional learnable offset iteration, significantly improving the model's performance to capture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Industrial Vision Systems and Defect Detection · Tunneling and Rock Mechanics
MethodsAttention Is All You Need · Absolute Position Encodings · Label Smoothing · Adam · Residual Connection · Softmax · Linear Layer · Dropout · Layer Normalization · Multi-Head Attention
