DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake   Convolution and SegFormer for Crack Segmentation

Kaiwei Yu; I-Ming Chen; Jing Wu

arXiv:2411.09371·cs.CV·November 15, 2024

DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake Convolution and SegFormer for Crack Segmentation

Kaiwei Yu, I-Ming Chen, Jing Wu

PDF

Open Access

TL;DR

DSCformer is a hybrid neural network combining enhanced Dynamic Snake Convolution and Transformers, designed to improve crack segmentation in concrete structures by capturing fine details and global context.

Contribution

The paper introduces DSCformer, a novel hybrid model with enhanced DSConv and WCAM modules, advancing crack segmentation accuracy over existing methods.

Findings

01

Achieved IoU of 59.22% on Crack3238 dataset.

02

Achieved IoU of 87.24% on FIND dataset.

03

Outperforms state-of-the-art methods in crack segmentation.

Abstract

In construction quality monitoring, accurately detecting and segmenting cracks in concrete structures is paramount for safety and maintenance. Current convolutional neural networks (CNNs) have demonstrated strong performance in crack segmentation tasks, yet they often struggle with complex backgrounds and fail to capture fine-grained tubular structures fully. In contrast, Transformers excel at capturing global context but lack precision in detailed feature extraction. We introduce DSCformer, a novel hybrid model that integrates an enhanced Dynamic Snake Convolution (DSConv) with a Transformer architecture for crack segmentation to address these challenges. Our key contributions include the enhanced DSConv through a pyramid kernel for adaptive offset computation and a simultaneous bi-directional learnable offset iteration, significantly improving the model's performance to capture…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInfrastructure Maintenance and Monitoring · Industrial Vision Systems and Defect Detection · Tunneling and Rock Mechanics

MethodsAttention Is All You Need · Absolute Position Encodings · Label Smoothing · Adam · Residual Connection · Softmax · Linear Layer · Dropout · Layer Normalization · Multi-Head Attention