A Computer Vision Enabled damage detection model with improved YOLOv5 based on Transformer Prediction Head
Arunabha M. Roy, Jayabrata Bhaduri

TL;DR
This paper introduces DenseSPH-YOLOv5, a real-time damage detection model that combines DenseNet, CBAM, and Swin-Transformer components to improve accuracy and robustness in complex environments.
Contribution
The paper presents a novel damage detection model integrating DenseNet, CBAM, and Swin-Transformer Prediction Head for enhanced feature extraction and multiscale object detection.
Findings
Achieves 85.25% mAP on RDD-2018 dataset.
Operates at 62.4 FPS in real-time detection.
Outperforms existing state-of-the-art damage detection models.
Abstract
Objective:Computer vision-based up-to-date accurate damage classification and localization are of decisive importance for infrastructure monitoring, safety, and the serviceability of civil infrastructure. Current state-of-the-art deep learning (DL)-based damage detection models, however, often lack superior feature extraction capability in complex and noisy environments, limiting the development of accurate and reliable object distinction. Method: To this end, we present DenseSPH-YOLOv5, a real-time DL-based high-performance damage detection model where DenseNet blocks have been integrated with the backbone to improve in preserving and reusing critical feature information. Additionally, convolutional block attention modules (CBAM) have been implemented to improve attention performance mechanisms for strong and discriminating deep spatial feature extraction that results in superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Advanced Neural Network Applications · Fire Detection and Safety Systems
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · 1x1 Convolution · Concatenated Skip Connection · Global Average Pooling · Kaiming Initialization · Max Pooling · Softmax · Dense Connections · Convolution
