# ZoomHead: A Flexible and Lightweight Detection Head Structure Design for Slender Cracks

**Authors:** Hua Li, Fan Yang, Junzhou Huo, Qiang Gao, Shusen Deng, Chang Guo

PMC · DOI: 10.3390/s25133990 · Sensors (Basel, Switzerland) · 2025-06-26

## TL;DR

This paper introduces ZoomHead, a lightweight detection head structure that improves crack detection on metal surfaces while reducing computational costs.

## Contribution

ZoomHead is a novel lightweight detection head structure that enhances detection performance and reduces model complexity for crack detection.

## Key findings

- ZoomHead improves detection accuracy while reducing parameters and computations.
- The model achieves higher FPS and maintains the same mAP as state-of-the-art models.
- ZoomHead outperforms existing models in balancing detection accuracy and speed.

## Abstract

Detecting metal surface crack defects is of great significance for the safe operation of industrial equipment. However, most existing mainstream deep-object detection models suffer from complex structures, large parameter sizes, and high training costs, which hinder their deployment and application in frontline construction sites. Therefore, this paper optimizes the existing YOLO series head structure and proposes a lightweight detection head structure, ZoomHead, with lower computational complexity and stronger detection performance. First, the GroupNorm2d module replaces the BatchNorm2d module to stabilize the model’s feature distribution and accelerate the training speed. Second, Detail Enhanced Convolution (DEConv) replaces traditional convolution kernels, and shared convolution is adopted to reduce redundant structures, which enhances the ability to capture details and improves the detection performance for small objects. Next, the Zoom scale factor is introduced to achieve proportional scaling of the convolution kernels in the regression branch, minimizing redundant computational complexity. Finally, using the YOLOv10 and YOLO11 series models as baseline models, ZoomHead was used to replace the head structure of the baseline models entirely, and a series of performance comparison experiments were conducted on the rail surface crack dataset and NEU surface defect database. The results showed that the integration of ZoomHead effectively improved the model’s detection accuracy, reduced the number of parameters and computations, and increased the FPS, achieving a good balance between detection accuracy and speed. In the comparative experiment of the SOTA model, the addition of ZoomHead resulted in the model having the smallest number of parameters and the highest FPS, while maintaining the same mAP value as the SOTA model, indicating that the ZoomHead structure proposed in this paper has better comprehensive detection performance.

## Full-text entities

- **Diseases:** SSD (MESH:C563928), Crack defect (MESH:D003387), steel defect (MESH:D013494), GN (MESH:C537354), NEU surface (MESH:C536405), metal (MESH:D013651), injury to (MESH:D014947), surface defect (MESH:D010534)
- **Chemicals:** lithium (MESH:D008094), steel (MESH:D013232), DEConv (-), aluminum (MESH:D000535)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** YOLOv10m — Homo sapiens (Human), Spontaneously immortalized cell line (CVCL_C7QB), YOLOv10x — Homo sapiens (Human), Ovarian clear cell adenocarcinoma, Cancer cell line (CVCL_DH04), YOLOv10 — Mus musculus (Mouse), Hybridoma (CVCL_C4R4)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12252334/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12252334/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/PMC12252334/full.md

---
Source: https://tomesphere.com/paper/PMC12252334