# VBM-YOLO: an enhanced YOLO model with reduced information loss for vehicle body markers detection

**Authors:** Bin Wang, Chao Li, Chao Zhou, Jun Sun

PMC · DOI: 10.7717/peerj-cs.2932 · PeerJ Computer Science · 2025-06-02

## TL;DR

This paper introduces VBM-YOLO, an improved YOLO model that better detects vehicle body markers by reducing information loss during detection.

## Contribution

The novel VBM-YOLO model introduces a cross-spatial-channel attention mechanism, a multi-scale selective feature pyramid network, and an auxiliary gradient branch.

## Key findings

- VBM-YOLO improves mean average precision by 2.3% and 4.3% compared to YOLOv8s.
- The model achieves a better balance between accuracy and computational resources.
- It shows good generalization on public datasets like PASCAL VOC and D-Fire.

## Abstract

In vehicle safety detection, the accurate identification of body markers on medium and large vehicles plays a critical role in ensuring safe road travel. To address the issues of the feature and gradient information loss in previous You Only Look Once (YOLO) series models, a novel Vehicle Body Markers YOLO (VBM-YOLO) model has been designed. Firstly, the model integrates the cross-spatial-channel attention (CSCA) mechanism proposed in this study. The CSCA uses cross-dimensional information to address interaction issues during the fusion of spatial and channel dimensions, significantly enhancing the model’s representational capacity. Secondly, we propose a multi-scale selective feature pyramid network (MSSFPN). By a progressive fusion approach and multi-scale feature selection learning, MSSFPN alleviates the issues of feature loss and target layer information confusion caused by traditional top-down and bottom-up feature pyramids. Finally, an auxiliary gradient branch (AGB) is proposed. During training, AGB incorporates feature information from different target layers to help the current layer retain complete gradient information. Additionally, the AGB branch does not participate in model inference, thereby reducing additional overhead. Experimental results demonstrate that VBM-YOLO improves mean average precision (mAP) by 2.3% and 4.3% at intersection over union (IoU) thresholds of 0.5 and 0.5:0.95, respectively, compared to YOLOv8s on the vehicle body markers dataset. VBM-YOLO also achieves a better balance between accuracy and computational resources than other mainstream models, exhibiting good generalization performance on public datasets like PASCAL VOC and D-Fire.

## Full-text entities

- **Genes:** ETFA (electron transfer flavoprotein subunit alpha) [NCBI Gene 2108] {aka EMA, GA2, MADD}, EMG1 (EMG1 N1-specific pseudouridine methyltransferase) [NCBI Gene 10436] {aka C2F, Grcc2f, NEP1}
- **Diseases:** Fire (MESH:D000092422)
- **Chemicals:** YOLO (-), VBM (MESH:C035913)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12193416/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12193416/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/PMC12193416/full.md

---
Source: https://tomesphere.com/paper/PMC12193416