# WG-FuseNet: Wavelet-Guided and Gated Fusion Network for Road Segmentation

**Authors:** Yu Nie, Jiaqi Sun, Ming Zhu, Yuan Liu, Yuanfu Yuan, Shuhui Jiang, Yan Lu, Jiarong Wang

PMC · DOI: 10.3390/s26010218 · Sensors (Basel, Switzerland) · 2025-12-29

## TL;DR

This paper introduces a new network for road segmentation that improves edge detection and texture handling using wavelet analysis and adaptive fusion of RGB and depth data.

## Contribution

The novel Cross-scale Wavelet Enhancement Module and Gated Cross-Modality Fusion module improve road segmentation accuracy and efficiency.

## Key findings

- The proposed model achieves 97.31% segmentation accuracy on the KITTI_Road dataset.
- The method maintains a real-time inference speed of 34 FPS.
- It performs exceptionally well in road edge integrity and shadow area handling.

## Abstract

In current road segmentation tasks, high-frequency details of roads (such as road edges and pavement textures) tend to become blurred or even lost during feature extraction due to progressive downsampling, leading to imprecise segmentation boundaries. Moreover, existing fusion methods predominantly rely on simple concatenation or summation operations, which struggle to adaptively integrate the rich texture information from RGB modality with the geometric structural information from Depth modality, thereby limiting fusion efficiency. To address these issues, this paper proposes an innovative model. We design a Cross-scale Wavelet Enhancement Module (CWEM) to compensate for the shortcomings of traditional networks in frequency domain analysis, explicitly enhancing the representation capability of edge and texture features. Simultaneously, a Gated Cross-Modality Fusion module (GCMF) is constructed to achieve adaptive and efficient fusion between RGB and Depth features. Additionally, to tackle the high false detection rates and confusion between sidewalks and opposite lanes in existing methods, this paper optimizes the loss function to further improve the model’s discriminative ability in complex scenarios. Experiments on the public KITTI_Road dataset demonstrate that the proposed method achieves a segmentation accuracy of 97.31% while maintaining a real-time inference speed of 34 FPS, with particularly outstanding performance in road edge integrity and shadow area handling.

## Full-text entities

- **Diseases:** WT (MESH:D002472), injury to (MESH:D014947)
- **Chemicals:** CWEM (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12788201/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12788201/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12788201/full.md

---
Source: https://tomesphere.com/paper/PMC12788201