EDFNet: Early Fusion of Edge and Depth for Thin-Obstacle Segmentation in UAV Navigation
Negar Fathi

TL;DR
EDFNet is a modular early-fusion framework that combines RGB, depth, and edge cues to improve thin-obstacle segmentation for UAV navigation, demonstrating competitive performance on the DDOS dataset.
Contribution
The paper introduces EDFNet, a novel early-fusion segmentation approach integrating multimodal data for enhanced thin-obstacle perception in aerial scenes.
Findings
Early RGB-Depth-Edge fusion yields balanced segmentation performance.
Pretrained RGBDE U-Net achieves highest scores on key metrics.
Performance on ultra-thin categories remains a significant challenge.
Abstract
Autonomous Unmanned Aerial Vehicles (UAVs) must reliably detect thin obstacles such as wires, poles, and branches to navigate safely in real-world environments. These structures remain difficult to perceive because they occupy few pixels, often exhibit weak visual contrast, and are strongly affected by class imbalance. Existing segmentation methods primarily target coarser obstacles and do not fully exploit the complementary multimodal cues needed for thin-structure perception. We present EDFNet, a modular early-fusion segmentation framework that integrates RGB, depth, and edge information for thin-obstacle perception in cluttered aerial scenes. We evaluate EDFNet on the Drone Depth and Obstacle Segmentation (DDOS) dataset across sixteen modality-backbone configurations using U-Net and DeepLabV3 in pretrained and non-pretrained settings. The results show that early RGB-Depth-Edge fusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
