Cross-Modal Purification and Fusion for Small-Object RGB-D Transmission-Line Defect Detection
Jiaming Cui, Wenqiang Li, Shuai Zhou, Ruifeng Qin, and Feng Shen

TL;DR
This paper introduces CMAFNet, a novel cross-modal network that effectively detects small transmission line defects by purifying and fusing RGB and depth data, significantly improving accuracy and efficiency over existing methods.
Contribution
The paper proposes a new purify-then-fuse paradigm with a semantic recomposition module and contextual semantic integration framework for small-object defect detection in RGB-D data.
Findings
CMAFNet achieves 32.2% mAP@50 on TLRGBD benchmark, outperforming baselines.
A lightweight variant reaches 24.8% mAP50 at 228 FPS, surpassing YOLO-based detectors.
The method effectively suppresses modality noise and enhances structural semantic reasoning.
Abstract
Transmission line defect detection remains challenging for automated UAV inspection due to the dominance of small-scale defects, complex backgrounds, and illumination variations. Existing RGB-based detectors, despite recent progress, struggle to distinguish geometrically subtle defects from visually similar background structures under limited chromatic contrast. This paper proposes CMAFNet, a Cross-Modal Alignment and Fusion Network that integrates RGB appearance and depth geometry through a principled purify-then-fuse paradigm. CMAFNet consists of a Semantic Recomposition Module that performs dictionary-based feature purification via a learned codebook to suppress modality-specific noise while preserving defect-discriminative information, and a Contextual Semantic Integration Framework that captures global spatial dependencies using partial-channel attention to enhance structural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote Sensing and LiDAR Applications · Infrastructure Maintenance and Monitoring
