Hierarchical Semantic-Visual Fusion of Visible and Near-infrared Images for Long-range Haze Removal
Yi Li, Xiaoxiong Wang, Jiawei Wang, Yi Chang, Kai Cao, Luxin Yan

TL;DR
This paper introduces a hierarchical fusion framework leveraging semantic and visual cues from visible and near-infrared images to effectively remove long-range haze, outperforming existing methods.
Contribution
The proposed HSVF framework uniquely combines semantic and structural information from multimodal images for long-range haze removal, with a new dataset for benchmarking.
Findings
Outperforms state-of-the-art methods in long-range haze removal
Effectively restores distant scene details under severe haze
Provides a new dataset with semantic labels for benchmarking
Abstract
While image dehazing has advanced substantially in the past decade, most efforts have focused on short-range scenarios, leaving long-range haze removal under-explored. As distance increases, intensified scattering leads to severe haze and signal loss, making it impractical to recover distant details solely from visible images. Near-infrared, with superior fog penetration, offers critical complementary cues through multimodal fusion. However, existing methods focus on content integration while often neglecting haze embedded in visible images, leading to results with residual haze. In this work, we argue that the infrared and visible modalities not only provide complementary low-level visual features, but also share high-level semantic consistency. Motivated by this, we propose a Hierarchical Semantic-Visual Fusion (HSVF) framework, comprising a semantic stream to reconstruct haze-free…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
