AWM-Fuse: Multi-Modality Image Fusion for Adverse Weather via Global and Local Text Perception

Xilai Li; Huichun Liu; Xiaosong Li; Tao Ye; Zhenyu Kuang; Huafeng Li

arXiv:2508.16881·cs.CV·August 26, 2025

AWM-Fuse: Multi-Modality Image Fusion for Adverse Weather via Global and Local Text Perception

Xilai Li, Huichun Liu, Xiaosong Li, Tao Ye, Zhenyu Kuang, Huafeng Li

PDF

TL;DR

AWM-Fuse introduces a multi-modality image fusion approach that utilizes global and local textual perception to enhance scene clarity and semantic understanding in adverse weather conditions, outperforming existing methods.

Contribution

The paper presents a novel fusion method integrating global and local text perception modules, leveraging BLIP and ChatGPT for improved semantic-aware image fusion under adverse weather.

Findings

01

Outperforms state-of-the-art in adverse weather scenarios

02

Enhances semantic perception through textual guidance

03

Improves downstream task performance

Abstract

Multi-modality image fusion (MMIF) in adverse weather aims to address the loss of visual information caused by weather-related degradations, providing clearer scene representations. Although less studies have attempted to incorporate textual information to improve semantic perception, they often lack effective categorization and thorough analysis of textual content. In response, we propose AWM-Fuse, a novel fusion method for adverse weather conditions, designed to handle multiple degradations through global and local text perception within a unified, shared weight architecture. In particular, a global feature perception module leverages BLIP-produced captions to extract overall scene features and identify primary degradation types, thus promoting generalization across various adverse weather conditions. Complementing this, the local module employs detailed scene descriptions produced by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.