FTPFusion: Frequency-Aware Infrared and Visible Video Fusion with Temporal Perturbation

Xilai Li; Chusheng Fang; Xiaosong Li

arXiv:2604.01900·cs.CV·April 3, 2026

FTPFusion: Frequency-Aware Infrared and Visible Video Fusion with Temporal Perturbation

Xilai Li, Chusheng Fang, Xiaosong Li

PDF

1 Repo

TL;DR

FTPFusion is a novel frequency-aware video fusion method that enhances spatial details and temporal stability in infrared and visible videos by decomposing features and applying perturbation strategies.

Contribution

It introduces a frequency-aware framework with temporal perturbation and cross-modal interaction to improve video fusion robustness and quality.

Findings

01

Outperforms state-of-the-art methods on multiple benchmarks.

02

Improves spatial fidelity and temporal consistency.

03

Effectively handles flickering, jitter, and misalignment.

Abstract

Infrared and visible video fusion plays a critical role in intelligent surveillance and low-light monitoring. However, maintaining temporal stability while preserving spatial detail remains a fundamental challenge. Existing methods either focus on frame-wise enhancement with limited temporal modeling or rely on heavy spatio-temporal aggregation that often sacrifices high-frequency details. In this paper, we propose FTPFusion, a frequency-aware infrared and visible video fusion method based on temporal perturbation and sparse cross-modal interaction. Specifically, FTPFusion decomposes the feature representations into high-frequency and low-frequency components for collaborative modeling. The high-frequency branch performs sparse cross-modal spatio-temporal interaction to capture motion-related context and complementary details. The low-frequency branch introduces a temporal perturbation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ixilai/FTPFusion
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.