CC-Pan: Channel-wise Compression based Diffusion for Efficient Pan-Sharpening
Junjie Li, Congyang Ou, Haokui Zhang, Guoting Wei, Shengqin Jiang, Ying Li

TL;DR
CC-Pan introduces a cross-sensor latent diffusion framework with a band-wise VAE and a cross-band attention module, enabling efficient, high-precision pan-sharpening across different sensors with faster inference.
Contribution
It proposes a novel latent diffusion approach with a band-wise VAE and cross-band attention, supporting multiple sensors without retraining and improving efficiency and accuracy.
Findings
Outperforms state-of-the-art diffusion-based methods on GaoFen-2, QuickBird, and WorldView-3 datasets.
Achieves 2-3 times faster inference speed.
Demonstrates robust cross-sensor generalization without sensor-specific retraining.
Abstract
Recently, diffusion models have brought novel insights to pan-sharpening and notably boosted fusion precision. However, most existing models perform diffusion in the pixel space and train distinct models for different multispectral (MS) sensors, suffering from high inference latency and sensor-specific limitations. In this paper, we present CC-Pan, a cross-sensor latent diffusion framework for efficient pan-sharpening. Specifically, CC-Pan trains a band-wise single-channel variational autoencoder (VAE) to encode high-resolution multispectral (HRMS) images into compact latent representations, naturally supporting MS images with varying band counts across different sensors and establishing a basis for inference acceleration. Spectral physical properties, along with PAN and MS images, are then injected into the diffusion backbone through carefully designed unidirectional and bidirectional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
