EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track

Zhenyuan Chen; Guanyuan Shen; Feng Zhang

arXiv:2603.06753·cs.CV·April 1, 2026

EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track

Zhenyuan Chen, Guanyuan Shen, Feng Zhang

PDF

1 Repo

TL;DR

EarthBridge introduces advanced diffusion and contrastive learning methods for high-fidelity cross-modal aerial image translation, achieving top performance in the MAVIC-T challenge.

Contribution

The paper presents EarthBridge, a novel framework combining diffusion models and contrastive learning for improved multi-modal aerial image translation.

Findings

01

Achieved second place in MAVIC-T leaderboard with a score of 0.38.

02

Demonstrated superior spatial detail and spectral accuracy across challenge tasks.

03

Utilized specialized training techniques like bridge scalings and booting noise.

Abstract

Cross-modal image-to-image translation among Electro-Optical (EO), Infrared (IR), and Synthetic Aperture Radar (SAR) sensors is essential for comprehensive multi-modal aerial-view analysis. However, translating between these modalities is notoriously difficult due to their distinct electromagnetic signatures and geometric characteristics. This paper presents \textbf{EarthBridge}, a high-fidelity translation framework developed for the 4th Multi-modal Aerial View Image Challenge -- Translation (MAVIC-T). We explore two distinct methodologies: \textbf{Diffusion Bridge Implicit Models (DBIM)}, which we generalize using non-Markovian bridge processes for high-quality deterministic sampling, and \textbf{Contrastive Unpaired Translation (CUT)}, which utilizes contrastive learning for structural consistency. Our EarthBridge framework employs a channel-concatenated UNet denoiser trained with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Bili-Sakura/EarthBridge-Preview
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.