Structured Diffusion Bridges: Inductive Bias for Denoising Diffusion Bridges
Eitan Kosman, Gabriele Serussi, Chaim Baskin

TL;DR
This paper introduces a diffusion-bridge framework for modality translation that relaxes the need for fully paired datasets, achieving high-quality results even with limited or no pairing.
Contribution
It presents a novel diffusion-bridge approach that characterizes and restricts the solution space using alignment constraints, reducing reliance on paired data.
Findings
Achieves near fully-paired quality with less pairing requirement
Performs well across unpaired, semi-paired, and paired regimes
Highlights diffusion bridges as flexible for modality translation
Abstract
Modality translation is inherently under-constrained, as multiple cross-modal mappings may yield the same marginals. Recent work has shown that diffusion bridges are effective for this task. However, most existing approaches rely on fully paired datasets, thereby imposing a single data-driven constraint. We propose a diffusion-bridge framework that characterizes the space of admissible solutions and restricts it via alignment constraints, treating paired supervision as an optional heuristic rather than a prerequisite. We validate our method on synthetic and real modality translation benchmarks across unpaired, semi-paired, and paired regimes, showing consistent performance across supervision levels. Notably, \textbf{it achieves near fully-paired quality with a substantial relaxation in pairing requirements, and remaining applicable in the unpaired regime}. These results highlight…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
