SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction

Italo Felix Santos; Gilson Antonio Giraldi; Heron Werner Junior

arXiv:2604.00298·cs.CV·April 2, 2026

SANA I2I: A Text Free Flow Matching Framework for Paired Image to Image Translation with a Case Study in Fetal MRI Artifact Reduction

Italo Felix Santos, Gilson Antonio Giraldi, Heron Werner Junior

PDF

TL;DR

SANA-I2I is a novel text-free, flow-based image-to-image translation framework that effectively reduces fetal MRI motion artifacts using paired data, without relying on textual prompts.

Contribution

It introduces a purely image-based conditional flow-matching model for high-resolution medical image translation, removing the need for textual conditioning.

Findings

01

Effectively suppresses fetal MRI motion artifacts

02

Preserves anatomical structures during translation

03

Achieves competitive performance with few inference steps

Abstract

We propose SANA-I2I, a text-free high-resolution image-to-image generation framework that extends the SANA family by removing textual conditioning entirely. In contrast to SanaControlNet, which combines text and image-based control, SANA-I2I relies exclusively on paired source-target images to learn a conditional flow-matching model in latent space. The model learns a conditional velocity field that maps a target image distribution to another one, enabling supervised image translation without reliance on language prompts. We evaluate the proposed approach on the challenging task of fetal MRI motion artifact reduction. To enable paired training in this application, where real paired data are difficult to acquire, we adopt a synthetic data generation strategy based on the method proposed by Duffy et al., which simulates realistic motion artifacts in fetal magnetic resonance imaging (MRI).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.