SynerMedGen: Synergizing Medical Multimodal Understanding with Generation via Task Alignment

Weiren Zhao; Yi Dong; Cheng Chen

arXiv:2605.08724·cs.CV·May 12, 2026

SynerMedGen: Synergizing Medical Multimodal Understanding with Generation via Task Alignment

Weiren Zhao, Yi Dong, Cheng Chen

PDF

1 Repo

TL;DR

SynerMedGen introduces a unified medical model that aligns understanding and generation tasks, achieving strong zero-shot performance and outperforming specialized models in medical image synthesis.

Contribution

It proposes generation-aligned understanding and a two-stage training strategy, enhancing medical image synthesis and generalization in a unified framework.

Findings

01

Achieves strong zero-shot performance across 22 medical image synthesis tasks.

02

Outperforms state-of-the-art specialized and unified medical models.

03

Demonstrates robust generalization to unseen datasets.

Abstract

Unifying multimodal understanding and generation is a compelling frontier that is beginning to emerge in the medical field. However, the limited existing unified medical models typically treat understanding and generation as disjoint objectives, lacking a meaningful functional synergy. In this work, we identify and address a critical question in unified medical modeling: what form of understanding truly benefits generation. We present SynerMedGen, a unified framework built on the proposed principle of generation-aligned understanding, which synergizes understanding objectives with generation tasks via task alignment. SynerMedGen introduces three generation-aligned understanding tasks and a two-stage training strategy that transfers generation-beneficial representations learned during understanding training to medical image synthesis. Remarkably, even with understanding training alone,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Mhilab/SynerMedGen
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.