TL;DR
SynerMedGen introduces a unified medical model that aligns understanding and generation tasks, achieving strong zero-shot performance and outperforming specialized models in medical image synthesis.
Contribution
It proposes generation-aligned understanding and a two-stage training strategy, enhancing medical image synthesis and generalization in a unified framework.
Findings
Achieves strong zero-shot performance across 22 medical image synthesis tasks.
Outperforms state-of-the-art specialized and unified medical models.
Demonstrates robust generalization to unseen datasets.
Abstract
Unifying multimodal understanding and generation is a compelling frontier that is beginning to emerge in the medical field. However, the limited existing unified medical models typically treat understanding and generation as disjoint objectives, lacking a meaningful functional synergy. In this work, we identify and address a critical question in unified medical modeling: what form of understanding truly benefits generation. We present SynerMedGen, a unified framework built on the proposed principle of generation-aligned understanding, which synergizes understanding objectives with generation tasks via task alignment. SynerMedGen introduces three generation-aligned understanding tasks and a two-stage training strategy that transfers generation-beneficial representations learned during understanding training to medical image synthesis. Remarkably, even with understanding training alone,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
