A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Xianchao Guan, Zhiyuan Fan, Yifeng Wang, Fuqiang Chen, Yanjiang Zhou, Zengyang Che, Hongxue Meng, Xin Li, Yaowei Wang, Hongpeng Wang, Min Zhang, Heng Tao Shen, Zheng Zhang, Yongbing Zhang

TL;DR
This paper introduces CRAFTS, a novel generative model for pathology image synthesis that ensures semantic accuracy and diversity, significantly aiding clinical AI development and diagnostic tasks.
Contribution
CRAFTS is the first pathology-specific text-to-image generative model with a novel alignment mechanism to reduce semantic drift, trained on 2.8 million image-caption pairs.
Findings
Generates diverse, high-quality pathological images across 30 cancer types.
Improves performance in classification, retrieval, and other clinical tasks.
Enables precise tissue architecture control using additional inputs.
Abstract
The development of clinical-grade artificial intelligence in pathology is limited by the scarcity of diverse, high-quality annotated datasets. Generative models offer a potential solution but suffer from semantic instability and morphological hallucinations that compromise diagnostic reliability. To address this challenge, we introduce a Correlation-Regulated Alignment Framework for Tissue Synthesis (CRAFTS), the first generative foundation model for pathology-specific text-to-image synthesis. By leveraging a dual-stage training strategy on approximately 2.8 million image-caption pairs, CRAFTS incorporates a novel alignment mechanism that suppresses semantic drift to ensure biological accuracy. This model generates diverse pathological images spanning 30 cancer types, with quality rigorously validated by objective metrics and pathologist evaluations. Furthermore, CRAFTS-augmented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
