SimGen: A Diffusion-Based Framework for Simultaneous Surgical Image and Segmentation Mask Generation
Aditya Bhat, Rupak Bose, Chinedu Innocent Nwoye, Nicolas Padoy

TL;DR
SimGen introduces a diffusion-based framework that simultaneously generates high-quality surgical images and their segmentation masks, reducing reliance on manual annotations for surgical AI applications.
Contribution
The paper presents SimGen, a novel diffusion model that jointly generates surgical images and their segmentation masks. It incorporates cross-correlation priors and a Canonical Fibonacci Lattice (CFL) to improve generation quality and class separation.
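To make the idea of jointly diffusing an image and its mask concrete, here is a minimal sketch of the standard DDPM forward (noising) step applied to a single pixel whose image and mask colors are stacked into one vector. The channel stacking, the function names, and the linear schedule values (the common DDPM defaults) are assumptions for illustration; this summary does not state how SimGen combines the two modalities.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule; the default endpoints are the usual DDPM choices."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + step * t for t in range(num_steps)]


def cumulative_alphas(betas):
    """Running product of (1 - beta_t), usually written alpha-bar_t."""
    alpha_bars, product = [], 1.0
    for beta in betas:
        product *= 1.0 - beta
        alpha_bars.append(product)
    return alpha_bars


def noise_joint_sample(image_px, mask_px, alpha_bar_t, rng):
    """Sample x_t ~ q(x_t | x_0) for a stacked image+mask pixel.

    Assumption: image and mask values are simply concatenated so that both
    receive noise at the same timestep. Returns the noisy vector and the
    noise that was added (the usual regression target for the denoiser).
    """
    x0 = list(image_px) + list(mask_px)
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    signal = math.sqrt(alpha_bar_t)
    noise = math.sqrt(1.0 - alpha_bar_t)
    xt = [signal * v + noise * e for v, e in zip(x0, eps)]
    return xt, eps


if __name__ == "__main__":
    alpha_bars = cumulative_alphas(linear_beta_schedule(1000))
    rng = random.Random(0)
    # Hypothetical pixel: image RGB and mask RGB, both scaled to [-1, 1].
    xt, eps = noise_joint_sample([0.2, -0.1, 0.5], [1.0, -1.0, -1.0],
                                 alpha_bars[500], rng)
    print(len(xt), "channels after stacking")
```

A denoising network trained on such stacked vectors sees image and mask channels at the same noise level, which is one simple way for it to learn dependencies between them.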
Findings
Outperforms baselines in image and mask quality metrics
CFL enhances mask class separability and spatial uniformity
Generated data is suitable for downstream surgical AI tasks
Abstract
Acquiring and annotating surgical data is often resource-intensive and ethically constrained, and it requires significant expert involvement. While generative AI models such as text-to-image models can alleviate data scarcity, incorporating spatial annotations, such as segmentation masks, is crucial for precision-driven surgical applications, simulation, and education. This study introduces both a novel task and a method, SimGen, for Simultaneous Image and Mask Generation. SimGen is a diffusion model based on the DDPM framework and a Residual U-Net, designed to jointly generate high-fidelity surgical images and their corresponding segmentation masks. The model leverages cross-correlation priors to capture dependencies between continuous image and discrete mask distributions. Additionally, a Canonical Fibonacci Lattice (CFL) is employed to enhance class separability and uniformity in the RGB space of the…
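The role of a Fibonacci lattice in separating mask classes can be illustrated with a short sketch. The code below uses one common construction of a Fibonacci lattice on the unit sphere and maps each point affinely into the RGB cube, so that every class receives a well-separated color; a noisy generated mask pixel can then be decoded to its nearest class color. The sphere-to-RGB mapping, the function names, and the nearest-color decoding are assumptions for illustration, since the abstract excerpt does not specify SimGen's exact construction.

```python
import math


def fibonacci_lattice_colors(num_classes):
    """Return one RGB triple per class from a spherical Fibonacci lattice."""
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    colors = []
    for i in range(num_classes):
        # Evenly spaced heights in (-1, 1), rotated by the golden angle.
        z = 1.0 - (2.0 * i + 1.0) / num_classes
        radius = math.sqrt(1.0 - z * z)
        theta = golden_angle * i
        x, y = radius * math.cos(theta), radius * math.sin(theta)
        # Assumption: affine map from [-1, 1]^3 to the RGB cube [0, 255]^3.
        colors.append(tuple(round(127.5 * (c + 1.0)) for c in (x, y, z)))
    return colors


def min_pairwise_distance(colors):
    """Smallest Euclidean distance between any two class colors."""
    return min(
        math.dist(a, b)
        for idx, a in enumerate(colors)
        for b in colors[idx + 1:]
    )


def nearest_class(pixel, colors):
    """Decode a (possibly noisy) RGB pixel to the index of the closest color."""
    return min(range(len(colors)), key=lambda k: math.dist(pixel, colors[k]))


if __name__ == "__main__":
    palette = fibonacci_lattice_colors(8)
    for class_id, rgb in enumerate(palette):
        print(class_id, rgb)
    print("minimum separation:", min_pairwise_distance(palette))
```

Because the lattice points are spread nearly uniformly, the minimum distance between class colors stays large, which leaves a margin for decoding mask pixels that a generative model reproduces only approximately.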
Taxonomy
Topics: Surgical Simulation and Training · Medical Image Segmentation Techniques · Medical Imaging and Analysis
Methods: Max Pooling · Convolution · Concatenated Skip Connection · U-Net · Diffusion
