SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant Surgical Scene Completion
Yike Zhang, Eduardo Davalos, Jack Noble

TL;DR
This paper introduces SSDD-GAN, a novel single-step denoising diffusion GAN that effectively completes surgical scenes in microscopy images, improving realism and structural similarity without requiring explicit labels.
Contribution
The paper presents a new diffusion GAN model trained via self-supervised learning for surgical scene completion, combining diffusion and adversarial methods for high-quality image restoration.
Findings
Achieved 6% improvement in Structural Similarity Index.
Enabled zero-shot application to synthetic datasets.
Produced realistic, complete surgical scenes without explicit ground-truth labels.
Abstract
Recent deep learning-based image completion methods, including both inpainting and outpainting, have demonstrated promising results in restoring corrupted images by effectively filling various missing regions. Among these, Generative Adversarial Networks (GANs) and Denoising Diffusion Probabilistic Models (DDPMs) have been employed as key generative image completion approaches, excelling in the field of generating high-quality restorations with reduced artifacts and improved fine details. In previous work, we developed a method aimed at synthesizing views from novel microscope positions for mastoidectomy surgeries; however, that approach did not have the ability to restore the surrounding surgical scene environment. In this paper, we propose an efficient method to complete the surgical scene of the synthetic postmastoidectomy dataset. Our approach leverages self-supervised learning on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Speech Recognition and Synthesis
