SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant   Surgical Scene Completion

Yike Zhang; Eduardo Davalos; Jack Noble

arXiv:2502.05710·cs.CV·February 11, 2025

SSDD-GAN: Single-Step Denoising Diffusion GAN for Cochlear Implant Surgical Scene Completion

Yike Zhang, Eduardo Davalos, Jack Noble

PDF

Open Access

TL;DR

This paper introduces SSDD-GAN, a novel single-step denoising diffusion GAN that effectively completes surgical scenes in microscopy images, improving realism and structural similarity without requiring explicit labels.

Contribution

The paper presents a new diffusion GAN model trained via self-supervised learning for surgical scene completion, combining diffusion and adversarial methods for high-quality image restoration.

Findings

01

Achieved 6% improvement in Structural Similarity Index.

02

Enabled zero-shot application to synthetic datasets.

03

Produced realistic, complete surgical scenes without explicit ground-truth labels.

Abstract

Recent deep learning-based image completion methods, including both inpainting and outpainting, have demonstrated promising results in restoring corrupted images by effectively filling various missing regions. Among these, Generative Adversarial Networks (GANs) and Denoising Diffusion Probabilistic Models (DDPMs) have been employed as key generative image completion approaches, excelling in the field of generating high-quality restorations with reduced artifacts and improved fine details. In previous work, we developed a method aimed at synthesizing views from novel microscope positions for mastoidectomy surgeries; however, that approach did not have the ability to restore the surrounding surgical scene environment. In this paper, we propose an efficient method to complete the surgical scene of the synthetic postmastoidectomy dataset. Our approach leverages self-supervised learning on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Speech Recognition and Synthesis