TL;DR
This paper introduces a structure-adaptive sparse diffusion framework for 3D medical image enhancement that accelerates training and preserves anatomical details across CT, PET, and MRI modalities.
Contribution
It proposes a novel sparse voxel-space diffusion method with structure-aware trajectory modulation for efficient and detailed 3D medical image enhancement.
Findings
Achieves up to 10x training acceleration.
Demonstrates state-of-the-art results on denoising and super-resolution.
Operates effectively across multiple medical imaging modalities.
Abstract
Three-dimensional (3D) medical image enhancement, including denoising and super-resolution, is critical for clinical diagnosis in CT, PET, and MRI. Although diffusion models have shown remarkable success in 2D medical imaging, scaling them to high-resolution 3D volumes remains computationally prohibitive due to lengthy diffusion trajectories over high-dimensional volumetric data. We observe that in conditional enhancement, strong anatomical priors in the degraded input render dense noise schedules largely redundant. Leveraging this insight, we propose a sparse voxel-space diffusion framework that trains and samples on a compact set of uniformly subsampled timesteps. The network predicts clean data directly on the data manifold, supervised in velocity space for stable gradient scaling. A lightweight Structure-aware Trajectory Modulation (STM) module recalibrates time embeddings at each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
