Score Distillation via Reparametrized DDIM
Artem Lukoianov, Haitz S\'aez de Oc\'ariz Borde, Kristjan Greenewald,, Vitor Campagnolo Guizilini, Timur Bagautdinov, Vincent Sitzmann, Justin, Solomon

TL;DR
This paper improves 3D shape generation from 2D diffusion models by reparametrizing SDS with DDIM, reducing over-smoothing and enhancing detail without extra training or supervision.
Contribution
It introduces a novel reparametrization of SDS using DDIM, leading to higher-quality 3D outputs and better understanding of 2D-3D diffusion model relationships.
Findings
Enhanced 3D generation quality close to 2D diffusion models
Reduced over-smoothing and preserved high-frequency details
Achieved comparable or better results than state-of-the-art methods
Abstract
While 2D diffusion models generate realistic, high-detail images, 3D shape generation methods like Score Distillation Sampling (SDS) built on these 2D diffusion models produce cartoon-like, over-smoothed shapes. To help explain this discrepancy, we show that the image guidance used in Score Distillation can be understood as the velocity field of a 2D denoising generative process, up to the choice of a noise term. In particular, after a change of variables, SDS resembles a high-variance version of Denoising Diffusion Implicit Models (DDIM) with a differently-sampled noise term: SDS introduces noise i.i.d. randomly at each step, while DDIM infers it from the previous noise predictions. This excessive variance can lead to over-smoothing and unrealistic outputs. We show that a better noise approximation can be recovered by inverting DDIM in each SDS update step. This modification makes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Neural Networks and Applications
MethodsDiffusion
