Skipping the Zeros in Diffusion Models for Sparse Data Generation
Phil Sidney Ostheimer, Mayank Nagda, Andriy Balinskyy, Gabriel Vicente Rodrigues, Jean Radig, Carl Herrmann, Stephan Mandt, Marius Kloft, Sophie Fellenz

TL;DR
SED is a novel diffusion model that efficiently handles sparse data by modeling only non-zero values, saving computation and improving quality across various benchmarks.
Contribution
Introducing Sparsity-Exploiting Diffusion (SED), a method that preserves sparsity and reduces computation in diffusion models for sparse data.
Findings
SED matches or surpasses traditional DMs on benchmarks.
SED reduces unnecessary computation on zero entries.
Vision experiments highlight benefits of sparse modeling.
Abstract
Diffusion models (DMs) excel on dense continuous data, but are not designed for sparse continuous data. They do not model exact zeros that represent the deliberate absence of a signal. As a result, they erase sparsity patterns and perform unnecessary computation on mostly zero entries. With Sparsity-Exploiting Diffusion (SED), we model only non-zero values, preserving sparsity. SED delivers computational savings while maintaining or improving generation quality by skipping zeros during training and inference. Across physics and biology benchmarks, SED matches or surpasses conventional DMs and domain-specific baselines, while vision experiments provide intuitive insights into the limitations of dense DMs and the benefits of SED.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
