Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch

Michael Benigni; Maurizio Ferrari Dacrema; Dietmar Jannach

arXiv:2505.09364·cs.IR·March 30, 2026

Diffusion Recommender Models and the Illusion of Progress: A Concerning Study of Reproducibility and a Conceptual Mismatch

Michael Benigni, Maurizio Ferrari Dacrema, Dietmar Jannach

PDF

TL;DR

This study critically examines recent diffusion recommender models, revealing reproducibility issues, weak baselines, and a conceptual mismatch with traditional recommendation tasks, questioning their claimed progress.

Contribution

It provides a reproducibility assessment of diffusion-based recommendation models, highlighting methodological flaws and mismatches with the recommendation task, and calls for improved scientific rigor.

Findings

01

Only 25% of results are fully reproducible.

02

Well-tuned simple baselines outperform diffusion models.

03

Identified key mismatches between diffusion models and recommendation tasks.

Abstract

Countless new machine learning models are published every year and are reported to significantly advance the state-of-the-art in top-n recommendation. However, earlier reproducibility studies indicate that progress in this area may be quite limited, due to widespread methodological issues, e.g., comparisons with untuned baseline models, creating an illusion of progress. In this work, we examine whether these problems persist in today's research by attempting to reproduce nine SIGIR 2023 and 2024 recommendation algorithms based on Denoising Diffusion Probabilistic Models, a recent but rapidly expanding research area. Only 25% of reported results are fully reproducible and, since the original papers relied on weak baselines, they do not establish the superiority of diffusion models over state-of-the-art methods. In our controlled evaluations, well-tuned simpler baselines consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.