Diffusion Mental Averages

Phonphrm Thawatdamrongkit; Sukit Seripanitkarn; Supasorn Suwajanakorn

arXiv:2603.29239·cs.CV·April 1, 2026

Diffusion Mental Averages

Phonphrm Thawatdamrongkit, Sukit Seripanitkarn, Supasorn Suwajanakorn

PDF

TL;DR

This paper introduces Diffusion Mental Averages (DMA), a novel method for generating sharp, realistic concept averages within diffusion models by aligning denoising trajectories, extending to multimodal concepts with clustering and adaptation techniques.

Contribution

DMA is the first approach to produce consistent, realistic averages within diffusion models, capturing abstract concepts and serving as a visual summary and bias analysis tool.

Findings

01

DMA produces sharp, realistic concept averages.

02

The method extends to multimodal concepts using clustering and adaptation.

03

DMA offers insights into model biases and concept representations.

Abstract

Can a diffusion model produce its own "mental average" of a concept-one that is as sharp and realistic as a typical sample? We introduce Diffusion Mental Averages (DMA), a model-centric answer to this question. While prior methods aim to average image collections, they produce blurry results when applied to diffusion samples from the same prompt. These data-centric techniques operate outside the model, ignoring the generative process. In contrast, DMA averages within the diffusion model's semantic space, as discovered by recent studies. Since this space evolves across timesteps and lacks a direct decoder, we cast averaging as trajectory alignment: optimize multiple noise latents so their denoising trajectories progressively converge toward shared coarse-to-fine semantics, yielding a single sharp prototype. We extend our approach to multimodal concepts (e.g., dogs with many breeds) by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.