DCR: Counterfactual Attractor Guidance for Rare Compositional Generation
Taewon Kang, Matthias Zwicker

TL;DR
DCR is a training-free method that improves rare compositional image generation by explicitly modeling and suppressing default completion bias during diffusion sampling.
Contribution
It introduces a novel counterfactual guidance framework that enhances rare composition fidelity without retraining or modifying diffusion model architectures.
Findings
DCR improves compositional fidelity on rare prompts.
DCR maintains high visual quality in generated images.
The framework reveals intrinsic model biases.
Abstract
Diffusion models generate realistic visual content, yet often fail to produce rare but plausible compositions. When prompted with combinations that are valid but underrepresented in training data, such as a snowy beach or a rainbow at night, the generation process frequently collapses toward more common alternatives. We identify this failure mode as default completion bias, where denoising trajectories are implicitly attracted toward high-frequency semantic configurations. Existing guidance mechanisms do not explicitly model this competing tendency and therefore struggle to prevent such collapse. We introduce Default Completion Repulsion (DCR), a training-free framework that explicitly models and suppresses default completion behavior. DCR constructs a counterfactual attractor by relaxing the rare compositional factor while preserving surrounding semantics, inducing an alternative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
