Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Shaofeng Zhang; Xuanqi Chen; Ning Liao; Haoxiang Zhao; Xiaoxing Wang; Haoru Tan; Sitong Wu; Xiaosong Jia; Qi Fan; Junchi Yan

arXiv:2512.08648·cs.CV·December 16, 2025

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Shaofeng Zhang, Xuanqi Chen, Ning Liao, Haoxiang Zhao, Xiaoxing Wang, Haoru Tan, Sitong Wu, Xiaosong Jia, Qi Fan, Junchi Yan

PDF

Open Access

TL;DR

Repulsor introduces a contrastive memory bank framework for generative models that improves training efficiency and quality without external encoders, achieving state-of-the-art results on ImageNet-256.

Contribution

It presents a self-contained, memory bank-based contrastive learning method that accelerates generative modeling without additional inference costs or external models.

Findings

01

Achieves a state-of-the-art FID of 2.40 on ImageNet-256 within 400k steps.

02

Enables faster convergence and higher quality in generative models.

03

Removes dependency on pre-trained encoders, reducing overhead.

Abstract

The dominance of denoising generative models (e.g., diffusion, flow-matching) in visual synthesis is tempered by their substantial training costs and inefficiencies in representation learning. While injecting discriminative representations via auxiliary alignment has proven effective, this approach still faces key limitations: the reliance on external, pre-trained encoders introduces overhead and domain shift. A dispersed-based strategy that encourages strong separation among in-batch latent representations alleviates this specific dependency. To assess the effect of the number of negative samples in generative modeling, we propose {\mname}, a plug-and-play training framework that requires no external encoders. Our method integrates a memory bank mechanism that maintains a large, dynamically updated queue of negative samples across training iterations. This decouples the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning