Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank
Shaofeng Zhang, Xuanqi Chen, Ning Liao, Haoxiang Zhao, Xiaoxing Wang, Haoru Tan, Sitong Wu, Xiaosong Jia, Qi Fan, Junchi Yan

TL;DR
Repulsor introduces a contrastive memory bank framework for generative models that improves training efficiency and quality without external encoders, achieving state-of-the-art results on ImageNet-256.
Contribution
It presents a self-contained, memory bank-based contrastive learning method that accelerates generative modeling without additional inference costs or external models.
Findings
Achieves a state-of-the-art FID of 2.40 on ImageNet-256 within 400k steps.
Enables faster convergence and higher quality in generative models.
Removes dependency on pre-trained encoders, reducing overhead.
Abstract
The dominance of denoising generative models (e.g., diffusion, flow-matching) in visual synthesis is tempered by their substantial training costs and inefficiencies in representation learning. While injecting discriminative representations via auxiliary alignment has proven effective, this approach still faces key limitations: the reliance on external, pre-trained encoders introduces overhead and domain shift. A dispersed-based strategy that encourages strong separation among in-batch latent representations alleviates this specific dependency. To assess the effect of the number of negative samples in generative modeling, we propose {\mname}, a plug-and-play training framework that requires no external encoders. Our method integrates a memory bank mechanism that maintains a large, dynamically updated queue of negative samples across training iterations. This decouples the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning
