Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models

Taehoon Yoon; Yunhong Min; Kyeongmin Yeo; Minhyuk Sung

arXiv:2506.01320·cs.LG·October 28, 2025

Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models

Taehoon Yoon, Yunhong Min, Kyeongmin Yeo, Minhyuk Sung

PDF

Open Access

TL;DR

Psi-Sampler introduces a novel SMC-based framework with pCNL initialization for more effective inference-time reward alignment in score-based generative models, improving sampling efficiency and performance across various tasks.

Contribution

It proposes the pCNL algorithm for high-dimensional posterior sampling and demonstrates its effectiveness in reward alignment tasks, advancing score-based generative modeling.

Findings

01

Improved reward alignment performance in experiments

02

Efficient sampling in high-dimensional latent spaces

03

Consistent performance gains across multiple tasks

Abstract

We introduce $Ψ$ -Sampler, an SMC-based framework incorporating pCNL-based initial particle sampling for effective inference-time reward alignment with a score-based generative model. Inference-time reward alignment with score-based generative models has recently gained significant traction, following a broader paradigm shift from pre-training to post-training optimization. At the core of this trend is the application of Sequential Monte Carlo (SMC) to the denoising process. However, existing methods typically initialize particles from the Gaussian prior, which inadequately captures reward-relevant regions and results in reduced sampling efficiency. We demonstrate that initializing from the reward-aware posterior significantly improves alignment performance. To enable posterior sampling in high-dimensional latent spaces, we introduce the preconditioned Crank-Nicolson Langevin (pCNL)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods