Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Jinbin Bai; Yixuan Li; Yuchen Zhu; Yi Xin; Qingyu Shi; Aosong Feng; Xiaohong Liu; Molei Tao; Jianru Xue; Xiangtai Li; Ming-Hsuan Yang

arXiv:2602.01842 · cs.LG · May 6, 2026

TL;DR

Prism introduces a hierarchical search and self-verification framework to improve test-time scaling for discrete diffusion language models, enhancing efficiency and performance in reasoning and code generation tasks.

Contribution

The paper proposes a test-time scaling method tailored to discrete diffusion language models that combines hierarchical trajectory search, local branching with partial remasking, and self-verification.

Findings

01. Achieves comparable performance to best-of-N methods with fewer function evaluations.

02. Demonstrates effectiveness across multiple benchmarks and models.

03. Reduces computational cost while maintaining high-quality outputs.
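The comparison with best-of-N in terms of function evaluations can be illustrated with simple bookkeeping. The sketch below is not the paper's accounting: it assumes one function evaluation is one denoising step for one trajectory, ignores any cost of verification or local branching, and uses a hypothetical fixed prune point. The function names and all numbers are illustrative only.

```python
from typing import List, Sequence


def best_of_n_nfe(n: int, steps: int) -> int:
    """Best-of-N runs all n trajectories for every denoising step."""
    return n * steps


def prune_schedule(n: int, steps: int, prune_at: int, survivors: int) -> List[int]:
    """Hypothetical schedule: n live trajectories before step `prune_at`,
    then only `survivors` trajectories for the remaining steps."""
    return [n if t < prune_at else survivors for t in range(steps)]


def pruned_search_nfe(widths: Sequence[int]) -> int:
    """Total evaluations when widths[t] trajectories are alive at step t."""
    return sum(widths)


# Illustrative numbers only, not results from the paper.
widths = prune_schedule(n=8, steps=10, prune_at=4, survivors=2)
print(best_of_n_nfe(8, 10), pruned_search_nfe(widths))
```

Under these assumptions, pruning early shrinks the budget because most denoising steps are run only for the surviving trajectories.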

Abstract

Inference-time compute has re-emerged as a practical way to improve LLM reasoning. Most test-time scaling (TTS) algorithms rely on autoregressive decoding, which is ill-suited to discrete diffusion language models (dLLMs) because they decode the entire sequence in parallel. As a result, developing effective and efficient TTS methods that unlock the full generative potential of dLLMs remains an underexplored challenge. To address this, we propose Prism (Pruning, Remasking, and Integrated Self-verification Method), an efficient TTS framework for dLLMs that (i) performs Hierarchical Trajectory Search (HTS), which dynamically prunes and reallocates compute in an early-to-mid denoising window, (ii) introduces local branching with partial remasking to explore diverse implementations while preserving high-confidence tokens, and (iii) replaces external verifiers with Self-Verified Feedback (SVF)…
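The partial-remasking idea in (ii) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each token carries a scalar confidence, and the `MASK` string, the `keep_ratio` parameter, and the function name are hypothetical. A real dLLM would operate on token ids and model-derived probabilities.

```python
from typing import List, Sequence

MASK = "<mask>"  # hypothetical mask token; real models use their own mask id


def partial_remask(
    tokens: Sequence[str],
    confidences: Sequence[float],
    keep_ratio: float,
) -> List[str]:
    """Keep the highest-confidence tokens and replace the rest with MASK."""
    if len(tokens) != len(confidences):
        raise ValueError("tokens and confidences must have the same length")
    if not 0.0 <= keep_ratio <= 1.0:
        raise ValueError("keep_ratio must lie in [0, 1]")

    n_keep = int(keep_ratio * len(tokens))
    # Rank positions by confidence (descending); ties go to the earlier position.
    order = sorted(range(len(tokens)), key=lambda i: (-confidences[i], i))
    keep = set(order[:n_keep])
    return [tok if i in keep else MASK for i, tok in enumerate(tokens)]


# Toy example: the two most confident tokens survive, the others are remasked.
branch = partial_remask(["def", "f", "(", ")"], [0.9, 0.2, 0.8, 0.1], keep_ratio=0.5)
print(branch)
```

Each remasked copy can then be denoised again to give a different completion around the same high-confidence skeleton, which is the sense in which remasking creates local branches.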

Code & Models

Repository: viiika/Prism (GitHub)
