PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Zhangyi Liu; Huaizhi Qu; Xiaowei Yin; He Sun; Yanjun Han; Tianlong Chen; Zhun Deng

arXiv:2602.16745·cs.LG·February 20, 2026

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Zhangyi Liu, Huaizhi Qu, Xiaowei Yin, He Sun, Yanjun Han, Tianlong Chen, Zhun Deng

PDF

Open Access

TL;DR

PETS introduces a theoretically grounded framework for optimal trajectory allocation in test-time self-consistency, significantly reducing sampling costs while maintaining high performance in stochastic reasoning models.

Contribution

It proposes a novel optimization framework and algorithms for trajectory allocation, connecting to crowdsourcing theory and providing strong theoretical guarantees.

Findings

01

Achieves perfect self-consistency on GPQA.

02

Reduces sampling budget by up to 75%.

03

Outperforms uniform allocation consistently.

Abstract

Test-time scaling can improve model performance by aggregating stochastic reasoning trajectories. However, achieving sample-efficient test-time self-consistency under a limited budget remains an open challenge. We introduce PETS (Principled and Efficient Test-TimeSelf-Consistency), which initiates a principled study of trajectory allocation through an optimization framework. Central to our approach is the self-consistency rate, a new measure defined as agreement with the infinite-budget majority vote. This formulation makes sample-efficient test-time allocation theoretically grounded and amenable to rigorous analysis. We study both offline and online settings. In the offline regime, where all questions are known in advance, we connect trajectory allocation to crowdsourcing, a classic and well-developed area, by modeling reasoning traces as workers. This perspective allows us to leverage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Topic Modeling · Expert finding and Q&A systems