PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
Jingcheng Hu, Yinmin Zhang, Shijie Shang, Xiaobo Yang, Yue Peng, Zhewei Huang, Hebin Zhou, Xin Wu, Jie Cheng, Fanqi Wan, Xiangwen Kong, Chengyuan Yao, Kaiwen Yan, Ailin Huang, Hongyu Zhou, Qi Han, Zheng Ge, Daxin Jiang, Xiangyu Zhang, Heung-Yeung Shum

TL;DR
PaCoRe introduces a parallel reasoning framework that significantly enhances test-time compute scalability in language models, enabling multi-million-token reasoning and surpassing existing systems in mathematics benchmarks.
Contribution
The paper presents a novel parallel coordinated reasoning framework that scales test-time compute beyond traditional sequential methods using message-passing and reinforcement learning.
Findings
Achieves 94.5% on HMMT 2025, surpassing GPT-5.
Scales effective TTC to roughly two million tokens.
Demonstrates strong improvements across diverse domains.
Abstract
We introduce Parallel Coordinated Reasoning (PaCoRe), a training-and-inference framework designed to overcome a central limitation of contemporary language models: their inability to scale test-time compute (TTC) far beyond sequential reasoning under a fixed context window. PaCoRe departs from the traditional sequential paradigm by driving TTC through massive parallel exploration coordinated via a message-passing architecture in multiple rounds. Each round launches many parallel reasoning trajectories, compacts their findings into context-bounded messages, and synthesizes these messages to guide the next round and ultimately produce the final answer. Trained end-to-end with large-scale, outcome-based reinforcement learning, the model masters the synthesis abilities required by PaCoRe and scales to multi-million-token effective TTC without exceeding context limits. The approach yields…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗stepfun-ai/Step-3.5-Flashmodel· 93k dl· ♡ 75493k dl♡ 754
- 🤗stepfun-ai/Step-3.5-Flash-Base-Midtrainmodel· 242 dl· ♡ 39242 dl♡ 39
- 🤗stepfun-ai/Step-3.5-Flash-Basemodel· 802 dl· ♡ 81802 dl♡ 81
- 🤗stepfun-ai/Step-3.5-Flash-FP8model· 298k dl· ♡ 51298k dl♡ 51
- 🤗stepfun-ai/PaCoRe-8Bmodel· 507 dl· ♡ 46507 dl♡ 46
- 🤗stepfun-ai/RLVR-8B-0926model· 840 dl· ♡ 8840 dl♡ 8
- 🤗stepfun-ai/Step-3.5-Flash-GGUF-Q4_K_Smodel· 15k dl· ♡ 14015k dl♡ 140
- 🤗void-818/Affine-luca_v9-5CtFSMCbvHryns4E7YrACNDyFYAcxGU9SkokGPHiJuvPNUcimodel· 25 dl25 dl
- 🤗milkowski/Step-3.5-Flash-GGUFmodel· 6 dl· ♡ 26 dl♡ 2
- 🤗stepfun-ai/Step-3.5-Flash-GGUF-Q8_0model· 232 dl· ♡ 3232 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques
