Dyve: Thinking Fast and Slow for Dynamic Process Verification

Jianyuan Zhong; Zeju Li; Zhijian Xu; Xiangyu Wen; Qiang Xu

arXiv:2502.11157·cs.AI·February 18, 2025

Dyve: Thinking Fast and Slow for Dynamic Process Verification

Jianyuan Zhong, Zeju Li, Zhijian Xu, Xiangyu Wen, Qiang Xu

PDF

Open Access 1 Repo 1 Models 1 Datasets 1 Video

TL;DR

Dyve is a dynamic process verifier that combines fast and slow reasoning methods, using a novel supervision technique to improve large language model verification accuracy on complex tasks.

Contribution

It introduces a novel adaptive reasoning framework inspired by Kahneman's theory, with a step-wise consensus filtering method for high-quality supervision signals.

Findings

01

Outperforms existing process-based verifiers on ProcessBench and MATH datasets.

02

Significantly improves performance in Best-of-N settings.

03

Demonstrates effective integration of fast and slow reasoning in LLM verification.

Abstract

We present Dyve, a dynamic process verifier that enhances reasoning error detection in large language models by integrating fast and slow thinking, inspired by Kahneman's Systems Theory. Dyve adaptively applies immediate token-level confirmation System 1 for straightforward steps and comprehensive analysis System 2 for complex ones. Leveraging a novel step-wise consensus-filtered process supervision technique, combining Monte Carlo estimation with LLM based evaluation, Dyve curates high-quality supervision signals from noisy data. Experimental results on ProcessBench and the MATH dataset confirm that Dyve significantly outperforms existing process-based verifiers and boosts performance in Best-of-N settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

staymylove/Dyve
noneOfficial

Models

🤗
Jianyuan1/deepseek-r1-14b-cot-math-reasoning-full
model· 11 dl· ♡ 2
11 dl♡ 2

Datasets

Jianyuan1/cot-data
dataset· 105 dl
105 dl

Videos

Dyve: Thinking Fast and Slow for Dynamic Process Verification· underline

Taxonomy

TopicsBusiness Process Modeling and Analysis · Simulation Techniques and Applications · Manufacturing Process and Optimization