Zero-Shot Verification-guided Chain of Thoughts

Jishnu Ray Chowdhury; Cornelia Caragea

arXiv:2501.13122·cs.CL·January 24, 2025

Zero-Shot Verification-guided Chain of Thoughts

Jishnu Ray Chowdhury, Cornelia Caragea

PDF

Open Access

TL;DR

This paper introduces a zero-shot approach for LLM-based self-verification of reasoning steps in Chain-of-Thought prompting, eliminating the need for fine-tuning or manual examples, and evaluates its effectiveness across reasoning tasks.

Contribution

The paper proposes new zero-shot prompts for reasoning decomposition and verification, enabling LLMs to self-assess reasoning correctness without prior training or handcrafted examples.

Findings

01

Zero-shot verifiers can classify reasoning correctness effectively.

02

Verifier scores can guide reasoning processes.

03

Method improves reasoning accuracy in mathematical and commonsense tasks.

Abstract

Previous works have demonstrated the effectiveness of Chain-of-Thought (COT) prompts and verifiers in guiding Large Language Models (LLMs) through the space of reasoning. However, most such studies either use a fine-tuned verifier or rely on manually handcrafted few-shot examples. In contrast, in this paper, we focus on LLM-based self-verification of self-generated reasoning steps via COT prompts in a completely zero-shot regime. To explore this setting, we design a new zero-shot prompt, which we call COT STEP, to aid zero-shot decomposition of reasoning steps and design two new zero-shot prompts for LLM-based verifiers. We evaluate the verifiers' ability to classify the correctness of reasoning chains and explore different ways to use verifier scores in guiding reasoning for various mathematical and commonsense reasoning tasks with different LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Imaging Techniques and Applications · Advanced Radiotherapy Techniques

MethodsFocus