Faithful Chain-of-Thought Reasoning

Qing Lyu; Shreya Havaldar; Adam Stein; Li Zhang; Delip Rao; Eric Wong,; Marianna Apidianaki; Chris Callison-Burch

arXiv:2301.13379·cs.CL·September 22, 2023·23 cites

Faithful Chain-of-Thought Reasoning

Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong,, Marianna Apidianaki, Chris Callison-Burch

PDF

Open Access 1 Repo

TL;DR

Faithful CoT enhances reasoning interpretability and accuracy by translating natural language queries into symbolic reasoning, then solving deterministically, outperforming standard CoT on multiple benchmarks and setting new state-of-the-art results.

Contribution

It introduces a two-stage Faithful CoT framework that ensures reasoning faithfulness and improves empirical performance across diverse reasoning tasks.

Findings

01

Outperforms standard CoT on 9 of 10 benchmarks

02

Achieves 6.3% accuracy gain on Math Word Problems

03

Sets new state-of-the-art few-shot performance with GPT-4 and Codex

Abstract

While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query $\to$ symbolic reasoning chain) and Problem Solving (reasoning chain $\to$ answer), using an LM and a deterministic solver respectively. This guarantees that the reasoning chain provides a faithful explanation of the final answer. Aside from interpretability, Faithful CoT also improves empirical performance: it outperforms standard CoT on 9 of 10 benchmarks from 4 diverse domains, with a relative accuracy gain of 6.3% on Math Word Problems (MWP), 3.4% on Planning, 5.5% on Multi-hop Question Answering (QA), and 21.4% on Relational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

veronica320/faithful-cot
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks

MethodsChain-of-thought prompting