Faithful Chain-of-Thought Reasoning
Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong,, Marianna Apidianaki, Chris Callison-Burch

TL;DR
Faithful CoT enhances reasoning interpretability and accuracy by translating natural language queries into symbolic reasoning, then solving deterministically, outperforming standard CoT on multiple benchmarks and setting new state-of-the-art results.
Contribution
It introduces a two-stage Faithful CoT framework that ensures reasoning faithfulness and improves empirical performance across diverse reasoning tasks.
Findings
Outperforms standard CoT on 9 of 10 benchmarks
Achieves 6.3% accuracy gain on Math Word Problems
Sets new state-of-the-art few-shot performance with GPT-4 and Codex
Abstract
While Chain-of-Thought (CoT) prompting boosts Language Models' (LM) performance on a gamut of complex reasoning tasks, the generated reasoning chain does not necessarily reflect how the model arrives at the answer (aka. faithfulness). We propose Faithful CoT, a reasoning framework involving two stages: Translation (Natural Language query symbolic reasoning chain) and Problem Solving (reasoning chain answer), using an LM and a deterministic solver respectively. This guarantees that the reasoning chain provides a faithful explanation of the final answer. Aside from interpretability, Faithful CoT also improves empirical performance: it outperforms standard CoT on 9 of 10 benchmarks from 4 diverse domains, with a relative accuracy gain of 6.3% on Math Word Problems (MWP), 3.4% on Planning, 5.5% on Multi-hop Question Answering (QA), and 21.4% on Relational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Graph Neural Networks
MethodsChain-of-thought prompting
