Rethinking with Retrieval: Faithful Large Language Model Inference
Hangfeng He, Hongming Zhang, Dan Roth

TL;DR
This paper introduces a lightweight, post-processing retrieval method called Rethinking with Retrieval (RR) that enhances large language models' reasoning and explanation faithfulness without additional training, using external knowledge based on chain-of-thought prompts.
Contribution
The paper proposes a novel retrieval-based post-processing approach for LLMs that improves reasoning accuracy and explanation faithfulness without requiring fine-tuning or extra training.
Findings
RR improves LLM performance on reasoning tasks
RR produces more faithful explanations
The method is effective across multiple reasoning domains
Abstract
Despite the success of large language models (LLMs) in various natural language processing (NLP) tasks, the stored knowledge in these models may inevitably be incomplete, out-of-date, or incorrect. This motivates the need to utilize external knowledge to assist LLMs. Unfortunately, current methods for incorporating external knowledge often require additional training or fine-tuning, which can be costly and may not be feasible for LLMs. To address this issue, we propose a novel post-processing approach, rethinking with retrieval (RR), which retrieves relevant external knowledge based on the decomposed reasoning steps obtained from the chain-of-thought (CoT) prompting. This lightweight approach does not require additional training or fine-tuning and is not limited by the input length of LLMs. We evaluate the effectiveness of RR through extensive experiments with GPT-3 on three complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Softmax · {Dispute@FaQ-s}How to file a dispute with Expedia? · Adam
