Rethinking with Retrieval: Faithful Large Language Model Inference

Hangfeng He; Hongming Zhang; Dan Roth

arXiv:2301.00303·cs.CL·January 3, 2023·47 cites

Rethinking with Retrieval: Faithful Large Language Model Inference

Hangfeng He, Hongming Zhang, Dan Roth

PDF

Open Access 1 Repo

TL;DR

This paper introduces a lightweight, post-processing retrieval method called Rethinking with Retrieval (RR) that enhances large language models' reasoning and explanation faithfulness without additional training, using external knowledge based on chain-of-thought prompts.

Contribution

The paper proposes a novel retrieval-based post-processing approach for LLMs that improves reasoning accuracy and explanation faithfulness without requiring fine-tuning or extra training.

Findings

01

RR improves LLM performance on reasoning tasks

02

RR produces more faithful explanations

03

The method is effective across multiple reasoning domains

Abstract

Despite the success of large language models (LLMs) in various natural language processing (NLP) tasks, the stored knowledge in these models may inevitably be incomplete, out-of-date, or incorrect. This motivates the need to utilize external knowledge to assist LLMs. Unfortunately, current methods for incorporating external knowledge often require additional training or fine-tuning, which can be costly and may not be feasible for LLMs. To address this issue, we propose a novel post-processing approach, rethinking with retrieval (RR), which retrieves relevant external knowledge based on the decomposed reasoning steps obtained from the chain-of-thought (CoT) prompting. This lightweight approach does not require additional training or fine-tuning and is not limited by the input length of LLMs. We evaluate the effectiveness of RR through extensive experiments with GPT-3 on three complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hornhehhf/rr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

MethodsMulti-Head Attention · Attention Is All You Need · Cosine Annealing · Linear Warmup With Cosine Annealing · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Softmax · {Dispute@FaQ-s}How to file a dispute with Expedia? · Adam