Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling
Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei, Huang, Shikun Zhang

TL;DR
This paper proposes Supportiveness-based Knowledge Rewriting (SKR), a method to improve retrieval-augmented language models by filtering and optimizing external knowledge based on how effectively it supports downstream tasks.
Contribution
The paper introduces a novel supportiveness metric and a training strategy for knowledge rewriting, enhancing the reliability and usefulness of retrieved knowledge for LLMs.
Findings
SKR outperforms GPT-4 in knowledge rewriting with only 7B parameters.
SKR improves performance across six knowledge-intensive tasks.
The method effectively filters irrelevant knowledge and aligns rewrites with task supportiveness.
Abstract
Retrieval-augmented language models (RALMs) have recently shown great potential in mitigating the limitations of implicit knowledge in LLMs, such as untimely updating of the latest expertise and unreliable retention of long-tail knowledge. However, since the external knowledge base, as well as the retriever, can not guarantee reliability, potentially leading to the knowledge retrieved not being helpful or even misleading for LLM generation. In this paper, we introduce Supportiveness-based Knowledge Rewriting (SKR), a robust and pluggable knowledge rewriter inherently optimized for LLM generation. Specifically, we introduce the novel concept of "supportiveness"--which represents how effectively a knowledge piece facilitates downstream tasks--by considering the perplexity impact of augmented knowledge on the response text of a white-box LLM. Based on knowledge supportiveness, we first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies
MethodsAttention Is All You Need · Residual Connection · Softmax · ALIGN · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention
