The Web Can Be Your Oyster for Improving Large Language Models

Junyi Li; Tianyi Tang; Wayne Xin Zhao; Jingyuan Wang; Jian-Yun Nie and; Ji-Rong Wen

arXiv:2305.10998·cs.CL·May 25, 2023·5 cites

The Web Can Be Your Oyster for Improving Large Language Models

Junyi Li, Tianyi Tang, Wayne Xin Zhao, Jingyuan Wang, Jian-Yun Nie and, Ji-Rong Wen

PDF

Open Access 1 Repo

TL;DR

This paper introduces UNIWEB, a web-augmented large language model that uses adaptive web retrieval and continual knowledge learning to enhance performance on knowledge-intensive tasks, outperforming previous methods.

Contribution

The paper presents a novel web-augmented LLM with adaptive retrieval and a new pretraining task, improving knowledge integration and task performance.

Findings

01

Significant performance improvements over previous retrieval-augmented models.

02

Effective use of adaptive search to reduce noise and irrelevant web data.

03

Successful application across 16 knowledge-intensive tasks.

Abstract

Large language models (LLMs) encode a large amount of world knowledge. However, as such knowledge is frozen at the time of model training, the models become static and limited by the training data at that time. In order to further improve the capacity of LLMs for knowledge-intensive tasks, we consider augmenting LLMs with the large-scale web using search engine. Unlike previous augmentation sources (e.g., Wikipedia data dump), the web provides broader, more comprehensive and constantly updated information. In this paper, we present a web-augmented LLM UNIWEB, which is trained over 16 knowledge-intensive tasks in a unified text-to-text format. Instead of simply using the retrieved contents from web, our approach has made two major improvements. Firstly, we propose an adaptive search engine assisted learning method that can self-evaluate the confidence level of LLM's predictions, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rucaibox/uniweb
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification