Zero-Indexing Internet Search Augmented Generation for Large Language Models
Guangxin He, Zonghong Dai, Jiangcheng Zhu, Binqiang Zhao, Qicheng Hu,, Chenyue Li, You Peng, Chen Wang, Binhang Yuan

TL;DR
This paper introduces a dynamic, search engine API-based augmentation method for large language models, enabling real-time access to the latest online information to enhance content quality without static indexing.
Contribution
It proposes a novel LLM-based framework that dynamically retrieves and extracts online information, bypassing traditional static corpus indexing for improved generation quality.
Findings
Significantly improved content quality in generated outputs.
Successful deployment in a production environment for real-time inference.
Effective re-ranking and extraction strategies for fresh online data.
Abstract
Retrieval augmented generation has emerged as an effective method to enhance large language model performance. This approach typically relies on an internal retrieval module that uses various indexing mechanisms to manage a static pre-processed corpus. However, such a paradigm often falls short when it is necessary to integrate the most up-to-date information that has not been updated into the corpus during generative inference time. In this paper, we explore an alternative approach that leverages standard search engine APIs to dynamically integrate the latest online information (without maintaining any index for any fixed corpus), thereby improving the quality of generated content. We design a collaborative LLM-based paradigm, where we include: (i) a parser-LLM that determines if the Internet augmented generation is demanded and extracts the search keywords if so with a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling
