SimplyRetrieve: A Private and Lightweight Retrieval-Centric Generative AI Tool
Youyang Ng, Daisuke Miyashita, Yasuto Hoshi, Yasuhiro Morioka, Osamu, Torii, Tomoya Kodama, Jun Deguchi

TL;DR
SimplyRetrieve is an open-source, lightweight tool that enables private, retrieval-centric generative AI with a user-friendly interface, facilitating knowledge integration without model fine-tuning.
Contribution
It introduces a GUI and API platform for retrieval-centric generation, including private knowledge base construction and retrieval tuning, enhancing privacy and efficiency in generative AI.
Findings
Supports private data integration into LLMs without fine-tuning
Provides a user-friendly interface for retrieval-centric AI
Enables exploration of RCG benefits in generative tasks
Abstract
Large Language Model (LLM) based Generative AI systems have seen significant progress in recent years. Integrating a knowledge retrieval architecture allows for seamless integration of private data into publicly available Generative AI systems using pre-trained LLM without requiring additional model fine-tuning. Moreover, Retrieval-Centric Generation (RCG) approach, a promising future research direction that explicitly separates roles of LLMs and retrievers in context interpretation and knowledge memorization, potentially leads to more efficient implementation. SimplyRetrieve is an open-source tool with the goal of providing a localized, lightweight, and user-friendly interface to these sophisticated advancements to the machine learning community. SimplyRetrieve features a GUI and API based RCG platform, assisted by a Private Knowledge Base Constructor and a Retrieval Tuning Module. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Data Storage Technologies · Stochastic Gradient Optimization Techniques
MethodsBalanced Selection
