Generation-Augmented Generation: A Plug-and-Play Framework for Private Knowledge Injection in Large Language Models

Rongji Li; Jian Xu; Yi Chen; Xueqing Chen; Yisheng Yang; Jiayi Wang; Xingyu Chen; Chunyu Xie; Dawei Leng; Xu-Yao Zhang

arXiv:2601.08209·cs.CL·April 15, 2026

Generation-Augmented Generation: A Plug-and-Play Framework for Private Knowledge Injection in Large Language Models

Rongji Li, Jian Xu, Yi Chen, Xueqing Chen, Yisheng Yang, Jiayi Wang, Xingyu Chen, Chunyu Xie, Dawei Leng, Xu-Yao Zhang

PDF

1 Repo 1 Datasets

TL;DR

GAG is a novel framework that injects private domain knowledge into large language models by treating it as an auxiliary modality, improving specialist question-answering while maintaining general capabilities.

Contribution

It introduces Generation-Augmented Generation (GAG), a plug-and-play method that distills domain expertise into latent memories and integrates them into frozen models, outperforming existing baselines.

Findings

01

GAG outperforms retrieval-based and fine-tuning baselines on private-domain QA tasks.

02

GAG maintains general-domain performance while enhancing private knowledge injection.

03

Code and datasets are publicly available at https://github.com/360CVGroup/GAG.

Abstract

In domains such as materials science, biomedicine, and finance, high-stakes deployment of large language models (LLMs) requires injecting private, domain-specific knowledge that is proprietary, fast-evolving, and under-represented in public pretraining. However, the two dominant paradigms for private knowledge injection each have clear drawbacks: fine-tuning is expensive to iterate under continual updates that can induce catastrophic forgetting and general-capability regression; retrieval-augmented generation (RAG) keeps the base model intact but remains brittle in specialized private corpora due to chunk-induced evidence fragmentation, retrieval mismatch, and long-context pressure. Inspired by how multimodal LLMs align heterogeneous modalities into a shared semantic space, we propose Generation-Augmented Generation (GAG), which treats private expertise as an auxiliary modality and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

360CVGroup/GAG
github

Datasets

rongjili/GAG
dataset· 59 dl
59 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.