Generative Prompt Internalization

Haebin Shin; Lei Ji; Yeyun Gong; Sungdong Kim; Eunbi Choi; Minjoon Seo

arXiv:2411.15927·cs.CL·February 12, 2026

Generative Prompt Internalization

Haebin Shin, Lei Ji, Yeyun Gong, Sungdong Kim, Eunbi Choi, Minjoon Seo

PDF

Open Access 1 Repo 1 Video

TL;DR

Generative Prompt Internalization (GenPI) is a lightweight training method that internalizes complex prompts into language models, reducing computational overhead and enabling efficient inference without explicit prompts.

Contribution

The paper introduces GenPI, a novel joint training approach that internalizes prompts and their reasoning into models, along with a data synthesis technique for training without environment interactions.

Findings

01

Effectively internalizes complex prompts across various scenarios.

02

Reduces inference costs by eliminating explicit prompts.

03

Maintains high performance without prompt input.

Abstract

Prompts used in recent large language model based applications are often fixed and lengthy, leading to significant computational overhead. To address this challenge, we propose Generative Prompt Internalization (GenPI), a lightweight method that employs a joint training approach. GenPI not only replicates the behavior of models with prompt inputs but also generates the content of the prompt along with reasons for why the model's behavior should change accordingly. We demonstrate that our approach effectively internalizes complex prompts across various agent-based application scenarios. For effective training without interactions with the dedicated environments, we introduce a data synthesis technique that autonomously collects conversational datasets by swapping the roles of the agent and environment. This method is especially useful in scenarios where only a predefined prompt is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kaistai/generative-context-distillation
noneOfficial

Videos

Generative Prompt Internalization· underline

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Healthcare