The Compressor-Retriever Architecture for Language Model OS
Yuan Yang, Siheng Xiong, Ehsan Shareghi, Faramarz Fekri

TL;DR
This paper proposes a compressor-retriever architecture that enables large language models to manage long-term, stateful context across sessions, advancing towards an OS-like functionality for LLMs.
Contribution
It introduces a novel, model-agnostic, end-to-end differentiable architecture for lifelong context management in language models, addressing a key limitation of current session-based interactions.
Findings
Effective in in-context learning tasks
Enables lifelong context retention
Advances towards stateful LLM operating systems
Abstract
Recent advancements in large language models (LLMs) have significantly enhanced their capacity to aggregate and process information across multiple modalities, enabling them to perform a wide range of tasks such as multimodal data querying, tool usage, web interactions, and handling long documents. These capabilities pave the way for transforming LLMs from mere chatbots into general-purpose agents capable of interacting with the real world. This paper explores the concept of using a language model as the core component of an operating system (OS), effectively acting as a CPU that processes data stored in a context window, which functions as RAM. A key challenge in realizing such an LM OS is managing the life-long context and ensuring statefulness across sessions, a feature limited by the current session-based interaction paradigm due to context window size limit. To address this, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Model-Driven Software Engineering Techniques · Distributed and Parallel Computing Systems
MethodsBalanced Selection
