MemGPT: Towards LLMs as Operating Systems
Charles Packer, Sarah Wooders, Kevin Lin, Vivian Fang, Shishir G., Patil, Ion Stoica, Joseph E. Gonzalez

TL;DR
MemGPT introduces a hierarchical memory management system inspired by operating systems to extend the effective context window of large language models, enabling better document analysis and multi-session chat capabilities.
Contribution
The paper presents MemGPT, a novel system that manages multiple memory tiers to extend LLM context windows using OS-inspired techniques, allowing for long-term interactions and large document processing.
Findings
MemGPT successfully analyzes documents larger than the LLM's context window.
It enables multi-session chat with long-term memory and dynamic reflection.
The system improves performance in tasks constrained by limited context size.
Abstract
Large language models (LLMs) have revolutionized AI, but are constrained by limited context windows, hindering their utility in tasks like extended conversations and document analysis. To enable using context beyond limited context windows, we propose virtual context management, a technique drawing inspiration from hierarchical memory systems in traditional operating systems that provide the appearance of large memory resources through data movement between fast and slow memory. Using this technique, we introduce MemGPT (Memory-GPT), a system that intelligently manages different memory tiers in order to effectively provide extended context within the LLM's limited context window, and utilizes interrupts to manage control flow between itself and the user. We evaluate our OS-inspired design in two domains where the limited context windows of modern LLMs severely handicaps their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Context-Aware Activity Recognition Systems · Scientific Computing and Data Management
