Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations

Ziyang Liu

arXiv:2604.12376·cs.CL·April 15, 2026

Cooperative Memory Paging with Keyword Bookmarks for Long-Horizon LLM Conversations

Ziyang Liu

PDF

TL;DR

This paper introduces cooperative paging with keyword bookmarks and a recall tool to enhance long-horizon LLM conversations, outperforming existing methods in answer quality on the LoCoMo benchmark.

Contribution

It proposes a novel cooperative paging method with keyword bookmarks and evaluates various design choices, significantly improving retrieval accuracy and conversation quality.

Findings

01

Coarse fixed-size pages achieve 96.7% coverage; content-aware topic shifts drop to 56.7%.

02

Eviction policy effectiveness varies with data; FIFO works best on synthetic data, LFU on LoCoMo.

03

Bookmark generation strategies improve performance; keyword specificity crucial for correct page selection.

Abstract

When LLM conversations grow beyond the context window, old content must be evicted -- but how does the model recover it when needed? We propose cooperative paging: evicted segments are replaced with minimal keyword bookmarks ([pN:keywords], ~8-24 tokens each), and the model is given a recall() tool to retrieve full content on demand. On the LoCoMo benchmark (10 real multi-session conversations, 300+ turns), cooperative paging achieves the highest answer quality among six methods -- outperforming truncation, BM25, word-overlap retrieval, a search-tool baseline, and full context -- on four models (GPT-4o-mini, DeepSeek-v3.2, Claude Haiku, GLM-5), confirmed by four independent LLM judges ( $p = 0.017$ , paired bootstrap). We then study the paging design space with a 5x4 ablation over boundary strategies and eviction policies (3,176 synthetic probes, 1,600 LoCoMo probes). Key findings: (1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.