InfMem: Learning System-2 Memory Control for Long-Context Agent

Xinyu Wang; Mingze Li; Peng Lu; Xiao-Wen Chang; Lifeng Shang; Jinping Li; Fei Mi; Prasanna Parthasarathi; Yufei Cui

arXiv:2602.02704·cs.CL·February 4, 2026

InfMem: Learning System-2 Memory Control for Long-Context Agent

Xinyu Wang, Mingze Li, Peng Lu, Xiao-Wen Chang, Lifeng Shang, Jinping Li, Fei Mi, Prasanna Parthasarathi, Yufei Cui

PDF

Open Access 1 Datasets

TL;DR

InfMem introduces a control-centric approach for long-context reasoning, actively managing memory through retrieval and compression, significantly improving accuracy and efficiency on ultra-long document QA tasks.

Contribution

The paper presents InfMem, a novel System-2 control-based agent with a PreThink-Retrieve-Write protocol for active memory management in long-document reasoning.

Findings

01

Outperforms MemAgent on ultra-long QA benchmarks.

02

Achieves +10.17 to +11.84 accuracy points over baselines.

03

Reduces inference time by up to 5.1 times with early stopping.

Abstract

Reasoning over ultra-long documents requires synthesizing sparse evidence scattered across distant segments under strict memory constraints. While streaming agents enable scalable processing, their passive memory update strategy often fails to preserve low-salience bridging evidence required for multi-hop reasoning. We propose InfMem, a control-centric agent that instantiates System-2-style control via a PreThink-Retrieve-Write protocol. InfMem actively monitors evidence sufficiency, performs targeted in-document retrieval, and applies evidence-aware joint compression to update a bounded memory. To ensure reliable control, we introduce a practical SFT-to-RL training recipe that aligns retrieval, writing, and stopping decisions with end-task correctness. On ultra-long QA benchmarks from 32k to 1M tokens, InfMem consistently outperforms MemAgent across backbones. Specifically, InfMem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

ucmp137538/infmem_superlong
dataset· 84 dl
84 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Neural Network Applications