LSTM-MAS: A Long Short-Term Memory Inspired Multi-Agent System for Long-Context Understanding

Yichen Jiang; Jiakang Yuan; Chongjun Tu; Peng Ye; Tao Chen

arXiv:2601.11913·cs.CL·April 22, 2026

LSTM-MAS: A Long Short-Term Memory Inspired Multi-Agent System for Long-Context Understanding

Yichen Jiang, Jiakang Yuan, Chongjun Tu, Peng Ye, Tao Chen

PDF

1 Datasets

TL;DR

LSTM-MAS is a multi-agent system inspired by LSTM architecture designed to improve long-context understanding in large language models by controlling information flow and reducing errors.

Contribution

The paper introduces a novel multi-agent framework that emulates LSTM's gating mechanisms to enhance long-context processing and mitigate error propagation in language models.

Findings

01

Achieves significant performance improvements over previous multi-agent methods.

02

Effectively reduces error accumulation and hallucinations in long-text comprehension.

03

Demonstrates strong results across multiple question-answering datasets.

Abstract

Effectively processing long contexts remains a fundamental yet unsolved challenge for large language models (LLMs). Existing single-LLM-based methods primarily reduce the context window or optimize the attention mechanism, but they often encounter additional computational costs or constrained expanded context length. While multi-agent-based frameworks can mitigate these limitations, they remain susceptible to the accumulation of errors and the propagation of hallucinations. In this work, we draw inspiration from the Long Short-Term Memory (LSTM) architecture to design a Multi-Agent System called LSTM-MAS, emulating LSTM's hierarchical information flow and gated memory mechanisms for long-context understanding. Specifically, LSTM-MAS organizes agents in a chained architecture, where each node comprises a worker agent for segment-level comprehension, a filter agent for redundancy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

molmohsen/awesome-ai-agent-papers
dataset· 41 dl
41 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.