ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models

Jiani Guo; Zuchao Li; Jie Wu; Qianren Wang; Yun Li; Lefei Zhang; Hai Zhao; Yujiu Yang

arXiv:2511.00489·cs.CL·November 4, 2025

ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models

Jiani Guo, Zuchao Li, Jie Wu, Qianren Wang, Yun Li, Lefei Zhang, Hai Zhao, Yujiu Yang

PDF

Open Access 1 Video

TL;DR

ToM introduces a hierarchical Tree-oriented MapReduce framework that enhances long-context reasoning in large language models by leveraging document structure for recursive, coherent reasoning over extended texts.

Contribution

It presents a novel hierarchical framework that improves long-context reasoning by constructing a DocTree and performing recursive reasoning with MapReduce, outperforming existing methods.

Findings

01

Significantly outperforms existing frameworks in long-context reasoning.

02

Achieves better logical coherence in reasoning tasks.

03

Effective on 70B+ parameter language models.

Abstract

Large Language Models (LLMs), constrained by limited context windows, often face significant performance degradation when reasoning over long contexts. To address this, Retrieval-Augmented Generation (RAG) retrieves and reasons over chunks but frequently sacrifices logical coherence due to its reliance on similarity-based rankings. Similarly, divide-and-conquer frameworks (DCF) split documents into small chunks for independent reasoning and aggregation. While effective for local reasoning, DCF struggles to capture long-range dependencies and risks inducing conflicts by processing chunks in isolation. To overcome these limitations, we propose ToM, a novel Tree-oriented MapReduce framework for long-context reasoning. ToM leverages the inherent hierarchical structure of long documents (e.g., main headings and subheadings) by constructing a DocTree through hierarchical semantic parsing and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)