Directional Attractors in LLM Reasoning: How Similarity Retrieval Steers Iterative Summarization Based Reasoning
Cagatay Tekin, Charbel Barakat, Luis Joseph Luna Limgenco

TL;DR
This paper introduces InftyThink with Cross-Chain Memory, a method that uses semantic retrieval of reasoning patterns to improve LLM reasoning accuracy, revealing both benefits and limitations of similarity-based memory in iterative reasoning.
Contribution
The paper proposes a novel semantic cache mechanism for iterative LLM reasoning, guiding inference through similarity retrieval and analyzing its effects on reasoning trajectories.
Findings
Semantic retrieval improves accuracy in structured domains.
Cache retrieval induces directional biases in reasoning trajectories.
Limitations include failure modes in heterogeneous domain tests.
Abstract
Iterative summarization based reasoning frameworks such as InftyThink enable long-horizon reasoning in large language models (LLMs) by controlling context growth, but they repeatedly regenerate similar reasoning strategies across tasks. We introduce InftyThink with Cross-Chain Memory, an extension that augments iterative reasoning with an embedding-based semantic cache of previously successful reasoning patterns. At each reasoning step, the model retrieves and conditions on the most semantically similar stored lemmas, guiding inference without expanding the context window indiscriminately. Experiments on MATH500, AIME2024, and GPQA-Diamond demonstrate that semantic lemma retrieval improves accuracy in structured domains while exposing failure modes in tests that include heterogeneous domains. Geometric analyses of reasoning trajectories reveal that cache retrieval induces directional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
