Reasoning Primitives in Hybrid and Non-Hybrid LLMs
Shivam Rawat, Lucie Flek, Florian Mai, Nicholas Kluge Corr\^ea

TL;DR
This paper investigates how reasoning primitives like recall and state-tracking affect large language models, comparing attention-only transformers with hybrid models that combine attention and recurrent state updates, showing that hybrid models are more robust on complex reasoning tasks.
Contribution
It introduces a study of reasoning primitives in hybrid versus attention-only models, demonstrating that explicit reasoning and architectural biases enhance model robustness and effectiveness.
Findings
Hybrid models outperform attention-only models on complex reasoning tasks.
Reasoning augmentation extends the effective operational range of models.
Hybrid models maintain performance better as task difficulty and sequential dependence increase.
Abstract
Reasoning in large language models is often treated as a monolithic capability, but its observed gains may arise from more basic operations. We study reasoning through two such primitives, recall and state-tracking, and ask whether hybrid architectures that combine attention-based retrieval with recurrent state updates are better suited than attention-only models for tasks that jointly require both. Using matched Olmo3 transformer and hybrid models in instruction-tuned and reasoning-augmented variants, we evaluate these models on a set of controlled tasks involving a mixture of state-tracking and recall primitives, state-based recall. Across tasks, we notice that reasoning augmentation provides the largest overall improvement, substantially extending the range of difficulty over which models remain effective. We also notice that in certain tasks, the hybrid reasoning model remains…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
