MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs

Onat Ozer; Grace Wu; Yuchen Wang; Daniel Dosti; Honghao Zhang; Vivi De La Rue

arXiv:2512.20845·cs.AI·December 25, 2025

MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs

Onat Ozer, Grace Wu, Yuchen Wang, Daniel Dosti, Honghao Zhang, Vivi De La Rue

PDF

Open Access

TL;DR

This paper introduces a multi-agent debating approach with diverse personas to enhance reasoning in LLMs, overcoming degeneration from repeated reflections and achieving superior accuracy on reasoning benchmarks.

Contribution

The novel multi-agent debating framework with multiple personas improves reflection diversity and reasoning accuracy in LLMs compared to single-agent methods.

Findings

01

Achieved 47% EM HotPot QA accuracy

02

Reached 82.7% on HumanEval programming tasks

03

Outperformed single-LLM reflection methods

Abstract

LLMs have shown the capacity to improve their performance on reasoning tasks through reflecting on their mistakes, and acting with these reflections in mind. However, continual reflections of the same LLM onto itself exhibit degeneration of thought, where the LLM continues to repeat the same errors again and again even with the knowledge that its wrong. To address this problem, we instead introduce multi-agent with multi-persona debators as the method to generate reflections. Through out extensive experimentation, we've found that the leads to better diversity of in the reflections generated by the llm agent. We demonstrate an accuracy of 47% EM HotPot QA (question answering) and 82.7% on HumanEval (programming), both performances surpassing reflection with a single llm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Multimodal Machine Learning Applications