Solving a Million-Step LLM Task with Zero Errors

Elliot Meyerson; Giuseppe Paolo; Roberto Dailey; Hormoz Shahrzad; Olivier Francon; Conor F. Hayes; Xin Qiu; Babak Hodjat; Risto Miikkulainen

arXiv:2511.09030·cs.AI·November 13, 2025

Solving a Million-Step LLM Task with Zero Errors

Elliot Meyerson, Giuseppe Paolo, Roberto Dailey, Hormoz Shahrzad, Olivier Francon, Conor F. Hayes, Xin Qiu, Babak Hodjat, Risto Miikkulainen

PDF

Open Access

TL;DR

This paper introduces MAKER, a system that achieves error-free execution of over one million LLM steps by extreme task decomposition and multi-agent error correction, enabling scalable long-range reasoning beyond current limitations.

Contribution

MAKER is the first system to solve a million-step LLM task with zero errors through modular decomposition and multi-agent voting, surpassing previous capabilities.

Findings

01

Successfully completed over one million LLM steps with zero errors

02

Extreme task decomposition enables scalable long-range reasoning

03

Error correction via multi-agent voting enhances reliability

Abstract

LLMs have achieved remarkable breakthroughs in reasoning, insights, and tool use, but chaining these abilities into extended processes at the scale of those routinely executed by humans, organizations, and societies has remained out of reach. The models have a persistent error rate that prevents scale-up: for instance, recent experiments in the Towers of Hanoi benchmark domain showed that the process inevitably becomes derailed after at most a few hundred steps. Thus, although LLM research is often still benchmarked on tasks with relatively few dependent logical steps, there is increasing attention on the ability (or inability) of LLMs to perform long range tasks. This paper describes MAKER, the first system that successfully solves a task with over one million LLM steps with zero errors, and, in principle, scales far beyond this level. The approach relies on an extreme decomposition of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · AI-based Problem Solving and Planning · Topic Modeling