Self-Resource Allocation in Multi-Agent LLM Systems
Alfonso Amayuelas, Jingbo Yang, Saaket Agashe, Ashwin Nagarajan,, Antonis Antoniades, Xin Eric Wang, William Wang

TL;DR
This paper investigates how large language models can serve as effective resource allocators in multi-agent systems, improving task assignment, coordination, and efficiency through different strategies and explicit worker information.
Contribution
It introduces and compares LLM-based orchestrator and planner methods for resource allocation, highlighting the superiority of planners with explicit worker capability info.
Findings
Planners outperform orchestrators in efficiency and utilization.
LLMs achieve high accuracy in resource allocation tasks.
Explicit worker capability info enhances planner strategies.
Abstract
With the development of LLMs as agents, there is a growing interest in connecting multiple agents into multi-agent systems to solve tasks concurrently, focusing on their role in task assignment and coordination. This paper explores how LLMs can effectively allocate computational tasks among multiple agents, considering factors such as cost, efficiency, and performance. In this work, we address key questions, including the effectiveness of LLMs as orchestrators and planners, comparing their effectiveness in task assignment and coordination. Our experiments demonstrate that LLMs can achieve high validity and accuracy in resource allocation tasks. We find that the planner method outperforms the orchestrator method in handling concurrent actions, resulting in improved efficiency and better utilization of agents. Additionally, we show that providing explicit information about worker…
Peer Reviews
Decision·Submitted to ICLR 2026
- By casting self-resource allocation as a cost-aware comparison of Individual / Orchestrator / Planner regimes, the paper adds a fresh systems lens beyond prior role-playing multi-agent frameworks. - The methodology pairs static assignment with a cooperative CuisineWorld/Overcooked-style concurrent benchmark, yielding both groundable checks and ecologically valid coordination stress-tests. - The three regimes are distinguished crisply, and the Planner is explained via a ReAct-like separation of
- The experiments are currently restricted to a single benchmark, CuisineWorld, which does not guarantee that the observed behaviors generalize to other domains such as web-based agents tasks. - Several key tables and figures present point estimates without reporting variance, confidence intervals, or statistical significance across multiple runs. Without such analysis, it remains unclear whether the reported efficiency gains are consistent, reproducible, or simply artifacts of stochastic samp
1. The topic addressed in the paper is important: enabling LLM-based agents to perform self-resource allocation in multi-agent systems, which is critical for advancing the field of autonomous agents. 2. The findings are intuitive and practically relevant, for example, the planner approach outperforms the orchestrator, and that explicitly providing information about worker capabilities improves allocation strategies.
1. Insufficient analysis: While the experiments provide empirical evidence, the paper falls short in explaining the underlying reasons behind the observed results. Merely presenting factual outcomes without deeper interpretation limits the contribution. 2. Experimental setup limitations: It would be better to include tests with state-of-the-art models such as GPT-5, Claude-4.5 Sonnet, or Qwen3. And the evaluation relies solely on toy environments (e.g., CuisineWorld). Demonstrating applicability
- The paper systematically studies three coordination paradigms across multiple domains, providing a multi-dimensional evaluation of LLM-based resource allocation. - By explicitly encoding worker capabilities, the paper introduces a practical and generalizable way to improve multi-agent collaboration. - The study systematically analyzes how the Planner adapts task distribution based on agents’ abilities, demonstrating that explicit capability information significantly improves overall system p
- In the CuisineWorld concurrent-action experiments, comparisons are limited to Orchestrator and Planner paradigms. Including algorithmic or learning-based baselines (e.g., heuristic schedulers, RL-based task allocation, or rule-based controllers) would contextualize the performance improvements more convincingly. - The reported cost metric does not clarify whether it includes token usage, API latency, or parallelization overhead. Presenting token-level statistics and latency–cost trade-offs wo
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Service-Oriented Architecture and Web Services · Mobile Agent-Based Network Management
