Self-Resource Allocation in Multi-Agent LLM Systems

Alfonso Amayuelas; Jingbo Yang; Saaket Agashe; Ashwin Nagarajan,; Antonis Antoniades; Xin Eric Wang; William Wang

arXiv:2504.02051·cs.MA·April 22, 2025·3 cites

Self-Resource Allocation in Multi-Agent LLM Systems

Alfonso Amayuelas, Jingbo Yang, Saaket Agashe, Ashwin Nagarajan,, Antonis Antoniades, Xin Eric Wang, William Wang

PDF

Open Access 3 Reviews

TL;DR

This paper investigates how large language models can serve as effective resource allocators in multi-agent systems, improving task assignment, coordination, and efficiency through different strategies and explicit worker information.

Contribution

It introduces and compares LLM-based orchestrator and planner methods for resource allocation, highlighting the superiority of planners with explicit worker capability info.

Findings

01

Planners outperform orchestrators in efficiency and utilization.

02

LLMs achieve high accuracy in resource allocation tasks.

03

Explicit worker capability info enhances planner strategies.

Abstract

With the development of LLMs as agents, there is a growing interest in connecting multiple agents into multi-agent systems to solve tasks concurrently, focusing on their role in task assignment and coordination. This paper explores how LLMs can effectively allocate computational tasks among multiple agents, considering factors such as cost, efficiency, and performance. In this work, we address key questions, including the effectiveness of LLMs as orchestrators and planners, comparing their effectiveness in task assignment and coordination. Our experiments demonstrate that LLMs can achieve high validity and accuracy in resource allocation tasks. We find that the planner method outperforms the orchestrator method in handling concurrent actions, resulting in improved efficiency and better utilization of agents. Additionally, we show that providing explicit information about worker…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 6Confidence 4

Strengths

- By casting self-resource allocation as a cost-aware comparison of Individual / Orchestrator / Planner regimes, the paper adds a fresh systems lens beyond prior role-playing multi-agent frameworks. - The methodology pairs static assignment with a cooperative CuisineWorld/Overcooked-style concurrent benchmark, yielding both groundable checks and ecologically valid coordination stress-tests. - The three regimes are distinguished crisply, and the Planner is explained via a ReAct-like separation of

Weaknesses

- The experiments are currently restricted to a single benchmark, CuisineWorld, which does not guarantee that the observed behaviors generalize to other domains such as web-based agents tasks. - Several key tables and figures present point estimates without reporting variance, confidence intervals, or statistical significance across multiple runs. Without such analysis, it remains unclear whether the reported efficiency gains are consistent, reproducible, or simply artifacts of stochastic samp

Reviewer 02Rating 2Confidence 3

Strengths

1. The topic addressed in the paper is important: enabling LLM-based agents to perform self-resource allocation in multi-agent systems, which is critical for advancing the field of autonomous agents. 2. The findings are intuitive and practically relevant, for example, the planner approach outperforms the orchestrator, and that explicitly providing information about worker capabilities improves allocation strategies.

Weaknesses

1. Insufficient analysis: While the experiments provide empirical evidence, the paper falls short in explaining the underlying reasons behind the observed results. Merely presenting factual outcomes without deeper interpretation limits the contribution. 2. Experimental setup limitations: It would be better to include tests with state-of-the-art models such as GPT-5, Claude-4.5 Sonnet, or Qwen3. And the evaluation relies solely on toy environments (e.g., CuisineWorld). Demonstrating applicability

Reviewer 03Rating 6Confidence 3

Strengths

- The paper systematically studies three coordination paradigms across multiple domains, providing a multi-dimensional evaluation of LLM-based resource allocation. - By explicitly encoding worker capabilities, the paper introduces a practical and generalizable way to improve multi-agent collaboration. - The study systematically analyzes how the Planner adapts task distribution based on agents’ abilities, demonstrating that explicit capability information significantly improves overall system p

Weaknesses

- In the CuisineWorld concurrent-action experiments, comparisons are limited to Orchestrator and Planner paradigms. Including algorithmic or learning-based baselines (e.g., heuristic schedulers, RL-based task allocation, or rule-based controllers) would contextualize the performance improvements more convincingly. - The reported cost metric does not clarify whether it includes token usage, API latency, or parallelization overhead. Presenting token-level statistics and latency–cost trade-offs wo

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsScheduling and Optimization Algorithms · Service-Oriented Architecture and Web Services · Mobile Agent-Based Network Management