FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration
Dongwon Jung, Peng Shi, Yi Zhang

TL;DR
FutureWeaver is a framework that optimizes test-time compute allocation in multi-agent systems, enabling better collaboration and performance under fixed compute budgets through modular workflows and dual-level planning.
Contribution
It introduces a novel modularized collaboration approach and a dual-level planning architecture for optimizing compute in multi-agent systems during inference.
Findings
Outperforms baselines across diverse budget settings
Effectively fosters collaboration among agents
Demonstrates scalability on complex benchmarks
Abstract
Scaling test-time computation improves large language model performance without additional training. Recent work demonstrates that techniques such as repeated sampling, self-verification, and self-reflection can significantly enhance task success by allocating more inference-time compute. However, applying these techniques across multiple agents in a multi-agent system is difficult: there does not exist principled mechanisms to allocate compute to foster collaboration among agents, to extend test-time scaling to collaborative interactions, or to distribute compute across agents under explicit budget constraints. To address this gap, we propose FutureWeaver, a framework for planning and optimizing test-time compute allocation in multi-agent systems under fixed budgets. FutureWeaver introduces modularized collaboration, formalized as callable functions that encapsulate reusable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · AI-based Problem Solving and Planning · Constraint Satisfaction and Optimization
