OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
Mengkang Hu, Yuhang Zhou, Wendong Fan, Yuzhou Nie, Bowei Xia, Tao Sun, Ziyu Ye, Zhaoxuan Jin, Yingru Li, Qiguang Chen, Zeyu Zhang, Yifeng Wang, Qianshuo Ye, Bernard Ghanem, Ping Luo, Guohao Li

TL;DR
This paper introduces Workforce, a modular multi-agent framework with a novel training method called OWL, enabling cross-domain transfer and improved performance in real-world task automation.
Contribution
We propose Workforce, a hierarchical, modular multi-agent system with OWL training for enhanced cross-domain transferability and generalization in real-world tasks.
Findings
Achieves state-of-the-art 69.70% performance on GAIA benchmark.
OWL-trained 32B model reaches 52.73% accuracy, close to GPT-4o.
Demonstrates effective cross-domain transfer during inference and training.
Abstract
Large Language Model (LLM)-based multi-agent systems show promise for automating real-world tasks but struggle to transfer across domains due to their domain-specific nature. Current approaches face two critical shortcomings: they require complete architectural redesign and full retraining of all components when applied to new domains. We introduce Workforce, a hierarchical multi-agent framework that decouples strategic planning from specialized execution through a modular architecture comprising: (i) a domain-agnostic Planner for task decomposition, (ii) a Coordinator for subtask management, and (iii) specialized Workers with domain-specific tool-calling capabilities. This decoupling enables cross-domain transferability during both inference and training phases: During inference, Workforce seamlessly adapts to new domains by adding or modifying worker agents; For training, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsScheduling and Optimization Algorithms · Advanced Manufacturing and Logistics Optimization · IoT and Edge/Fog Computing
