Agent Lumos: Unified and Modular Training for Open-Source Language Agents
Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei, Chang, Yejin Choi, Bill Yuchen Lin

TL;DR
LUMOS is a modular, open-source framework for training language agents that improves generalization, transparency, and performance across diverse complex tasks, surpassing existing open-source and GPT-based agents.
Contribution
Introduces LUMOS, a unified, modular training framework for open-source language agents with high-quality annotations and superior performance on multiple datasets.
Findings
LUMOS outperforms larger open-source agents on unseen datasets.
LUMOS surpasses GPT agents on QA and web tasks.
LUMOS generalizes well to unseen tasks.
Abstract
Closed-source agents suffer from several issues such as a lack of affordability, transparency, and reproducibility, particularly on complex interactive tasks. This motivates the development of open-source alternatives. We introduce LUMOS, one of the first frameworks for training open-source LLM-based agents. LUMOS features a learnable, unified, and modular architecture with a planning module that learns high-level subgoal generation, and a grounding module trained to translate these into actions using various tools in the execution module. The design allows for modular upgrades and wider applicability to diverse interactive tasks. To foster generalizable agent learning, we collect large-scale, unified, and high-quality training annotations derived from diverse ground-truth reasoning rationales across various complex interactive tasks. On 9 datasets, LUMOS exhibits several key…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗ai2lumos/lumos_unified_plan_iterativemodel· 4 dl· ♡ 54 dl♡ 5
- 🤗ai2lumos/lumos_unified_ground_iterativemodel· 8 dl· ♡ 18 dl♡ 1
- 🤗ai2lumos/lumos_complex_qa_ground_iterativemodel· 4 dl4 dl
- 🤗ai2lumos/lumos_complex_qa_plan_iterativemodel· 2 dl2 dl
- 🤗ai2lumos/lumos_complex_qa_plan_onetimemodel· 1 dl1 dl
- 🤗ai2lumos/lumos_complex_qa_ground_onetimemodel· 3 dl3 dl
- 🤗ai2lumos/lumos_web_agent_plan_iterativemodel· 7 dl· ♡ 37 dl♡ 3
- 🤗ai2lumos/lumos_web_agent_ground_iterativemodel· 4 dl4 dl
- 🤗ai2lumos/lumos_maths_plan_onetimemodel· 3 dl3 dl
- 🤗ai2lumos/lumos_maths_ground_onetimemodel· 5 dl5 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
MethodsAttention Is All You Need · Softmax · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Linear Layer · Dense Connections · Adam · Dropout · Multi-Head Attention
