Agent Lumos: Unified and Modular Training for Open-Source Language   Agents

Da Yin; Faeze Brahman; Abhilasha Ravichander; Khyathi Chandu; Kai-Wei; Chang; Yejin Choi; Bill Yuchen Lin

arXiv:2311.05657·cs.AI·July 11, 2024·2 cites

Agent Lumos: Unified and Modular Training for Open-Source Language Agents

Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei, Chang, Yejin Choi, Bill Yuchen Lin

PDF

Open Access 2 Repos 10 Models 5 Datasets

TL;DR

LUMOS is a modular, open-source framework for training language agents that improves generalization, transparency, and performance across diverse complex tasks, surpassing existing open-source and GPT-based agents.

Contribution

Introduces LUMOS, a unified, modular training framework for open-source language agents with high-quality annotations and superior performance on multiple datasets.

Findings

01

LUMOS outperforms larger open-source agents on unseen datasets.

02

LUMOS surpasses GPT agents on QA and web tasks.

03

LUMOS generalizes well to unseen tasks.

Abstract

Closed-source agents suffer from several issues such as a lack of affordability, transparency, and reproducibility, particularly on complex interactive tasks. This motivates the development of open-source alternatives. We introduce LUMOS, one of the first frameworks for training open-source LLM-based agents. LUMOS features a learnable, unified, and modular architecture with a planning module that learns high-level subgoal generation, and a grounding module trained to translate these into actions using various tools in the execution module. The design allows for modular upgrades and wider applicability to diverse interactive tasks. To foster generalizable agent learning, we collect large-scale, unified, and high-quality training annotations derived from diverse ground-truth reasoning rationales across various complex interactive tasks. On 9 datasets, LUMOS exhibits several key…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation

MethodsAttention Is All You Need · Softmax · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia? · Weight Decay · Linear Layer · Dense Connections · Adam · Dropout · Multi-Head Attention