xLAM: A Family of Large Action Models to Empower AI Agent Systems

Jianguo Zhang; Tian Lan; Ming Zhu; Zuxin Liu; Thai Hoang; Shirley; Kokane; Weiran Yao; Juntao Tan; Akshara Prabhakar; Haolin Chen; Zhiwei Liu,; Yihao Feng; Tulika Awalgaonkar; Rithesh Murthy; Eric Hu; Zeyuan Chen; Ran Xu,; Juan Carlos Niebles; Shelby Heinecke; Huan Wang; Silvio Savarese; Caiming; Xiong

arXiv:2409.03215·cs.CL·September 6, 2024·5 cites

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley, Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu,, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu,, Juan Carlos Niebles, Shelby Heinecke, Huan Wang

PDF

Open Access 1 Repo 10 Models 1 Datasets 1 Video

TL;DR

The paper introduces xLAM, a series of large action models designed for AI agent tasks, demonstrating superior performance and tool use capabilities, and providing open-source models to advance autonomous AI research.

Contribution

It presents a new family of large action models with diverse architectures, trained on synthesized datasets, and benchmarks their performance against leading models like GPT-4.

Findings

01

xLAM models outperform competitors on agent ability benchmarks

02

Achieved 1st place on the Berkeley Function-Calling Leaderboard

03

Models are publicly available for research and development

Abstract

Autonomous agents powered by large language models (LLMs) have attracted significant research interest. However, the open-source community faces many challenges in developing specialized models for agent tasks, driven by the scarcity of high-quality agent datasets and the absence of standard protocols in this area. We introduce and publicly release xLAM, a series of large action models designed for AI agent tasks. The xLAM series includes five models with both dense and mixture-of-expert architectures, ranging from 1B to 8x22B parameters, trained using a scalable, flexible pipeline that unifies, augments, and synthesizes diverse datasets to enhance AI agents' generalizability and performance across varied environments. Our experimental results demonstrate that xLAM consistently delivers exceptional performance across multiple agent ability benchmarks, notably securing the 1st position…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SalesforceAIResearch/xLAM
pytorchOfficial

Models

Datasets

younissk/tool-calling-mix
dataset· 152 dl
152 dl

Videos

xLAM: A Family of Large Action Models to Empower AI Agent Systems· underline

Taxonomy

TopicsMulti-Agent Systems and Negotiation

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Adam