xLAM: A Family of Large Action Models to Empower AI Agent Systems
Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley, Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu,, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu,, Juan Carlos Niebles, Shelby Heinecke, Huan Wang

TL;DR
The paper introduces xLAM, a series of large action models designed for AI agent tasks, demonstrating superior performance and tool use capabilities, and providing open-source models to advance autonomous AI research.
Contribution
It presents a new family of large action models with diverse architectures, trained on synthesized datasets, and benchmarks their performance against leading models like GPT-4.
Findings
xLAM models outperform competitors on agent ability benchmarks
Achieved 1st place on the Berkeley Function-Calling Leaderboard
Models are publicly available for research and development
Abstract
Autonomous agents powered by large language models (LLMs) have attracted significant research interest. However, the open-source community faces many challenges in developing specialized models for agent tasks, driven by the scarcity of high-quality agent datasets and the absence of standard protocols in this area. We introduce and publicly release xLAM, a series of large action models designed for AI agent tasks. The xLAM series includes five models with both dense and mixture-of-expert architectures, ranging from 1B to 8x22B parameters, trained using a scalable, flexible pipeline that unifies, augments, and synthesizes diverse datasets to enhance AI agents' generalizability and performance across varied environments. Our experimental results demonstrate that xLAM consistently delivers exceptional performance across multiple agent ability benchmarks, notably securing the 1st position…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Salesforce/Llama-xLAM-2-8b-fc-r-ggufmodel· 775 dl· ♡ 21775 dl♡ 21
- 🤗Salesforce/xLAM-v0.1-rmodel· 153 dl· ♡ 54153 dl♡ 54
- 🤗Salesforce/xLAM-1b-fc-rmodel· 1.3k dl· ♡ 591.3k dl♡ 59
- 🤗Salesforce/xLAM-7b-fc-rmodel· 307 dl· ♡ 78307 dl♡ 78
- 🤗Salesforce/xLAM-8x7b-rmodel· 69k dl· ♡ 1569k dl♡ 15
- 🤗Salesforce/xLAM-8x22b-rmodel· 10k dl· ♡ 4510k dl♡ 45
- 🤗Salesforce/xLAM-7b-rmodel· 75k dl· ♡ 3275k dl♡ 32
- 🤗RichardErkhov/Salesforce_-_xLAM-8x7b-r-ggufmodel· 135 dl· ♡ 1135 dl♡ 1
- 🤗RichardErkhov/Salesforce_-_xLAM-7b-r-ggufmodel· 72 dl72 dl
- 🤗RichardErkhov/Salesforce_-_xLAM-8x22b-r-ggufmodel· 31 dl31 dl
Videos
Taxonomy
TopicsMulti-Agent Systems and Negotiation
MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Adam
