FireAct: Toward Language Agent Fine-tuning

Baian Chen; Chang Shu; Ehsan Shareghi; Nigel Collier; Karthik; Narasimhan; Shunyu Yao

arXiv:2310.05915·cs.CL·October 10, 2023·6 cites

FireAct: Toward Language Agent Fine-tuning

Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik, Narasimhan, Shunyu Yao

PDF

Open Access

TL;DR

This paper demonstrates that fine-tuning language models with diverse trajectories significantly enhances their performance as language agents across multiple tasks, offering a promising alternative to prompting-based methods.

Contribution

It introduces FireAct, a novel fine-tuning approach using multi-task trajectories, and provides comprehensive analysis of fine-tuning benefits for language agents.

Findings

01

Fine-tuning improves agent performance across tasks.

02

Diverse data enhances robustness and generalization.

03

Scaling effects influence fine-tuning effectiveness.

Abstract

Recent efforts have augmented language models (LMs) with external tools or environments, leading to the development of language agents that can reason and act. However, most of these agents rely on few-shot prompting techniques with off-the-shelf LMs. In this paper, we investigate and argue for the overlooked direction of fine-tuning LMs to obtain language agents. Using a setup of question answering (QA) with a Google search API, we explore a variety of base LMs, prompting methods, fine-tuning data, and QA tasks, and find language agents are consistently improved after fine-tuning their backbone LMs. For example, fine-tuning Llama2-7B with 500 agent trajectories generated by GPT-4 leads to a 77% HotpotQA performance increase. Furthermore, we propose FireAct, a novel approach to fine-tuning LMs with trajectories from multiple tasks and prompting methods, and show having more diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Balanced Selection · Dense Connections · Label Smoothing · Adam · Absolute Position Encodings