FireAct: Toward Language Agent Fine-tuning
Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik, Narasimhan, Shunyu Yao

TL;DR
This paper demonstrates that fine-tuning language models with diverse trajectories significantly enhances their performance as language agents across multiple tasks, offering a promising alternative to prompting-based methods.
Contribution
It introduces FireAct, a novel fine-tuning approach using multi-task trajectories, and provides comprehensive analysis of fine-tuning benefits for language agents.
Findings
Fine-tuning improves agent performance across tasks.
Diverse data enhances robustness and generalization.
Scaling effects influence fine-tuning effectiveness.
Abstract
Recent efforts have augmented language models (LMs) with external tools or environments, leading to the development of language agents that can reason and act. However, most of these agents rely on few-shot prompting techniques with off-the-shelf LMs. In this paper, we investigate and argue for the overlooked direction of fine-tuning LMs to obtain language agents. Using a setup of question answering (QA) with a Google search API, we explore a variety of base LMs, prompting methods, fine-tuning data, and QA tasks, and find language agents are consistently improved after fine-tuning their backbone LMs. For example, fine-tuning Llama2-7B with 500 agent trajectories generated by GPT-4 leads to a 77% HotpotQA performance increase. Furthermore, we propose FireAct, a novel approach to fine-tuning LMs with trajectories from multiple tasks and prompting methods, and show having more diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Balanced Selection · Dense Connections · Label Smoothing · Adam · Absolute Position Encodings
