Tool Learning with Foundation Models
Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui,, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su,, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen,, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang

TL;DR
This paper systematically investigates how foundation models can learn to use tools effectively, highlighting challenges, frameworks, and experimental results with 18 tools to advance AI problem-solving capabilities.
Contribution
It introduces a comprehensive framework for tool learning with foundation models, including task decomposition, reasoning, and tool selection, and provides experimental validation with multiple tools.
Findings
Foundation models can effectively utilize a variety of tools.
A general framework for tool learning improves task decomposition and tool selection.
Experimental results demonstrate the potential of foundation models in tool use.
Abstract
Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced accuracy, efficiency, and automation in problem-solving. Despite its immense potential, there is still a lack of a comprehensive understanding of key challenges, opportunities, and future endeavors in this field. To this end, we present a systematic investigation of tool learning in this paper. We first introduce the background of tool learning, including its cognitive origins, the paradigm shift of foundation models, and the complementary roles of tools and models. Then we recapitulate existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Software Engineering Research
