From Summary to Action: Enhancing Large Language Models for Complex   Tasks with Open World APIs

Yulong Liu; Yunlong Yuan; Chunwei Wang; Jianhua Han; Yongqiang Ma; Li; Zhang; Nanning Zheng; Hang Xu

arXiv:2402.18157·cs.AI·February 29, 2024·3 cites

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs

Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li, Zhang, Nanning Zheng, Hang Xu

PDF

Open Access

TL;DR

This paper introduces Sum2Act, a novel pipeline that enhances large language models' ability to invoke real-world APIs for complex tasks, outperforming existing methods on the ToolBench benchmark.

Contribution

The paper presents a new tool invocation pipeline, Sum2Act, that guides LLMs through summarizing results and deciding actions, improving performance on real-world tasks.

Findings

01

Sum2Act outperforms ReAct and DFSDT on ToolBench benchmark.

02

The pipeline effectively controls real-world APIs for complex tasks.

03

Empirical results show significant performance improvements.

Abstract

The distinction between humans and animals lies in the unique ability of humans to use and create tools. Tools empower humans to overcome physiological limitations, fostering the creation of magnificent civilizations. Similarly, enabling foundational models like Large Language Models (LLMs) with the capacity to learn external tool usage may serve as a pivotal step toward realizing artificial general intelligence. Previous studies in this field have predominantly pursued two distinct approaches to augment the tool invocation capabilities of LLMs. The first approach emphasizes the construction of relevant datasets for model fine-tuning. The second approach, in contrast, aims to fully exploit the inherent reasoning abilities of LLMs through in-context learning strategies. In this work, we introduce a novel tool invocation pipeline designed to control massive real-world APIs. This pipeline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBusiness Process Modeling and Analysis