API-BLEND: A Comprehensive Corpora for Training and Benchmarking API   LLMs

Kinjal Basu; Ibrahim Abdelaziz; Subhajit Chaudhury; Soham Dan; Maxwell; Crouse; Asim Munawar; Sadhana Kumaravel; Vinod Muthusamy; Pavan Kapanipathi,; Luis A. Lastras

arXiv:2402.15491·cs.CL·May 21, 2024·1 cites

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell, Crouse, Asim Munawar, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi,, Luis A. Lastras

PDF

Open Access 1 Repo 1 Video

TL;DR

API-BLEND is a large, curated dataset designed to train and benchmark API-augmented large language models, enabling better tool integration and task performance evaluation.

Contribution

The paper introduces API-BLEND, a comprehensive dataset for training and benchmarking API-using LLMs, focusing on real-world API interaction tasks.

Findings

01

API-BLEND improves training effectiveness for tool-augmented LLMs.

02

The dataset enables systematic benchmarking of API integration capabilities.

03

API-BLEND covers diverse API interaction scenarios.

Abstract

There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks. As such, there is tremendous interest in methods that can acquire sufficient quantities of train and test data that involve calls to tools / APIs. Two lines of research have emerged as the predominant strategies for addressing this challenge. The first has focused on synthetic data generation techniques, while the second has involved curating task-adjacent datasets which can be transformed into API / Tool-based tasks. In this paper, we focus on the task of identifying, curating, and transforming existing datasets and, in turn, introduce API-BLEND, a large corpora for training and systematic testing of tool-augmented LLMs. The datasets mimic real-world scenarios involving API-tasks such as API / tool detection, slot filling,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ibm/api-blend
noneOfficial

Videos

API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs· underline

Taxonomy

TopicsSemantic Web and Ontologies

MethodsFocus