CallNavi, A Challenge and Empirical Study on LLM Function Calling and   Routing

Yewei Song; Xunzhu Tang; Cedric Lothritz; Saad Ezzini; Jacques Klein,; Tegawend\'e F. Bissyand\'e; Andrey Boytsov; Ulrick Ble; Anne Goujon

arXiv:2501.05255·cs.SE·April 25, 2025

CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing

Yewei Song, Xunzhu Tang, Cedric Lothritz, Saad Ezzini, Jacques Klein,, Tegawend\'e F. Bissyand\'e, Andrey Boytsov, Ulrick Ble, Anne Goujon

PDF

Open Access

TL;DR

This paper introduces a new dataset and empirical analysis of large language models for API function calling and routing, addressing multi-step, complex API interactions in chatbot systems.

Contribution

It presents a novel benchmarking dataset, evaluates state-of-the-art models, and proposes a hybrid API routing approach combining LLMs and fine-tuning.

Findings

01

Improved API execution accuracy in chatbot systems

02

Effective hybrid approach for API routing and parameter generation

03

Insights into model performance across task complexities

Abstract

API-driven chatbot systems are increasingly integral to software engineering applications, yet their effectiveness hinges on accurately generating and executing API calls. This is particularly challenging in scenarios requiring multi-step interactions with complex parameterization and nested API dependencies. Addressing these challenges, this work contributes to the evaluation and assessment of AI-based software development through three key advancements: (1) the introduction of a novel dataset specifically designed for benchmarking API function selection, parameter generation, and nested API execution; (2) an empirical evaluation of state-of-the-art language models, analyzing their performance across varying task complexities in API function generation and parameter accuracy; and (3) a hybrid approach to API routing, combining general-purpose large language models for API selection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling