Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks
Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel,, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse,, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar,, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma

TL;DR
This paper introduces GRANITE-20B-FUNCTIONCALLING, an open-source large language model trained via multi-task learning to perform function calling, enabling external tool interaction and improving generalizability across diverse tasks and datasets.
Contribution
The paper presents a new open-source LLM trained specifically for function calling tasks, achieving competitive performance with proprietary models.
Findings
Outperforms other open models on the Berkeley Function Calling Leaderboard
Achieves fourth place overall among tested models
Demonstrates strong generalizability across multiple datasets
Abstract
Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (APIs) to complete complex tasks. These tasks together are termed function calling. Endowing LLMs with function calling abilities leads to a myriad of advantages, such as access to current and domain-specific information in databases and knowledge sources, and the ability to outsource tasks that can be reliably performed by tools, e.g., a Python interpreter or calculator. While there has been significant progress in function calling with LLMs, there is still a dearth of open models that perform on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Residual Connection · Layer Normalization · Adam · Dense Connections · Linear Warmup With Cosine Annealing · Attention Dropout
