HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating
Weibin Liao, Jian-guang Lou, Haoyi Xiong

TL;DR
HyFunc significantly reduces inference latency in LLM-based function call systems by employing a hybrid-model cascade and dynamic templating, achieving faster and more efficient agentic AI without sacrificing performance.
Contribution
The paper introduces HyFunc, a novel framework that eliminates redundancies in LLM-based function calling through a hybrid cascade and dynamic templating, improving efficiency and scalability.
Findings
Inference latency reduced to 0.828 seconds
Achieves 80.1% performance on unseen dataset
Outperforms baseline models in efficiency and accuracy
Abstract
While agentic AI systems rely on LLMs to translate user intent into structured function calls, this process is fraught with computational redundancy, leading to high inference latency that hinders real-time applications. This paper identifies and addresses three key redundancies: (1) the redundant processing of a large library of function descriptions for every request; (2) the redundant use of a large, slow model to generate an entire, often predictable, token sequence; and (3) the redundant generation of fixed, boilerplate parameter syntax. We introduce HyFunc, a novel framework that systematically eliminates these inefficiencies. HyFunc employs a hybrid-model cascade where a large model distills user intent into a single "soft token." This token guides a lightweight retriever to select relevant functions and directs a smaller, prefix-tuned model to generate the final call, thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
