Data-Driven Function Calling Improvements in Large Language Model for Online Financial QA
Xing Tang, Hao Chen, Shiwei Li, Fuyuan Lyu, Weijie Shi, Lingjie Li, Dugang Liu, Weihong Luo, Xiku Du, Xiuqiang He

TL;DR
This paper presents a data-driven pipeline to improve function calling in large language models for online financial question-answering, enhancing adaptability and accuracy in real-world deployment.
Contribution
It introduces a novel dataset construction, data augmentation method, and a two-step training process tailored for financial API integration in LLMs.
Findings
The pipeline outperforms existing methods on offline datasets.
It has been successfully deployed in a large-scale online financial QA system.
The approach improves the LLM's ability to handle diverse, out-of-distribution queries.
Abstract
Large language models (LLMs) have been incorporated into numerous industrial applications. Meanwhile, a vast array of API assets is scattered across various functions in the financial domain. An online financial question-answering system can leverage both LLMs and private APIs to provide timely financial analysis and information. The key is equipping the LLM model with function calling capability tailored to a financial scenario. However, a generic LLM requires customized financial APIs to call and struggles to adapt to the financial domain. Additionally, online user queries are diverse and contain out-of-distribution parameters compared with the required function input parameters, which makes it more difficult for a generic LLM to serve online users. In this paper, we propose a data-driven pipeline to enhance function calling in LLM for our online, deployed financial QA, comprising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
