Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach

Weiran Guo; Bing Bo; Shaoxiang Wu; Jingsheng Yang

arXiv:2601.19122·cs.AI·January 28, 2026

Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach

Weiran Guo, Bing Bo, Shaoxiang Wu, Jingsheng Yang

PDF

Open Access

TL;DR

This paper introduces an adversarial data augmentation method using reinforcement learning to systematically identify and address weaknesses in function call capabilities of large language models, enhancing their robustness and generalization.

Contribution

The paper presents a novel RL-based adversarial training framework that targets specific weaknesses in function call models, improving their robustness beyond fixed-pattern data augmentation methods.

Findings

01

Enhanced robustness of function call models against adversarial queries

02

Systematic identification of model weaknesses through RL-generated adversarial data

03

Improved generalization of function call capabilities in LLMs

Abstract

Function call capabilities have become crucial for Large Language Models (LLMs), enabling them to interact more effectively with external tools and APIs. Existing methods for improving the function call capabilities of LLMs rely on data obtained either through manual annotation or automated generation by models, and use this data to finetune the LLMs. However, these methods often lack targeted design and are constrained by fixed patterns and data distributions, which limits their effectiveness in enhancing the generalization and robustness of function call LLMs. To address this limitation, we propose a novel adversarial data augmentation method that employs reinforcement learning to systematically identify and target the weaknesses of function call LLMs. Our training framework introduces a query model trained with reinforcement learning (RL) to generate adversarial queries that are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Machine Learning in Materials Science