MathAttack: Attacking Large Language Models Towards Math Solving Ability

Zihao Zhou; Qiufeng Wang; Mingyu Jin; Jie Yao; Jianan Ye; and Wei Liu; Wei Wang; Xiaowei Huang; Kaizhu Huang

arXiv:2309.01686·cs.CL·September 6, 2023·2 cites

MathAttack: Attacking Large Language Models Towards Math Solving Ability

Zihao Zhou, Qiufeng Wang, Mingyu Jin, Jie Yao, Jianan Ye, and Wei Liu, Wei Wang, Xiaowei Huang, Kaizhu Huang

PDF

Open Access

TL;DR

This paper introduces MathAttack, a method for generating adversarial math word problems that preserve logical structure to evaluate and improve the robustness of large language models in math solving tasks.

Contribution

It proposes a novel attack method that maintains mathematical logic, introduces the RobustMath dataset for robustness evaluation, and demonstrates how adversarial samples can enhance LLM robustness.

Findings

01

Adversarial samples effectively attack LLMs' math solving ability.

02

Higher-accuracy LLMs are vulnerable to attacks from lower-accuracy models.

03

Using adversarial samples in training improves LLM robustness.

Abstract

With the boom of Large Language Models (LLMs), the research of solving Math Word Problem (MWP) has recently made great progress. However, there are few studies to examine the security of LLMs in math solving ability. Instead of attacking prompts in the use of LLMs, we propose a MathAttack model to attack MWP samples which are closer to the essence of security in solving math problems. Compared to traditional text adversarial attack, it is essential to preserve the mathematical logic of original MWPs during the attacking. To this end, we propose logical entity recognition to identify logical entries which are then frozen. Subsequently, the remaining text are attacked by adopting a word-level attacker. Furthermore, we propose a new dataset RobustMath to evaluate the robustness of LLMs in math solving ability. Extensive experiments on our RobustMath and two another math benchmark datasets…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling