NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic
Zi'ou Zheng, Xiaodan Zhu

TL;DR
NatLogAttack introduces a logic-based adversarial attack framework for natural language inference models, revealing their vulnerabilities and assessing their reasoning capabilities through systematic, label-preserving and label-flipping attacks.
Contribution
The paper presents NatLogAttack, a novel attack method based on natural logic, improving adversarial example quality and efficiency compared to existing approaches.
Findings
Models are more vulnerable under label-flipping attacks.
NatLogAttack generates more effective adversarial examples with fewer model visits.
The framework offers a new perspective for evaluating reasoning in NLI models.
Abstract
Reasoning has been a central topic in artificial intelligence from the beginning. The recent progress made on distributed representation and neural networks continues to improve the state-of-the-art performance of natural language inference. However, it remains an open question whether the models perform real reasoning to reach their conclusions or rely on spurious correlations. Adversarial attacks have proven to be an important tool to help evaluate the Achilles' heel of the victim models. In this study, we explore the fundamental problem of developing attack models based on logic formalism. We propose NatLogAttack to perform systematic attacks centring around natural logic, a classical logic formalism that is traceable back to Aristotle's syllogism and has been closely developed for natural language inference. The proposed framework renders both label-preserving and label-flipping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)
