NatLogAttack: A Framework for Attacking Natural Language Inference   Models with Natural Logic

Zi'ou Zheng; Xiaodan Zhu

arXiv:2307.02849·cs.CL·October 14, 2024

NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic

Zi'ou Zheng, Xiaodan Zhu

PDF

Open Access

TL;DR

NatLogAttack introduces a logic-based adversarial attack framework for natural language inference models, revealing their vulnerabilities and assessing their reasoning capabilities through systematic, label-preserving and label-flipping attacks.

Contribution

The paper presents NatLogAttack, a novel attack method based on natural logic, improving adversarial example quality and efficiency compared to existing approaches.

Findings

01

Models are more vulnerable under label-flipping attacks.

02

NatLogAttack generates more effective adversarial examples with fewer model visits.

03

The framework offers a new perspective for evaluating reasoning in NLI models.

Abstract

Reasoning has been a central topic in artificial intelligence from the beginning. The recent progress made on distributed representation and neural networks continues to improve the state-of-the-art performance of natural language inference. However, it remains an open question whether the models perform real reasoning to reach their conclusions or rely on spurious correlations. Adversarial attacks have proven to be an important tool to help evaluate the Achilles' heel of the victim models. In this study, we explore the fundamental problem of developing attack models based on logic formalism. We propose NatLogAttack to perform systematic attacks centring around natural logic, a classical logic formalism that is traceable back to Aristotle's syllogism and has been closely developed for natural language inference. The proposed framework renders both label-preserving and label-flipping…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Explainable Artificial Intelligence (XAI)