Adversarial Attacks on Tables with Entity Swap

Aneta Koleva; Martin Ringsquandl; Volker Tresp

arXiv:2309.08650·cs.CL·September 19, 2023

Adversarial Attacks on Tables with Entity Swap

Aneta Koleva, Martin Ringsquandl, Volker Tresp

PDF

Open Access

TL;DR

This paper introduces the first black-box adversarial attack on tabular language models, specifically targeting the column type annotation task, revealing significant vulnerabilities with up to 70% performance drop.

Contribution

It proposes an entity-swap attack method for tables, highlighting a novel security concern for tabular language models and exposing dataset leakage issues.

Findings

01

Attack causes up to 70% performance degradation

02

First black-box attack on tabular language models

03

Reveals dataset leakage in evaluation datasets

Abstract

The capabilities of large language models (LLMs) have been successfully applied in the context of table representation learning. The recently proposed tabular language models have reported state-of-the-art results across various tasks for table interpretation. However, a closer look into the datasets commonly used for evaluation reveals an entity leakage from the train set into the test set. Motivated by this observation, we explore adversarial attacks that represent a more realistic inference setup. Adversarial attacks on text have been shown to greatly affect the performance of LLMs, but currently, there are no attacks targeting tabular language models. In this paper, we propose an evasive entity-swap attack for the column type annotation (CTA) task. Our CTA attack is the first black-box attack on tables, where we employ a similarity-based sampling strategy to generate adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Adversarial Robustness in Machine Learning · Natural Language Processing Techniques