CatBack: Universal Backdoor Attacks on Tabular Data via Categorical Encoding

Behrad Tajalli; Stefanos Koffas; Stjepan Picek

arXiv:2511.06072·cs.LG·November 11, 2025

CatBack: Universal Backdoor Attacks on Tabular Data via Categorical Encoding

Behrad Tajalli, Stefanos Koffas, Stjepan Picek

PDF

Open Access

TL;DR

This paper introduces a novel backdoor attack on tabular data that effectively manipulates models using a universal gradient-based perturbation, highlighting a significant security vulnerability in machine learning systems handling mixed data types.

Contribution

The paper presents a new technique to convert categorical data into floating-point representations for backdoor attacks, enabling a universal attack applicable to various features and models.

Findings

01

Achieves up to 100% attack success rate in multiple settings

02

Outperforms previous methods like Tabdoor in effectiveness

03

Successfully bypasses several state-of-the-art defenses

Abstract

Backdoor attacks in machine learning have drawn significant attention for their potential to compromise models stealthily, yet most research has focused on homogeneous data such as images. In this work, we propose a novel backdoor attack on tabular data, which is particularly challenging due to the presence of both numerical and categorical features. Our key idea is a novel technique to convert categorical values into floating-point representations. This approach preserves enough information to maintain clean-model accuracy compared to traditional methods like one-hot or ordinal encoding. By doing this, we create a gradient-based universal perturbation that applies to all features, including categorical ones. We evaluate our method on five datasets and four popular models. Our results show up to a 100% attack success rate in both white-box and black-box settings (including real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Explainable Artificial Intelligence (XAI)