Exploiting Missing Data Remediation Strategies using Adversarial Missingness Attacks

Deniz Koyuncu; Alex Gittens; B\"ulent Yener; Moti Yung

arXiv:2409.04407·cs.LG·November 18, 2025

Exploiting Missing Data Remediation Strategies using Adversarial Missingness Attacks

Deniz Koyuncu, Alex Gittens, B\"ulent Yener, Moti Yung

PDF

Open Access

TL;DR

This paper introduces a novel bi-level optimization framework for adversarial missingness attacks that manipulate data imputation methods to bias model outcomes, demonstrated on real-world datasets.

Contribution

It extends adversarial missingness attacks to scenarios with various missing data handling techniques using a bi-level optimization approach.

Findings

01

AM attacks succeed with less than 20% missing data

02

AM can reverse and inflate estimated treatment effects

03

Attacks are effective even with limited data modifications

Abstract

Adversarial Missingness (AM) attacks aim to manipulate model fitting by carefully engineering a missing data problem to achieve a specific malicious objective. AM attacks are significantly different from prior data poisoning attacks in that no malicious data inserted and no data is maliciously perturbed. Current AM attacks are feasible only under the assumption that the modeler (victim) uses full-information maximum likelihood methods to handle missingness. This work aims to remedy this limitation of AM attacks; in the approach taken here, the adversary achieves their goal by solving a bi-level optimization problem to engineer the adversarial missingness mechanism, where the lower level problem incorporates a differentiable approximation of the targeted missingness remediation technique. As instantiations of this framework, AM attacks are provided for three popular techniques: (i)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms

MethodsAttention Model · Focus