Exploiting Missing Data Remediation Strategies using Adversarial Missingness Attacks
Deniz Koyuncu, Alex Gittens, B\"ulent Yener, Moti Yung

TL;DR
This paper introduces a novel bi-level optimization framework for adversarial missingness attacks that manipulate data imputation methods to bias model outcomes, demonstrated on real-world datasets.
Contribution
It extends adversarial missingness attacks to scenarios with various missing data handling techniques using a bi-level optimization approach.
Findings
AM attacks succeed with less than 20% missing data
AM can reverse and inflate estimated treatment effects
Attacks are effective even with limited data modifications
Abstract
Adversarial Missingness (AM) attacks aim to manipulate model fitting by carefully engineering a missing data problem to achieve a specific malicious objective. AM attacks are significantly different from prior data poisoning attacks in that no malicious data inserted and no data is maliciously perturbed. Current AM attacks are feasible only under the assumption that the modeler (victim) uses full-information maximum likelihood methods to handle missingness. This work aims to remedy this limitation of AM attacks; in the approach taken here, the adversary achieves their goal by solving a bi-level optimization problem to engineer the adversarial missingness mechanism, where the lower level problem incorporates a differentiable approximation of the targeted missingness remediation technique. As instantiations of this framework, AM attacks are provided for three popular techniques: (i)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms
MethodsAttention Model · Focus
