The Crossover Process: Learnability and Data Protection from Inference Attacks
Richard Nock, Giorgio Patrini, Finnian Lattimore, Tiberio Caetano

TL;DR
The paper introduces the Crossover Process (cp), a noise-free data mixing method that can control inference attacks and influence learnability, affecting statistical measures, fairness, and causal inference without compromising generalization.
Contribution
It presents a novel data mixing technique, the cp, which jointly manages inference and learnability, and introduces the Rademacher cp complexity to quantify its effects.
Findings
cp can alter joint distributions without changing marginals
cp can manipulate fairness, causal directions, and dependence measures
Experiments validate theoretical predictions across multiple domains
Abstract
It is usual to consider data protection and learnability as conflicting objectives. This is not always the case: we show how to jointly control inference --- seen as the attack --- and learnability by a noise-free process that mixes training examples, the Crossover Process (cp). One key point is that the cp~is typically able to alter joint distributions without touching on marginals, nor altering the sufficient statistic for the class. In other words, it saves (and sometimes improves) generalization for supervised learning, but can alter the relationship between covariates --- and therefore fool measures of nonlinear independence and causal inference into misleading ad-hoc conclusions. For example, a cp~can increase / decrease odds ratios, bring fairness or break fairness, tamper with disparate impact, strengthen, weaken or reverse causal directions, change observed statistical measures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)
MethodsCausal inference
