The Crossover Process: Learnability and Data Protection from Inference   Attacks

Richard Nock; Giorgio Patrini; Finnian Lattimore; Tiberio Caetano

arXiv:1606.04160·cs.LG·March 9, 2017

The Crossover Process: Learnability and Data Protection from Inference Attacks

Richard Nock, Giorgio Patrini, Finnian Lattimore, Tiberio Caetano

PDF

Open Access

TL;DR

The paper introduces the Crossover Process (cp), a noise-free data mixing method that can control inference attacks and influence learnability, affecting statistical measures, fairness, and causal inference without compromising generalization.

Contribution

It presents a novel data mixing technique, the cp, which jointly manages inference and learnability, and introduces the Rademacher cp complexity to quantify its effects.

Findings

01

cp can alter joint distributions without changing marginals

02

cp can manipulate fairness, causal directions, and dependence measures

03

Experiments validate theoretical predictions across multiple domains

Abstract

It is usual to consider data protection and learnability as conflicting objectives. This is not always the case: we show how to jointly control inference --- seen as the attack --- and learnability by a noise-free process that mixes training examples, the Crossover Process (cp). One key point is that the cp~is typically able to alter joint distributions without touching on marginals, nor altering the sufficient statistic for the class. In other words, it saves (and sometimes improves) generalization for supervised learning, but can alter the relationship between covariates --- and therefore fool measures of nonlinear independence and causal inference into misleading ad-hoc conclusions. For example, a cp~can increase / decrease odds ratios, bring fairness or break fairness, tamper with disparate impact, strengthen, weaken or reverse causal directions, change observed statistical measures…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)

MethodsCausal inference