Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Sierra Wyllie; Ilia Shumailov; Nicolas Papernot

arXiv:2403.07857·cs.LG·March 13, 2024·2 cites

Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Sierra Wyllie, Ilia Shumailov, Nicolas Papernot

PDF

Open Access 3 Reviews

TL;DR

This paper investigates how model-induced distribution shifts can reinforce biases and unfairness over generations, and proposes a framework for tracking, understanding, and intervening to mitigate these feedback loops in machine learning systems.

Contribution

It introduces a framework to track multiple model-induced distribution shifts over generations and proposes algorithmic reparation to mitigate bias and unfairness in data ecosystems.

Findings

01

Model feedback loops can reduce fairness and representation.

02

Interventions via curated training batches can improve fairness.

03

Tracking MIDS helps understand bias amplification over generations.

Abstract

Model-induced distribution shifts (MIDS) occur as previous model outputs pollute new model training sets over generations of models. This is known as model collapse in the case of generative models, and performative prediction or unfairness feedback loops for supervised models. When a model induces a distribution shift, it also encodes its mistakes, biases, and unfairnesses into the ground truth of its data ecosystem. We introduce a framework that allows us to track multiple MIDS over many generations, finding that they can lead to loss in performance, fairness, and minoritized group representation, even in initially unbiased datasets. Despite these negative consequences, we identify how models might be used for positive, intentional, interventions in their data ecosystems, providing redress for historical discrimination through a framework called algorithmic reparation (AR). We…

Peer Reviews

Decision·Submitted to ICLR 2024

Reviewer 01Rating 3· reject, not good enoughConfidence 3

Strengths

- The paper addresses the important issue of studying the fairness effects of repeated training in a data ecosystem. - The paper presents a clear model of sequentially training classifiers and generators that can be used to simultaneously model various effects like feedback loops, performative effects and model collapse.

Weaknesses

- The paper does not propose a mathematical model for the proposed MIDS scheme. Although various effects of retraining are shown through experiments, there is no theoretical investigation on the root causes of these effects, or the efficacy of the presented Algorithmic Reparation (AR) scheme. - The experimental results with the proposed AR scheme based on resampling are not very convincing. For example, the decrease in the equalized odds difference with reparation in Figure 4 is not monotonic. T

Reviewer 02Rating 3· reject, not good enoughConfidence 3

Strengths

* The feedback loop problem is important and interesting. * The authors propose a setup in which this problem can be studied and explore Algorithmic Reparation as a possible solution.

Weaknesses

Personally, I find the paper a bit hard to understand. * Introduction seems a bit verbose and overly lyrical in moments, making it harder to read and follow\ "recent demographic information of the Black population" - it is written that the maps are from 1939 and 1955. I am not sure that this is very recent. * I am not sure that MIDS require their own "taxonomy", given that there are only label and input drifts (Table 1). Impact of feedback loop in fairness has been acknolwedged as a problem fo

Reviewer 03Rating 3· reject, not good enoughConfidence 3

Strengths

1. The formulation and procedure to observe and evaluate MIDS is clear. The flowcharts in Section 3.1 and Section 3.2 are really helpful 2. The discussions on model collapse for generative models and the performative prediction in Section 4.2 are inspiring

Weaknesses

1. This paper attempts to encompass several issues such as model collapse, performative prediction, unfairness feedback loops, and algorithmic reparation. However, the benefit and motivation for the unifying MIDS is not clear to me. It is encouraged that the authors clearly state what addition challenge could be solved, or what existing challenges could be better solved by the MIDS framework. 2. There are existing algorithms that could solve unfairness feedback loops, class imbalance, etc. Howe

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsForecasting Techniques and Applications · Qualitative Comparative Analysis Research · Big Data and Business Intelligence