Fair Generalized Linear Mixed Models

Jan Pablo Burgard; Jo\~ao Vitor Pamplona

arXiv:2405.09273·cs.LG·December 10, 2024

Fair Generalized Linear Mixed Models

Jan Pablo Burgard, Jo\~ao Vitor Pamplona

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new algorithm for fair machine learning that accounts for stratified sampling in social survey data, improving prediction fairness and accuracy.

Contribution

It presents a novel method for fair generalized linear mixed models that handles stratified sampling and bias in social survey data.

Findings

01

The algorithm effectively accounts for strata correlations.

02

Stratified sampling impacts fairness and accuracy of predictions.

03

Simulation results demonstrate improved fairness in predictions.

Abstract

When using machine learning for automated prediction, it is important to account for fairness in the prediction. Fairness in machine learning aims to ensure that biases in the data and model inaccuracies do not lead to discriminatory decisions. E.g., predictions from fair machine learning models should not discriminate against sensitive variables such as sexual orientation and ethnicity. The training data often in obtained from social surveys. In social surveys, oftentimes the data collection process is a strata sampling, e.g. due to cost restrictions. In strata samples, the assumption of independence between the observation is not fulfilled. Hence, if the machine learning models do not account for the strata correlations, the results may be biased. Especially high is the bias in cases where the strata assignment is correlated to the variable of interest. We present in this paper an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JoaoVitorPamplona/Fair-Generalized-Linear-Mixed-Models
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference