Outlier Detection Bias Busted: Understanding Sources of Algorithmic Bias through Data-centric Factors
Xueying Ding, Rui Xi, Leman Akoglu

TL;DR
This paper investigates the sources of unfairness in unsupervised outlier detection algorithms by systematically injecting various data biases and analyzing their impact, revealing that algorithmic bias is influenced by both data properties and design choices.
Contribution
It provides a comprehensive data-centric bias injection framework to analyze fairness issues in outlier detection, highlighting the interaction between data properties and algorithmic design.
Findings
OD algorithms are susceptible to various data biases.
Bias injection affects fairness in outlier detection.
Natural data properties can also cause unfairness.
Abstract
The astonishing successes of ML have raised growing concern for the fairness of modern methods when deployed in real world settings. However, studies on fairness have mostly focused on supervised ML, while unsupervised outlier detection (OD), with numerous applications in finance, security, etc., have attracted little attention. While a few studies proposed fairness-enhanced OD algorithms, they remain agnostic to the underlying driving mechanisms or sources of unfairness. Even within the supervised ML literature, there exists debate on whether unfairness stems solely from algorithmic biases (i.e. design choices) or from the biases encoded in the data on which they are trained. To close this gap, this work aims to shed light on the possible sources of unfairness in OD by auditing detection models under different data-centric factors. By injecting various known biases into the input data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Market Dynamics and Volatility
MethodsBalanced Selection
