Recommendation or Discrimination?: Quantifying Distribution Parity in Information Retrieval Systems
Rinat Khaziev, Bryce Casavant, Pearce Washabaugh, Amy A. Winecoff, and, Matthew Graham

TL;DR
This paper introduces a statistical test to measure distribution parity in information retrieval systems, aiming to detect and quantify bias related to protected variables like race or gender in recommendations.
Contribution
It proposes a novel statistical test for fairness in IR results and demonstrates its effectiveness through simulated and real-world fashion search system evaluations.
Findings
Bias detection correlates with catalog size
Test reveals skin tone bias in fashion recommendations
Method ensures fairer IR system outputs
Abstract
Information retrieval (IR) systems often leverage query data to suggest relevant items to users. This introduces the possibility of unfairness if the query (i.e., input) and the resulting recommendations unintentionally correlate with latent factors that are protected variables (e.g., race, gender, and age). For instance, a visual search system for fashion recommendations may pick up on features of the human models rather than fashion garments when generating recommendations. In this work, we introduce a statistical test for "distribution parity" in the top-K IR results, which assesses whether a given set of recommendations is fair with respect to a specific protected variable. We evaluate our test using both simulated and empirical results. First, using artificially biased recommendations, we demonstrate the trade-off between statistically detectable bias and the size of the search…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Visual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis
