Toward Automatic Group Membership Annotation for Group Fairness   Evaluation

Fumian Chen; Dayu Yang; Hui Fang

arXiv:2407.08926·cs.IR·July 15, 2024

Toward Automatic Group Membership Annotation for Group Fairness Evaluation

Fumian Chen, Dayu Yang, Hui Fang

PDF

Open Access

TL;DR

This paper presents a BERT-based method for automatically annotating group membership in datasets to facilitate fair-ranking evaluations, reducing human effort and maintaining evaluation robustness.

Contribution

It introduces a scalable, low-cost automatic annotation approach using language models, outperforming larger models like GPT in accuracy for fairness evaluation datasets.

Findings

01

BERT-based models outperform GPT and Mistral in annotation accuracy.

02

Minimal annotation errors do not significantly affect fairness evaluation outcomes.

03

The method reduces human annotation efforts and broadens dataset applicability.

Abstract

With the increasing research attention on fairness in information retrieval systems, more and more fairness-aware algorithms have been proposed to ensure fairness for a sustainable and healthy retrieval ecosystem. However, as the most adopted measurement of fairness-aware algorithms, group fairness evaluation metrics, require group membership information that needs massive human annotations and is barely available for general information retrieval datasets. This data sparsity significantly impedes the development of fairness-aware information retrieval studies. Hence, a practical, scalable, low-cost group membership annotation method is needed to assist or replace human annotations. This study explored how to leverage language models to automatically annotate group membership for group fairness evaluations, focusing on annotation accuracy and its impact. Our experimental results show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman-Automation Interaction and Safety

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Cosine Annealing · Layer Normalization · Linear Layer · Attention Dropout · Adam · Dropout · Weight Decay