Toward Automatic Group Membership Annotation for Group Fairness Evaluation
Fumian Chen, Dayu Yang, Hui Fang

TL;DR
This paper presents a BERT-based method for automatically annotating group membership in datasets to facilitate fair-ranking evaluations, reducing human effort and maintaining evaluation robustness.
Contribution
It introduces a scalable, low-cost automatic annotation approach using language models, outperforming larger models like GPT in accuracy for fairness evaluation datasets.
Findings
BERT-based models outperform GPT and Mistral in annotation accuracy.
Minimal annotation errors do not significantly affect fairness evaluation outcomes.
The method reduces human annotation efforts and broadens dataset applicability.
Abstract
With the increasing research attention on fairness in information retrieval systems, more and more fairness-aware algorithms have been proposed to ensure fairness for a sustainable and healthy retrieval ecosystem. However, as the most adopted measurement of fairness-aware algorithms, group fairness evaluation metrics, require group membership information that needs massive human annotations and is barely available for general information retrieval datasets. This data sparsity significantly impedes the development of fairness-aware information retrieval studies. Hence, a practical, scalable, low-cost group membership annotation method is needed to assist or replace human annotations. This study explored how to leverage language models to automatically annotate group membership for group fairness evaluations, focusing on annotation accuracy and its impact. Our experimental results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman-Automation Interaction and Safety
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Cosine Annealing · Layer Normalization · Linear Layer · Attention Dropout · Adam · Dropout · Weight Decay
