Revisiting gender bias research in bibliometrics: Standardizing methodological variability using Scholarly Data Analysis (SoDA) Cards
HaeJin Lee, Shubhanshu Mishra, Apratim Mishra, Zhiwen You, Jinseok, Kim, and Jana Diesner

TL;DR
This paper highlights the inconsistency in methodologies used in gender bias research in bibliometrics and introduces SoDA Cards as a standardized framework to improve transparency, reproducibility, and comparability of such studies.
Contribution
It proposes the development of SoDA Cards to standardize reporting of methodological choices in scholarly data analysis, addressing current variability and enhancing research reliability.
Findings
Review of 70 publications reveals methodological diversity.
Identification of challenges in author disambiguation and gender classification.
Introduction of SoDA Cards to promote transparency and reproducibility.
Abstract
Gender biases in scholarly metrics remain a persistent concern, despite numerous bibliometric studies exploring their presence and absence across productivity, impact, acknowledgment, and self-citations. However, methodological inconsistencies, particularly in author name disambiguation and gender identification, limit the reliability and comparability of these studies, potentially perpetuating misperceptions and hindering effective interventions. A review of 70 relevant publications over the past 12 years reveals a wide range of approaches, from name-based and manual searches to more algorithmic and gold-standard methods, with no clear consensus on best practices. This variability, compounded by challenges such as accurately disambiguating Asian names and managing unassigned gender labels, underscores the urgent need for standardized and robust methodologies. To address this critical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGender Diversity and Inequality
