When a Paper Has 1000 Authors: Rethinking Citation Metrics in the Era of LLMs
Weihang Guo, Zhao Song, Jiahao Zhang

TL;DR
This paper introduces the SBCI index, a new citation metric designed to better assess individual researcher impact in large-scale collaborations involving thousands of authors, addressing limitations of traditional metrics.
Contribution
The paper proposes the SBCI index, analyzes its theoretical properties, and evaluates its effectiveness on synthetic datasets, improving impact assessment in large-scale research collaborations.
Findings
SBCI offers a more discriminative impact measure.
Traditional metrics like h-index are inadequate for large collaborations.
SBCI demonstrates robustness in synthetic data evaluations.
Abstract
Author-level citation metrics provide a practical, interpretable, and scalable signal of scholarly influence in a complex research ecosystem. It has been widely used as a proxy in hiring decisions. However, the past five years have seen the rapid emergence of large-scale publications in the field of large language models and foundation models, with papers featuring hundreds to thousands of co-authors and receiving tens of thousands of citations within months. For example, Gemini has 1361 authors and has been cited around 4600 times in 19 months. In such cases, traditional metrics, such as total citation count and the -index, fail to meaningfully distinguish individual contributions. Therefore, we propose the following research question: How can one identify standout researchers among thousands of co-authors in large-scale LLM papers? This question is particularly important in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsResearch Data Management Practices · Library Science and Information Systems
