The quality of the Web of Science data: a longitudinal study on the completeness of authors-addresses links
Abdelghani Maddi, Lesya Baudoin (METRICS)

TL;DR
This study evaluates the completeness and quality of author-affiliation links in the Web of Science database from 2000 to 2021, revealing improvements over time and variability across indexes, document types, and author counts.
Contribution
It provides a comprehensive longitudinal analysis of author-affiliation link quality in WoS, highlighting periods of improvement and factors affecting link completeness.
Findings
Author-affiliation links became well-informed after 2008.
From 2016, nearly 100% of publications have complete author-affiliation links.
Higher variability in link completeness across indexes, document types, and number of authors.
Abstract
The author-affiliation links are the essential elements used for multiple purposes, such as the disambiguation of authors, the attribution of credits of a publication and fractional counting, the analysis of scientific networks, etc. In this article we analyzed the author-affiliation link quality in the Web of Science (WoS) database between 2000 and 2021. We analyzed the link completeness for 32,676,914 scientific publications under different angles: WoS index, document type and the number of authors per publication. The analysis showed that the author-affiliation link begins to be well informed from 2008. The share of publications for which all addresses and all authors are linked is close to 100% from 2016. The results show a strong variability according to the WoS index, the document type and the number of authors per publication. AHCI is the index with the highest completeness rate,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
