Normalization of zero-inflated data: An empirical analysis of a new indicator family and its use with altmetrics data
Lutz Bornmann, Robin Haunschild

TL;DR
This paper introduces a new indicator, MHq, for analyzing zero-inflated altmetrics data, demonstrating its effectiveness in distinguishing quality levels and comparing it to existing indicators and citation data.
Contribution
The study proposes the MHq indicator based on the Mantel-Haenszel analysis, showing its validity and superior ability to differentiate quality levels in zero-inflated altmetrics data.
Findings
MHq can distinguish between different quality levels in most cases.
MNPC and EMNPC are less effective in distinguishing quality levels.
Citations have a stronger correlation with peer assessments than altmetrics.
Abstract
Recently, two new indicators (Equalized Mean-based Normalized Proportion Cited, EMNPC; Mean-based Normalized Proportion Cited, MNPC) were proposed which are intended for sparse scientometrics data. The indicators compare the proportion of mentioned papers (e.g. on Facebook) of a unit (e.g., a researcher or institution) with the proportion of mentioned papers in the corresponding fields and publication years (the expected values). In this study, we propose a third indicator (Mantel-Haenszel quotient, MHq) belonging to the same indicator family. The MHq is based on the MH analysis - an established method in statistics for the comparison of proportions. We test (using citations and assessments by peers, i.e. F1000Prime recommendations) if the three indicators can distinguish between different quality levels as defined on the basis of the assessments by peers. Thus, we test their convergent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
