The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints
Vukosi Marivate

TL;DR
The paper critically examines the evaluation challenges in low-resource NLP, highlighting the Annotation Scarcity Paradox where technical progress outpaces human evaluation capacity, risking validity and equity.
Contribution
It introduces the concept of the Annotation Scarcity Paradox, analyzing its impact on NLP evaluation and proposing community-embedded, relational evaluation paradigms as solutions.
Findings
Evaluation bottleneck threatens progress validity
Emerging methods include data augmentation and participatory curation
Community-driven evaluation is essential for equitable NLP
Abstract
Over the past decade, low-resource natural language processing (NLP) has experienced explosive growth, propelled by cross-lingual transfer, massively multilingual models, and the rapid proliferation of benchmarks. Yet this apparent progress masks a critical, insufficiently examined tension: the deep sociolinguistic expertise required to evaluate increasingly complex generative systems is severely strained, inequitably distributed, and structurally marginalised. We present a critical narrative survey of low-resource NLP evaluation (2014--present), tracing its evolution across three phases: early heuristic optimism, the illusions of top-down benchmark scaling, and the current era of generative bottlenecks. We conceptualise the \emph{Annotation Scarcity Paradox}, the structural friction arising when the technical capacity to scale models vastly outpaces the sovereign human infrastructure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
