The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints

Vukosi Marivate

arXiv:2605.19066·cs.CL·May 20, 2026

The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints

Vukosi Marivate

PDF

TL;DR

The paper critically examines the evaluation challenges in low-resource NLP, highlighting the Annotation Scarcity Paradox where technical progress outpaces human evaluation capacity, risking validity and equity.

Contribution

It introduces the concept of the Annotation Scarcity Paradox, analyzing its impact on NLP evaluation and proposing community-embedded, relational evaluation paradigms as solutions.

Findings

01

Evaluation bottleneck threatens progress validity

02

Emerging methods include data augmentation and participatory curation

03

Community-driven evaluation is essential for equitable NLP

Abstract

Over the past decade, low-resource natural language processing (NLP) has experienced explosive growth, propelled by cross-lingual transfer, massively multilingual models, and the rapid proliferation of benchmarks. Yet this apparent progress masks a critical, insufficiently examined tension: the deep sociolinguistic expertise required to evaluate increasingly complex generative systems is severely strained, inequitably distributed, and structurally marginalised. We present a critical narrative survey of low-resource NLP evaluation (2014--present), tracing its evolution across three phases: early heuristic optimism, the illusions of top-down benchmark scaling, and the current era of generative bottlenecks. We conceptualise the \emph{Annotation Scarcity Paradox}, the structural friction arising when the technical capacity to scale models vastly outpaces the sovereign human infrastructure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.