We Should Evaluate Real-World Impact

Ehud Reiter

arXiv:2507.05973·cs.CL·July 9, 2025

Open Access

TL;DR

The paper argues that NLP research rarely evaluates the real-world impact of its systems, and that taking such evaluation seriously would make NLP technology more useful and more quickly adopted.

Contribution

It reports a structured survey of the ACL Anthology showing that perhaps 0.1% of its papers contain real-world impact evaluations, and it advocates for serious, thorough evaluation of real-world impact.

Findings

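01 Perhaps 0.1% of ACL Anthology papers contain real-world impact evaluations

02 Most papers that include impact evaluations present them very sketchily and focus instead on metric evaluations

03 NLP technology would be more useful and more quickly adopted if its real-world impact were seriously evaluated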

Abstract

The ACL community has very little interest in evaluating the real-world impact of NLP systems. A structured survey of the ACL Anthology shows that perhaps 0.1% of its papers contain such evaluations; furthermore most papers which include impact evaluations present them very sketchily and instead focus on metric evaluations. NLP technology would be more useful and more quickly adopted if we seriously tried to understand and evaluate its real-world impact.

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification