What is SemEval evaluating? A Systematic Analysis of Evaluation   Campaigns in NLP

Oskar Wysocki; Malina Florea; Andre Freitas

arXiv:2005.14299·cs.CL·August 19, 2022

What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP

Oskar Wysocki, Malina Florea, Andre Freitas

PDF

Open Access

TL;DR

This paper systematically analyzes SemEval, the main NLP evaluation campaign, to understand the types of tasks, metrics, architectures, and participation patterns that define its focus and evolution over time.

Contribution

It provides the first comprehensive quantitative overview of SemEval's task types, evaluation metrics, and participation trends, revealing what the NLP community emphasizes in evaluations.

Findings

01

SemEval predominantly features certain task types and metrics.

02

Participation and citation patterns have evolved over time.

03

The analysis highlights the core focus areas of SemEval in NLP evaluation.

Abstract

SemEval is the primary venue in the NLP community for the proposal of new challenges and for the systematic empirical evaluation of NLP systems. This paper provides a systematic quantitative analysis of SemEval aiming to evidence the patterns of the contributions behind SemEval. By understanding the distribution of task types, metrics, architectures, participation and citations over time we aim to answer the question on what is being evaluated by SemEval.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Semantic Web and Ontologies