TL;DR
This paper presents the SemEval-2021 NLPContributionGraph task, which challenges systems to structure scholarly NLP contributions into a research knowledge graph, highlighting the complexity of automating semantic content extraction from scientific texts.
Contribution
It introduces a novel shared task with multi-level annotations for scholarly articles, providing a benchmark for automatic extraction and structuring of research contributions into knowledge graphs.
Findings
Best system achieved 57.27% F1 on contribution sentence classification
F1 score of 46.41% for phrase extraction
22.28% F1 for triple extraction, indicating high difficulty
Abstract
There is currently a gap between the natural language expression of scholarly publications and their structured semantic content modeling to enable intelligent content search. With the volume of research growing exponentially every year, a search feature operating over semantically structured content is compelling. The SemEval-2021 Shared Task NLPContributionGraph (a.k.a. 'the NCG task') tasks participants to develop automated systems that structure contributions from NLP scholarly articles in the English language. Being the first-of-its-kind in the SemEval series, the task released structured data from NLP scholarly articles at three levels of information granularity, i.e. at sentence-level, phrase-level, and phrases organized as triples toward Knowledge Graph (KG) building. The sentence-level annotations comprised the few sentences about the article's contribution. The phrase-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
