Computer Science Named Entity Recognition in the Open Research Knowledge   Graph

Jennifer D'Souza; S\"oren Auer

arXiv:2203.14579·cs.CL·November 15, 2022·1 cites

Computer Science Named Entity Recognition in the Open Research Knowledge Graph

Jennifer D'Souza, S\"oren Auer

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a standardized set of scholarly entities for Computer Science NER, compiles a large annotated dataset from research articles, and trains a neural model to improve scientific information extraction.

Contribution

It defines contribution-centric entities for CS NER, creates a large annotated dataset, and develops a neural sequence labeling model tailored for scholarly articles.

Findings

01

Created a large annotated corpus of CS scholarly entities

02

Proposed a standardized set of contribution-centric entities

03

Trained a neural NER model achieving promising results

Abstract

Domain-specific named entity recognition (NER) on Computer Science (CS) scholarly articles is an information extraction task that is arguably more challenging for the various annotation aims that can beset the task and has been less studied than NER in the general domain. Given that significant progress has been made on NER, we believe that scholarly domain-specific NER will receive increasing attention in the years to come. Currently, progress on CS NER -- the focus of this work -- is hampered in part by its recency and the lack of a standardized annotation aim for scientific entities/terms. This work proposes a standardized task by defining a set of seven contribution-centric scholarly entities for CS NER viz., research problem, solution, resource, language, tool, method, and dataset. Following which, its main contributions are: combines existing CS NER resources that maintain their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jd-coderepos/contributions-ner-cs
noneOfficial

Datasets

jxhzxn/contributions-ner-cs
dataset· 13 dl
13 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques