Using the Full-text Content of Academic Articles to Identify and   Evaluate Algorithm Entities in the Domain of Natural Language Processing

Yuzhuo Wang; Chengzhi Zhang

arXiv:2010.10817·cs.CL·October 22, 2020

Using the Full-text Content of Academic Articles to Identify and Evaluate Algorithm Entities in the Domain of Natural Language Processing

Yuzhuo Wang, Chengzhi Zhang

PDF

TL;DR

This paper presents a domain-independent method to identify and evaluate influential algorithms in NLP academic papers by constructing an algorithm dictionary and analyzing their mentions, revealing trends and influential algorithms in the field.

Contribution

It introduces a novel approach for extracting and analyzing algorithms from full-text papers, applicable across domains, and highlights influential algorithms in NLP research.

Findings

01

Classification algorithms are most influential in NLP papers.

02

The influence of algorithms reflects research trends and topic shifts.

03

The methodology can be used for large-scale automatic algorithm extraction.

Abstract

In the era of big data, the advancement, improvement, and application of algorithms in academic research have played an important role in promoting the development of different disciplines. Academic papers in various disciplines, especially computer science, contain a large number of algorithms. Identifying the algorithms from the full-text content of papers can determine popular or classical algorithms in a specific field and help scholars gain a comprehensive understanding of the algorithms and even the field. To this end, this article takes the field of natural language processing (NLP) as an example and identifies algorithms from academic papers in the field. A dictionary of algorithms is constructed by manually annotating the contents of papers, and sentences containing algorithms in the dictionary are extracted through dictionary-based matching. The number of articles mentioning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.