Predicting Research Trends From Arxiv

Steffen Eger; Chao Li; Florian Netzer; Iryna Gurevych

arXiv:1903.02831·cs.CL·March 8, 2019·1 cites

Predicting Research Trends From Arxiv

Steffen Eger, Chao Li, Florian Netzer, Iryna Gurevych

PDF

Open Access 1 Repo

TL;DR

This paper analyzes Arxiv papers in machine learning and NLP to identify current research trends, revealing dominant paradigms and predicting future focus areas in these fields.

Contribution

It introduces a bottom-up trend detection method based on citation ranking and topic grouping, providing insights into evolving research directions.

Findings

01

Natural language generation dominates cs.CL.

02

Reinforcement learning and adversarial methods dominate cs.LG.

03

Predicted continuation of current research trends.

Abstract

We perform trend detection on two datasets of Arxiv papers, derived from its machine learning (cs.LG) and natural language processing (cs.CL) categories. Our approach is bottom-up: we first rank papers by their normalized citation counts, then group top-ranked papers into different categories based on the tasks that they pursue and the methods they use. We then analyze these resulting topics. We find that the dominating paradigm in cs.CL revolves around natural language generation problems and those in cs.LG revolve around reinforcement learning and adversarial principles. By extrapolation, we predict that these topics will remain lead problems/approaches in their fields in the short- and mid-term.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UKPLab/refresh2018-predicting-trends-from-arxiv
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques · Data Visualization and Analytics · Biomedical Text Mining and Ontologies