Predicting Research Trends From Arxiv
Steffen Eger, Chao Li, Florian Netzer, Iryna Gurevych

TL;DR
This paper analyzes Arxiv papers in machine learning and NLP to identify current research trends, revealing dominant paradigms and predicting future focus areas in these fields.
Contribution
It introduces a bottom-up trend detection method based on citation ranking and topic grouping, providing insights into evolving research directions.
Findings
Natural language generation dominates cs.CL.
Reinforcement learning and adversarial methods dominate cs.LG.
Predicted continuation of current research trends.
Abstract
We perform trend detection on two datasets of Arxiv papers, derived from its machine learning (cs.LG) and natural language processing (cs.CL) categories. Our approach is bottom-up: we first rank papers by their normalized citation counts, then group top-ranked papers into different categories based on the tasks that they pursue and the methods they use. We then analyze these resulting topics. We find that the dominating paradigm in cs.CL revolves around natural language generation problems and those in cs.LG revolve around reinforcement learning and adversarial principles. By extrapolation, we predict that these topics will remain lead problems/approaches in their fields in the short- and mid-term.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Data Visualization and Analytics · Biomedical Text Mining and Ontologies
