Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain
Chengzhi Zhang, Yi Xiang, Wenke Hao, Zhicheng Li, Yuchen Qian, Yuzhuo, Wang

TL;DR
This paper develops machine learning methods to automatically identify and classify future work sentences in academic papers, enhancing data mining and analysis of research directions.
Contribution
It introduces a new annotated corpus and compares models for recognizing and classifying future work sentences, achieving high accuracy especially with Bernoulli Bayesian and SCIBERT models.
Findings
Bernoulli Bayesian model achieves 90.73% Macro F1 in recognition
SCIBERT model achieves 72.63% weighted F1 in classification
Content analysis of FWS reveals key research themes
Abstract
Future work sentences (FWS) are the particular sentences in academic papers that contain the author's description of their proposed follow-up research direction. This paper presents methods to automatically extract FWS from academic papers and classify them according to the different future directions embodied in the paper's content. FWS recognition methods will enable subsequent researchers to locate future work sentences more accurately and quickly and reduce the time and cost of acquiring the corpus. The current work on automatic identification of future work sentences is relatively small, and the existing research cannot accurately identify FWS from academic papers, and thus cannot conduct data mining on a large scale. Furthermore, there are many aspects to the content of future work, and the subdivision of the content is conducive to the analysis of specific development directions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling
