Pre-Trained Language Models for Keyphrase Prediction: A Review
Muhammad Umair, Tangina Sultana, Young-Koo Lee

TL;DR
This review paper comprehensively analyzes the use of pre-trained language models for keyphrase prediction, covering extraction and generation tasks, and highlights future research directions in this domain.
Contribution
It provides a unified, in-depth analysis of PLM-based keyphrase prediction, addressing gaps in previous surveys by exploring both extraction and generation tasks with various training techniques.
Findings
Extensive taxonomy for PLM-KPE and KPG tasks.
Insights into supervised, unsupervised, semi-supervised, and self-supervised learning methods.
Identification of promising future research directions.
Abstract
Keyphrase Prediction (KP) is essential for identifying keyphrases in a document that can summarize its content. However, recent Natural Language Processing (NLP) advances have developed more efficient KP models using deep learning techniques. The limitation of a comprehensive exploration jointly both keyphrase extraction and generation using pre-trained language models spotlights a critical gap in the literature, compelling our survey paper to bridge this deficiency and offer a unified and in-depth analysis to address limitations in previous surveys. This paper extensively examines the topic of pre-trained language models for keyphrase prediction (PLM-KP), which are trained on large text corpora via different learning (supervisor, unsupervised, semi-supervised, and self-supervised) techniques, to provide respective insights into these two types of tasks in NLP, precisely, Keyphrase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKollen-Pollack Learning
