Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study
Di Wu, Wasi Uddin Ahmad, Kai-Wei Chang

TL;DR
This paper provides a comprehensive empirical comparison of pre-trained language models versus traditional neural models for keyphrase generation, analyzing design choices and domain-specific performance.
Contribution
It systematically evaluates how different PLM configurations and design choices impact keyphrase generation performance across domains.
Findings
PLMs perform well in both high-resource and low-resource settings.
In-domain PLMs enhance keyphrase generation accuracy.
Prioritizing depth over width in models improves performance.
Abstract
Neural models that do not rely on pre-training have excelled in the keyphrase generation task with large annotated datasets. Meanwhile, new approaches have incorporated pre-trained language models (PLMs) for their data efficiency. However, there lacks a systematic study of how the two types of approaches compare and how different design choices can affect the performance of PLM-based models. To fill in this knowledge gap and facilitate a more informed use of PLMs for keyphrase extraction and keyphrase generation, we present an in-depth empirical study. Formulating keyphrase extraction as sequence labeling and keyphrase generation as sequence-to-sequence generation, we perform extensive experiments in three domains. After showing that PLMs have competitive high-resource performance and state-of-the-art low-resource performance, we investigate important design choices including in-domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
