Pre-trained Language Models for Keyphrase Generation: A Thorough   Empirical Study

Di Wu; Wasi Uddin Ahmad; Kai-Wei Chang

arXiv:2212.10233·cs.CL·February 26, 2024·6 cites

Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study

Di Wu, Wasi Uddin Ahmad, Kai-Wei Chang

PDF

Open Access 1 Repo 4 Models

TL;DR

This paper provides a comprehensive empirical comparison of pre-trained language models versus traditional neural models for keyphrase generation, analyzing design choices and domain-specific performance.

Contribution

It systematically evaluates how different PLM configurations and design choices impact keyphrase generation performance across domains.

Findings

01

PLMs perform well in both high-resource and low-resource settings.

02

In-domain PLMs enhance keyphrase generation accuracy.

03

Prioritizing depth over width in models improves performance.

Abstract

Neural models that do not rely on pre-training have excelled in the keyphrase generation task with large annotated datasets. Meanwhile, new approaches have incorporated pre-trained language models (PLMs) for their data efficiency. However, there lacks a systematic study of how the two types of approaches compare and how different design choices can affect the performance of PLM-based models. To fill in this knowledge gap and facilitate a more informed use of PLMs for keyphrase extraction and keyphrase generation, we present an in-depth empirical study. Formulating keyphrase extraction as sequence labeling and keyphrase generation as sequence-to-sequence generation, we perform extensive experiments in three domains. After showing that PLMs have competitive high-resource performance and state-of-the-art low-resource performance, we investigate important design choices including in-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

uclanlp/deepkpg
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques