Representation Learning for Resource-Constrained Keyphrase Generation

Di Wu; Wasi Uddin Ahmad; Sunipa Dev; Kai-Wei Chang

arXiv:2203.08118·cs.CL·October 25, 2022

Representation Learning for Resource-Constrained Keyphrase Generation

Di Wu, Wasi Uddin Ahmad, Sunipa Dev, Kai-Wei Chang

PDF

1 Repo

TL;DR

This paper introduces a resource-efficient keyphrase generation method that leverages retrieval-based statistics and pre-trained language models, enabling effective low-resource and zero-shot domain adaptation.

Contribution

It proposes a novel data-oriented approach with salient span recovery and prediction objectives, improving keyphrase generation in low-resource settings.

Findings

01

Effective in low-resource keyphrase generation

02

Enhances zero-shot domain adaptation

03

Generates absent keyphrases close to large-data models

Abstract

State-of-the-art keyphrase generation methods generally depend on large annotated datasets, limiting their performance in domains with limited annotated data. To overcome this challenge, we design a data-oriented approach that first identifies salient information using retrieval-based corpus-level statistics, and then learns a task-specific intermediate representation based on a pre-trained language model using large-scale unlabeled documents. We introduce salient span recovery and salient span prediction as denoising training objectives that condense the intra-article and inter-article knowledge essential for keyphrase generation. Through experiments on multiple keyphrase generation benchmarks, we show the effectiveness of the proposed approach for facilitating low-resource keyphrase generation and zero-shot domain adaptation. Our method especially benefits the generation of absent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiaowu0162/low-resource-kpgen
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.