Keyphrase Generation Beyond the Boundaries of Title and Abstract
Krishna Garg, Jishnu Ray Chowdhury, Cornelia Caragea

TL;DR
This paper investigates how incorporating full text and summaries of scholarly articles enhances neural keyphrase generation, demonstrating significant improvements and introducing a new comprehensive dataset for the task.
Contribution
It is the first to systematically explore full text and summaries for keyphrase generation and introduces the FullTextKP dataset with full article texts.
Findings
Adding full text summaries improves keyphrase generation accuracy.
Transformer models like LED outperform traditional models on full-text data.
The new dataset enables more comprehensive research in scholarly keyphrase extraction.
Abstract
Keyphrase generation aims at generating important phrases (keyphrases) that best describe a given document. In scholarly domains, current approaches have largely used only the title and abstract of the articles to generate keyphrases. In this paper, we comprehensively explore whether the integration of additional information from the full text of a given article or from semantically similar articles can be helpful for a neural keyphrase generation model or not. We discover that adding sentences from the full text, particularly in the form of the extractive summary of the article can significantly improve the generation of both types of keyphrases that are either present or absent from the text. Experimental results with three widely used models for keyphrase generation along with one of the latest transformer models suitable for longer documents, Longformer Encoder-Decoder (LED)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
MethodsHow do I make a claim with Expedia?*Make FastClaimService · Multi-Head Attention · Attention Is All You Need · Linear Layer · Softmax · AdamW · How do I get a human at Expedia immediately? (2025-2026) · Attention Dropout · Linear Warmup With Linear Decay · Residual Connection
