Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases

Anna Glazkova; Dmitry Morozov; Timur Garipov

arXiv:2410.18040·cs.CL·April 16, 2025·2 cites

TL;DR

This paper evaluates the effectiveness of prompt-based large language models in generating Russian scientific keyphrases, comparing various methods and analyzing their strengths and weaknesses through human expert assessments.

Contribution

The paper applies prompt-based LLMs to Russian keyphrase generation and compares their performance with fine-tuned models and unsupervised methods.

Findings

01. Prompt-based methods outperform baselines in keyphrase generation.

02. Few-shot prompt strategies improve model performance.

03. Human evaluation confirms the quality of generated keyphrases.

Abstract

Keyphrase selection is a challenging task in natural language processing that has a wide range of applications. Adapting existing supervised and unsupervised solutions to the Russian language faces several limitations due to the rich morphology of Russian and the limited number of available training datasets. Recent studies conducted on English texts show that large language models (LLMs) successfully address the task of generating keyphrases. LLMs can achieve impressive results without task-specific fine-tuning, using text prompts instead. In this work, we assess the performance of prompt-based methods for generating keyphrases for Russian scientific abstracts. First, we compare the performance of zero-shot and few-shot prompt-based methods, fine-tuned models, and unsupervised methods. Then we assess strategies for selecting keyphrase examples in a few-shot setting. We present the…
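To illustrate the difference between the zero-shot and few-shot settings mentioned in the abstract, here is a minimal sketch of how such prompts might be assembled. The function name `build_keyphrase_prompt`, the instruction wording, and the `Abstract:` / `Keyphrases:` layout are all assumptions made for illustration; they are not the prompts used in the paper.

```python
from typing import Sequence, Tuple


def build_keyphrase_prompt(
    abstract: str,
    examples: Sequence[Tuple[str, Sequence[str]]] = (),
) -> str:
    """Assemble a keyphrase-generation prompt (hypothetical wording).

    With no examples the result is a zero-shot prompt; each
    (abstract, keyphrases) pair in `examples` adds one demonstration,
    which turns it into a few-shot prompt.
    """
    # Illustrative instruction text, not taken from the paper.
    parts = [
        "Generate keyphrases for the following Russian scientific abstract. "
        "Return them as a comma-separated list."
    ]
    # Each demonstration shows an abstract followed by its keyphrases.
    for example_text, example_keyphrases in examples:
        parts.append(
            f"Abstract: {example_text}\n"
            f"Keyphrases: {', '.join(example_keyphrases)}"
        )
    # The target abstract ends with an open "Keyphrases:" slot for the model.
    parts.append(f"Abstract: {abstract}\nKeyphrases:")
    return "\n\n".join(parts)


if __name__ == "__main__":
    demo = [("Пример аннотации.", ["ключевые слова", "языковые модели"])]
    print(build_keyphrase_prompt("Текст новой аннотации.", demo))
```

The returned string would then be sent to whichever instruction-tuned model is under evaluation. How the demonstrations in `examples` are chosen is the few-shot selection question the abstract refers to, and it is deliberately left outside this sketch.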

Taxonomy

Topics: Advanced Text Analysis Techniques