LLM-TAKE: Theme Aware Keyword Extraction Using Large Language Models
Reza Yousefi Maragheh, Chenhao Fang, Charan Chand Irugu, Parth Parikh,, Jason Cho, Jianpeng Xu, Saranyan Sukumar, Malay Patel, Evren Korpeoglu,, Sushant Kumar, Kannan Achan

TL;DR
This paper introduces LLM-TAKE, a framework using large language models for theme-aware keyword extraction that improves accuracy and diversity in product metadata analysis.
Contribution
It presents a novel LLM-based framework with two variations for extractive and abstractive theme generation in e-commerce, addressing limitations of traditional models.
Findings
Enhanced accuracy over benchmark models
Improved diversity in keyword extraction
Effective in e-commerce product metadata
Abstract
Keyword extraction is one of the core tasks in natural language processing. Classic extraction models are notorious for having a short attention span which make it hard for them to conclude relational connections among the words and sentences that are far from each other. This, in turn, makes their usage prohibitive for generating keywords that are inferred from the context of the whole text. In this paper, we explore using Large Language Models (LLMs) in generating keywords for items that are inferred from the items textual metadata. Our modeling framework includes several stages to fine grain the results by avoiding outputting keywords that are non informative or sensitive and reduce hallucinations common in LLM. We call our LLM-based framework Theme-Aware Keyword Extraction (LLM TAKE). We propose two variations of framework for generating extractive and abstractive themes for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Information Retrieval and Search Behavior · Text and Document Classification Technologies
MethodsSparse Evolutionary Training · Network On Network
