Exploring Large Language Models for Product Attribute Value Identification
Kassem Sabeh, Mouna Kacimi, Johann Gamper, Robert Litschko, Barbara, Plank

TL;DR
This paper investigates the use of large language models like LLaMA and Mistral for product attribute value identification, proposing prompt strategies and fine-tuning methods that improve zero-shot and supervised performance.
Contribution
It introduces a two-step prompt approach and instruction fine-tuning for LLMs, enhancing their effectiveness and data efficiency in product attribute value identification tasks.
Findings
Two-step prompt approach improves zero-shot accuracy
Instruction fine-tuning boosts performance with training data
LLM-based methods outperform traditional fine-tuning approaches
Abstract
Product attribute value identification (PAVI) involves automatically identifying attributes and their values from product information, enabling features like product search, recommendation, and comparison. Existing methods primarily rely on fine-tuning pre-trained language models, such as BART and T5, which require extensive task-specific training data and struggle to generalize to new attributes. This paper explores large language models (LLMs), such as LLaMA and Mistral, as data-efficient and robust alternatives for PAVI. We propose various strategies: comparing one-step and two-step prompt-based approaches in zero-shot settings and utilizing parametric and non-parametric knowledge through in-context learning examples. We also introduce a dense demonstration retriever based on a pre-trained T5 model and perform instruction fine-tuning to explicitly train LLMs on task-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Adam · Gated Linear Unit · SentencePiece · Byte Pair Encoding · LLaMA · Softmax · Layer Normalization · Adafactor
