Exploring Large Language Models for Product Attribute Value   Identification

Kassem Sabeh; Mouna Kacimi; Johann Gamper; Robert Litschko; Barbara; Plank

arXiv:2409.12695·cs.CL·September 20, 2024

Exploring Large Language Models for Product Attribute Value Identification

Kassem Sabeh, Mouna Kacimi, Johann Gamper, Robert Litschko, Barbara, Plank

PDF

Open Access

TL;DR

This paper investigates the use of large language models like LLaMA and Mistral for product attribute value identification, proposing prompt strategies and fine-tuning methods that improve zero-shot and supervised performance.

Contribution

It introduces a two-step prompt approach and instruction fine-tuning for LLMs, enhancing their effectiveness and data efficiency in product attribute value identification tasks.

Findings

01

Two-step prompt approach improves zero-shot accuracy

02

Instruction fine-tuning boosts performance with training data

03

LLM-based methods outperform traditional fine-tuning approaches

Abstract

Product attribute value identification (PAVI) involves automatically identifying attributes and their values from product information, enabling features like product search, recommendation, and comparison. Existing methods primarily rely on fine-tuning pre-trained language models, such as BART and T5, which require extensive task-specific training data and struggle to generalize to new attributes. This paper explores large language models (LLMs), such as LLaMA and Mistral, as data-efficient and robust alternatives for PAVI. We propose various strategies: comparing one-step and two-step prompt-based approaches in zero-shot settings and utilizing parametric and non-parametric knowledge through in-context learning examples. We also introduce a dense demonstration retriever based on a pre-trained T5 model and perform instruction fine-tuning to explicitly train LLMs on task-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Adam · Gated Linear Unit · SentencePiece · Byte Pair Encoding · LLaMA · Softmax · Layer Normalization · Adafactor