Investigating LLM Applications in E-Commerce

Chester Palen-Michel; Ruixiang Wang; Yipeng Zhang; David Yu; Canran; Xu; Zhe Wu

arXiv:2408.12779·cs.CL·August 26, 2024·2 cites

Investigating LLM Applications in E-Commerce

Chester Palen-Michel, Ruixiang Wang, Yipeng Zhang, David Yu, Canran, Xu, Zhe Wu

PDF

Open Access

TL;DR

This paper evaluates the performance of large language models in e-commerce tasks, comparing instruction-tuned LLMs with traditional models, and analyzes training methodologies to optimize NLP applications in the industry.

Contribution

It provides a comprehensive comparison of LLMs and traditional models in e-commerce tasks and explores various training strategies for improved performance.

Findings

01

Few-shot inference with large LLMs often underperforms fine-tuned smaller models.

02

Task-specific fine-tuning is crucial for optimal performance.

03

Different training methodologies significantly impact model effectiveness.

Abstract

The emergence of Large Language Models (LLMs) has revolutionized natural language processing in various applications especially in e-commerce. One crucial step before the application of such LLMs in these fields is to understand and compare the performance in different use cases in such tasks. This paper explored the efficacy of LLMs in the e-commerce domain, focusing on instruction-tuning an open source LLM model with public e-commerce datasets of varying sizes and comparing the performance with the conventional models prevalent in industrial applications. We conducted a comprehensive comparison between LLMs and traditional pre-trained language models across specific tasks intrinsic to the e-commerce domain, namely classification, generation, summarization, and named entity recognition (NER). Furthermore, we examined the effectiveness of the current niche industrial application of very…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Rights Management and Security