Can Large Language Models be Effective Online Opinion Miners?

Ryang Heo; Yongsik Seo; Junseong Lee; Dongha Lee

arXiv:2505.15695·cs.CL·October 23, 2025

Can Large Language Models be Effective Online Opinion Miners?

Ryang Heo, Yongsik Seo, Junseong Lee, Dongha Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new benchmark dataset and evaluation protocol to assess large language models' effectiveness in mining opinions from complex online content, addressing current challenges in opinion extraction.

Contribution

The paper presents the Online Opinion Mining Benchmark (OOMB), a novel dataset and evaluation framework for testing LLMs' ability to extract and summarize opinions from diverse online sources.

Findings

01

LLMs show promising capabilities in opinion extraction and summarization.

02

Certain aspects of opinion mining remain challenging for LLMs.

03

The benchmark reveals areas where LLMs need improvement for online opinion mining.

Abstract

The surge of user-generated online content presents a wealth of insights into customer preferences and market trends. However, the highly diverse, complex, and context-rich nature of such contents poses significant challenges to traditional opinion mining approaches. To address this, we introduce Online Opinion Mining Benchmark (OOMB), a novel dataset and evaluation protocol designed to assess the ability of large language models (LLMs) to mine opinions effectively from diverse and intricate online environments. OOMB provides extensive (entity, feature, opinion) tuple annotations and a comprehensive opinion-centric summary that highlights key opinion topics within each content, thereby enabling the evaluation of both the extractive and abstractive capabilities of models. Through our proposed benchmark, we conduct a comprehensive analysis of which aspects remain challenging and where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ryang1119/Online-Opinion-Mining
noneOfficial

Videos

Can Large Language Models be Effective Online Opinion Miners?· underline

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Spam and Phishing Detection · Digital Marketing and Social Media