LLMs as Architects and Critics for Multi-Source Opinion Summarization

Anuj Attri; Arnav Attri; Pushpak Bhattacharyya; Suman Banerjee; Amey Patil; Muthusamy Chelliah; Nikesh Garera

arXiv:2507.04751·cs.CL·July 8, 2025

LLMs as Architects and Critics for Multi-Source Opinion Summarization

Anuj Attri, Arnav Attri, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Nikesh Garera

PDF

TL;DR

This paper introduces a new multi-source opinion summarization task utilizing LLMs, presents a benchmark dataset, and demonstrates that factually enriched summaries improve user engagement and align well with human judgment.

Contribution

The paper proposes a novel multi-source opinion summarization framework using LLMs, introduces the M-OS-EVAL benchmark dataset, and shows improved summary quality and user engagement.

Findings

01

M-OS significantly increases user engagement with 87% preference in user studies.

02

M-OS-PROMPTS achieve a Spearman correlation of 0.74 with human judgment.

03

Factually enriched summaries outperform previous methods.

Abstract

Multi-source Opinion Summarization (M-OS) extends beyond traditional opinion summarization by incorporating additional sources of product metadata such as descriptions, key features, specifications, and ratings, alongside reviews. This integration results in comprehensive summaries that capture both subjective opinions and objective product attributes essential for informed decision-making. While Large Language Models (LLMs) have shown significant success in various Natural Language Processing (NLP) tasks, their potential in M-OS remains largely unexplored. Additionally, the lack of evaluation datasets for this task has impeded further advancements. To bridge this gap, we introduce M-OS-EVAL, a benchmark dataset for evaluating multi-source opinion summaries across 7 key dimensions: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, specificity. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.