LLMs as Architects and Critics for Multi-Source Opinion Summarization
Anuj Attri, Arnav Attri, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Muthusamy Chelliah, Nikesh Garera

TL;DR
This paper introduces a new multi-source opinion summarization task utilizing LLMs, presents a benchmark dataset, and demonstrates that factually enriched summaries improve user engagement and align well with human judgment.
Contribution
The paper proposes a novel multi-source opinion summarization framework using LLMs, introduces the M-OS-EVAL benchmark dataset, and shows improved summary quality and user engagement.
Findings
M-OS significantly increases user engagement with 87% preference in user studies.
M-OS-PROMPTS achieve a Spearman correlation of 0.74 with human judgment.
Factually enriched summaries outperform previous methods.
Abstract
Multi-source Opinion Summarization (M-OS) extends beyond traditional opinion summarization by incorporating additional sources of product metadata such as descriptions, key features, specifications, and ratings, alongside reviews. This integration results in comprehensive summaries that capture both subjective opinions and objective product attributes essential for informed decision-making. While Large Language Models (LLMs) have shown significant success in various Natural Language Processing (NLP) tasks, their potential in M-OS remains largely unexplored. Additionally, the lack of evaluation datasets for this task has impeded further advancements. To bridge this gap, we introduce M-OS-EVAL, a benchmark dataset for evaluating multi-source opinion summaries across 7 key dimensions: fluency, coherence, relevance, faithfulness, aspect coverage, sentiment consistency, specificity. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
