Learning Opinion Summarizers by Selecting Informative Reviews
Arthur Bra\v{z}inskas, Mirella Lapata, Ivan Titov

TL;DR
This paper introduces a supervised approach for opinion summarization by jointly selecting informative reviews and generating summaries, improving quality and reducing hallucinations using a novel training method.
Contribution
It proposes a joint learning framework that combines review selection with summarization, addressing challenges of large review sets and irrelevant content.
Findings
Selected reviews lead to more accurate summaries.
Joint training reduces hallucinations in generated summaries.
Supervised approach outperforms unsupervised methods.
Abstract
Opinion summarization has been traditionally approached with unsupervised, weakly-supervised and few-shot learning techniques. In this work, we collect a large dataset of summaries paired with user reviews for over 31,000 products, enabling supervised training. However, the number of reviews per product is large (320 on average), making summarization - and especially training a summarizer - impractical. Moreover, the content of many reviews is not reflected in the human-written summaries, and, thus, the summarizer trained on random review subsets hallucinates. In order to deal with both of these challenges, we formulate the task as jointly learning to select informative subsets of reviews and summarizing the opinions expressed in these subsets. The choice of the review subset is treated as a latent variable, predicted by a small and simple selector. The subset is then fed into a more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
MethodsVariational Inference
