Learning Opinion Summarizers by Selecting Informative Reviews

Arthur Bra\v{z}inskas; Mirella Lapata; Ivan Titov

arXiv:2109.04325·cs.CL·September 10, 2021

Learning Opinion Summarizers by Selecting Informative Reviews

Arthur Bra\v{z}inskas, Mirella Lapata, Ivan Titov

PDF

Open Access 1 Repo

TL;DR

This paper introduces a supervised approach for opinion summarization by jointly selecting informative reviews and generating summaries, improving quality and reducing hallucinations using a novel training method.

Contribution

It proposes a joint learning framework that combines review selection with summarization, addressing challenges of large review sets and irrelevant content.

Findings

01

Selected reviews lead to more accurate summaries.

02

Joint training reduces hallucinations in generated summaries.

03

Supervised approach outperforms unsupervised methods.

Abstract

Opinion summarization has been traditionally approached with unsupervised, weakly-supervised and few-shot learning techniques. In this work, we collect a large dataset of summaries paired with user reviews for over 31,000 products, enabling supervised training. However, the number of reviews per product is large (320 on average), making summarization - and especially training a summarizer - impractical. Moreover, the content of many reviews is not reflected in the human-written summaries, and, thus, the summarizer trained on random review subsets hallucinates. In order to deal with both of these challenges, we formulate the task as jointly learning to select informative subsets of reviews and summarizing the opinions expressed in these subsets. The choice of the review subset is treated as a latent variable, predicted by a small and simple selector. The subset is then fed into a more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

abrazinskas/selsum
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques

MethodsVariational Inference