Towards Proactively Forecasting Sentence-Specific Information Popularity within Online News Documents

Sayar Ghosh Roy; Anshul Padhi; Risubh Jain; Manish Gupta; Vasudeva Varma

arXiv:2301.00152·cs.CL·January 3, 2023

TL;DR

This paper introduces a new task of predicting the popularity of individual sentences in online news articles using natural language content, supported by a novel dataset and transfer learning approach.

Contribution

It presents the first dataset for sentence-level popularity prediction and a transfer learning method leveraging salience prediction to improve forecasting accuracy.

Findings

01

Achieved nDCG > 0.8 in popularity forecasting

02

Transfer learning from salience prediction improves performance

03

First dataset with sentence-level popularity labels from search queries
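
The nDCG figure reported above scores how closely a predicted ranking of sentences matches the ranking implied by their true popularity labels. As a minimal sketch, assuming linear gains, a log2 position discount, and no rank cutoff (the exact variant used in the paper is not stated on this page), it can be computed as follows. The function names `dcg` and `ndcg` are illustrative and are not taken from the authors' repository.

```python
import math


def dcg(gains):
    """Discounted cumulative gain of gains listed in ranked order."""
    # Position i (0-based) is discounted by log2(i + 2), so rank 1 has discount 1.
    return sum(g / math.log2(i + 2) for i, g in enumerate(gains))


def ndcg(true_popularity, predicted_scores):
    """nDCG of ranking sentences by predicted score against true popularity.

    Assumption: linear gain and no cutoff; other variants use 2**gain - 1
    or evaluate only the top k positions.
    """
    # Rank sentence indices by predicted score, highest first.
    order = sorted(
        range(len(predicted_scores)),
        key=lambda i: predicted_scores[i],
        reverse=True,
    )
    ranked_gains = [true_popularity[i] for i in order]
    # The ideal ranking sorts sentences by their true popularity.
    ideal = dcg(sorted(true_popularity, reverse=True))
    return dcg(ranked_gains) / ideal if ideal > 0 else 0.0
```

A prediction that orders the sentences exactly by true popularity scores 1.0, and a document whose sentences all have zero popularity is assigned 0.0 here by convention.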

Abstract

Multiple studies have focused on predicting the prospective popularity of an online document as a whole, without paying attention to the contributions of its individual parts. We introduce the task of proactively forecasting popularities of sentences within online news documents solely utilizing their natural language content. We model sentence-specific popularity forecasting as a sequence regression task. For training our models, we curate InfoPop, the first dataset containing popularity labels for over 1.7 million sentences from over 50,000 online news documents. To the best of our knowledge, this is the first dataset automatically created using streams of incoming search engine queries to generate sentence-level popularity annotations. We propose a novel transfer learning approach involving sentence salience prediction as an auxiliary task. Our proposed technique coupled with a…

Code & Models

Repositories

sayarghoshroy/infopopularity
PyTorch · Official
