News Headline Grouping as a Challenging NLU Task

Philippe Laban; Lucas Bandarkar; Marti A. Hearst

arXiv:2105.05391·cs.CL·May 13, 2021

News Headline Grouping as a Challenging NLU Task

Philippe Laban, Lucas Bandarkar, Marti A. Hearst

PDF

1 Repo

TL;DR

This paper introduces the HeadLine Grouping task and dataset, demonstrating current models' limitations and proposing an unsupervised approach that narrows the performance gap, while analyzing model consistency issues.

Contribution

The paper presents a new NLU task and dataset for headline grouping, and introduces an unsupervised model that approaches supervised performance levels.

Findings

01

Human annotators achieve 0.9 F-1 on HLGD.

02

State-of-the-art models reach 0.75 F-1.

03

Unsupervised model is within 3 F-1 of supervised models.

Abstract

Recent progress in Natural Language Understanding (NLU) has seen the latest models outperform human performance on many standard tasks. These impressive results have led the community to introspect on dataset limitations, and iterate on more nuanced challenges. In this paper, we introduce the task of HeadLine Grouping (HLG) and a corresponding dataset (HLGD) consisting of 20,056 pairs of news headlines, each labeled with a binary judgement as to whether the pair belongs within the same group. On HLGD, human annotators achieve high performance of around 0.9 F-1, while current state-of-the art Transformer models only reach 0.75 F-1, opening the path for further improvements. We further propose a novel unsupervised Headline Generator Swap model for the task of HeadLine Grouping that achieves within 3 F-1 of the best supervised model. Finally, we analyze high-performing models with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tingofurro/headline_grouping
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Adam · Layer Normalization · Softmax · Multi-Head Attention