IDS at SemEval-2020 Task 10: Does Pre-trained Language Model Know What   to Emphasize?

Jaeyoul Shin; Taeuk Kim; Sang-goo Lee

arXiv:2007.12390·cs.CL·July 27, 2020

IDS at SemEval-2020 Task 10: Does Pre-trained Language Model Know What to Emphasize?

Jaeyoul Shin, Taeuk Kim, Sang-goo Lee

PDF

Open Access

TL;DR

This paper introduces a zero-shot method using pre-trained language models' self-attention to identify words for emphasis in text, outperforming TF-IDF baselines and revealing specialized attention heads.

Contribution

The study demonstrates that PLMs can be used to select important words for emphasis without training, and identifies attention heads that specialize in this task.

Findings

01

Zero-shot emphasis detection outperforms TF-IDF baseline.

02

Certain attention heads are specialized for emphasis recognition.

03

PLMs can recognize important words in sentences.

Abstract

We propose a novel method that enables us to determine words that deserve to be emphasized from written text in visual media, relying only on the information from the self-attention distributions of pre-trained language models (PLMs). With extensive experiments and analyses, we show that 1) our zero-shot approach is superior to a reasonable baseline that adopts TF-IDF and that 2) there exist several attention heads in PLMs specialized for emphasis selection, confirming that PLMs are capable of recognizing important words in sentences.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications