Transformer-based Lexically Constrained Headline Generation

Kosuke Yamada; Yuta Hitomi; Hideaki Tamori; Ryohei Sasano; Naoaki; Okazaki; Kentaro Inui; Koichi Takeda

arXiv:2109.07080·cs.CL·September 16, 2021

Transformer-based Lexically Constrained Headline Generation

Kosuke Yamada, Yuta Hitomi, Hideaki Tamori, Ryohei Sasano, Naoaki, Okazaki, Kentaro Inui, Koichi Takeda

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Transformer-based approach for lexically constrained headline generation that guarantees inclusion of a specified phrase, improving controllability and maintaining high-quality output.

Contribution

It proposes a novel Transformer method ensuring phrase inclusion and a new generation strategy, advancing controllable headline generation techniques.

Findings

01

Guaranteed phrase inclusion in headlines.

02

Comparable ROUGE scores to existing methods.

03

Improved generation strategy performance.

Abstract

This paper explores a variant of automatic headline generation methods, where a generated headline is required to include a given phrase such as a company or a product name. Previous methods using Transformer-based models generate a headline including a given phrase by providing the encoder with additional information corresponding to the given phrase. However, these methods cannot always include the phrase in the generated headline. Inspired by previous RNN-based methods generating token sequences in backward and forward directions from the given phrase, we propose a simple Transformer-based method that guarantees to include the given phrase in the high-quality generated headline. We also consider a new headline generation strategy that takes advantage of the controllable generation order of Transformer. Our experiments with the Japanese News Corpus demonstrate that our methods, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asahi-research/script-for-transformer-based-seq2bf
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Softmax