Attend and select: A segment selective transformer for microblog hashtag generation

Qianren Mao; Xi Li; Bang Liu; Shu Guo; Peng Hao; Jianxin Li; Lihong Wang

arXiv:2106.03151 · cs.CL · September 27, 2022

TL;DR

This paper introduces a segment selective transformer with a novel segments-selection mechanism that generates higher-quality, more condensed hashtags from microblog posts and outperforms existing methods.

Contribution

It proposes a modified Transformer-based generation model with a segments-selection procedure that selects crucial tokens and filters out secondary information for better hashtag generation.

Findings

01 Significant improvements over baselines in evaluation metrics.

02 Effective modeling of different textual granularities.

03 Enhanced ability to generate condensed, relevant hashtags.

Abstract

Hashtag generation aims to generate short and informal topical tags from a microblog post, in which tokens or phrases form the hashtags. These tokens or phrases may originate from primary fragmental textual pieces (e.g., segments) in the original text and are separated into different segments. However, conventional sequence-to-sequence generation methods are hard to filter out secondary information from different textual granularity and are not good at selecting crucial tokens. Thus, they are suboptimal in generating more condensed hashtags. In this work, we propose a modified Transformer-based generation model with adding a segments-selection procedure for the original encoding and decoding phases. The segments-selection phase is based on a novel Segments Selection Mechanism (SSM) to model different textual granularity on global text, local segments, and tokens, contributing to…

Code & Models

Repositories

OpenSUM/HashtagGen · tf · Official

Taxonomy

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Residual Connection · Dense Connections