News Meets Microblog: Hashtag Annotation via Retriever-Generator
Xiuwen Zheng, Dheeraj Mekala, Amarnath Gupta, Jingbo Shang

TL;DR
This paper introduces a novel method for hashtag annotation in microblogs by leveraging news articles with a Retriever-Generator framework, outperforming previous conversation-based approaches.
Contribution
It proposes using pre-existing news articles instead of conversations to improve hashtag generation, addressing a key limitation of prior methods.
Findings
Outperforms conversation-based methods in hashtag accuracy
Leveraging news articles significantly improves hashtag generation
Demonstrates effectiveness on English Twitter datasets
Abstract
Hashtag annotation for microblog posts has been recently formulated as a sequence generation problem to handle emerging hashtags that are unseen in the training set. The state-of-the-art method leverages conversations initiated by posts to enrich contextual information for the short posts. However, it is unrealistic to assume the existence of conversations before the hashtag annotation itself. Therefore, we propose to leverage news articles published before the microblog post to generate hashtags following a Retriever-Generator framework. Extensive experiments on English Twitter datasets demonstrate superior performance and significant advantages of leveraging news articles to generate hashtags.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Web Data Mining and Analysis
