Boosting Entity Mention Detection for Targetted Twitter Streams with Global Contextual Embeddings
Satadisha Saha Bhowmick, Eduard C. Dragut, Weiyi Meng

TL;DR
This paper introduces EMD Globalizer, a framework that enhances entity mention detection in Twitter streams by leveraging global context embeddings, significantly improving detection accuracy over existing methods.
Contribution
The paper proposes a novel global embedding approach that improves entity mention detection in microblog streams by aggregating context across messages, outperforming existing systems.
Findings
Enhances existing EMD systems by an average of 25.61% in accuracy.
Leverages global context embeddings to improve detection of entity mentions.
Adds minimal computational overhead to current methods.
Abstract
Microblogging sites, like Twitter, have emerged as ubiquitous sources of information. Two important tasks related to the automatic extraction and analysis of information in Microblogs are Entity Mention Detection (EMD) and Entity Detection (ED). The state-of-the-art EMD systems aim to model the non-literary nature of microblog text by training upon offline static datasets. They extract a combination of surface-level features -- orthographic, lexical, and semantic -- from individual messages for noisy text modeling and entity extraction. But given the constantly evolving nature of microblog streams, detecting all entity mentions from such varying yet limited context of short messages remains a difficult problem. To this end, we propose a framework named EMD Globalizer, better suited for the execution of EMD learners on microblog streams. It deviates from the processing of isolated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Web Data Mining and Analysis · Sentiment Analysis and Opinion Mining
