TL;DR
Embed2Detect is a novel event detection method for social media that leverages word embeddings and hierarchical clustering to incorporate semantic features, outperforming existing approaches in sports and political datasets.
Contribution
The paper introduces Embed2Detect, a new approach combining word embeddings with hierarchical clustering for improved semantic-aware event detection in social media.
Findings
Achieved 27% higher F-measure on sports data
Achieved 29% higher F-measure on political data
Outperforms several state-of-the-art methods
Abstract
Social media is becoming a primary medium to discuss what is happening around the world. Therefore, the data generated by social media platforms contain rich information which describes the ongoing events. Further, the timeliness associated with these data is capable of facilitating immediate insights. However, considering the dynamic nature and high volume of data production in social media data streams, it is impractical to filter the events manually and therefore, automated event detection mechanisms are invaluable to the community. Apart from a few notable exceptions, most previous research on automated event detection have focused only on statistical and syntactical features in data and lacked the involvement of underlying semantics which are important for effective information retrieval from text since they represent the connections between words and their meanings. In this paper,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
