Microblog Topic Identification using Linked Open Data
A. Y{\i}ld{\i}r{\i}m, S. Uskudarli

TL;DR
This paper introduces a novel method for identifying semantically rich topics from microblog posts using Linked Open Data and entity linking, enabling automated and human-understandable topic extraction from large Twitter datasets.
Contribution
The work presents a new approach that leverages Linked Open Data and ontologies to extract and represent semantic topics from microblog collections, improving automated topic understanding.
Findings
Achieved 81.0% precision and 93.3% F1 score in human evaluation.
Generated over 5,000 semantic topics from 11 Twitter datasets.
Demonstrated the usefulness of semantic topics in revealing hidden information.
Abstract
The extensive use of social media for sharing and obtaining information has resulted in the development of topic detection models to facilitate the comprehension of the overwhelming amount of short and distributed posts. Probabilistic topic models, such as Latent Dirichlet Allocation, and matrix factorization based approaches such as Latent Semantic Analysis and Non-negative Matrix Factorization represent topics as sets of terms that are useful for many automated processes. However, the determination of what a topic is about is left as a further task. Alternatively, techniques that produce summaries are human comprehensible, but less suitable for automated processing. This work proposes an approach that utilizes Linked Open Data (LOD) resources to extract semantically represented topics from collections of microposts. The proposed approach utilizes entity linking to identify the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
