Modeling Social Annotation: a Bayesian Approach
Anon Plangprasopchok, Kristina Lerman

TL;DR
This paper introduces a fully Bayesian probabilistic model for social annotation data that automatically determines the number of interests and topics, improving the extraction of meaningful categories from user-generated tags.
Contribution
It extends previous interest-based models to a Bayesian framework, allowing automatic estimation of the number of interests and topics from social annotations.
Findings
The Bayesian model outperforms Latent Dirichlet Allocation in topic extraction tasks.
It effectively infers resource topics from social annotations in real-world data.
The model can discover new similar resources based on inferred interests and topics.
Abstract
Collaborative tagging systems, such as Delicious, CiteULike, and others, allow users to annotate resources, e.g., Web pages or scientific papers, with descriptive labels called tags. The social annotations contributed by thousands of users, can potentially be used to infer categorical knowledge, classify documents or recommend new relevant information. Traditional text inference methods do not make best use of social annotation, since they do not take into account variations in individual users' perspectives and vocabulary. In a previous work, we introduced a simple probabilistic model that takes interests of individual annotators into account in order to find hidden topics of annotated resources. Unfortunately, that approach had one major shortcoming: the number of topics and interests must be specified a priori. To address this drawback, we extend the model to a fully Bayesian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Advanced Text Analysis Techniques
