Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme

TL;DR
This paper compares three semantic relatedness measures for tags in social bookmarking systems, analyzing their characteristics and suitability for tasks like synonym detection and hierarchy discovery using large-scale data and WordNet mapping.
Contribution
It provides a comparative analysis of tag relatedness measures and links them to semantic concepts through WordNet, highlighting their different applications in knowledge extraction.
Findings
Different measures capture distinct semantic relationships.
FolkRank is effective for discovering concept hierarchies.
Cosine similarity aids in synonym detection.
Abstract
Social bookmarking systems allow users to organise collections of resources on the Web in a collaborative fashion. The increasing popularity of these systems as well as first insights into their emergent semantics have made them relevant to disciplines like knowledge extraction and ontology learning. The problem of devising methods to measure the semantic relatedness between tags and characterizing it semantically is still largely open. Here we analyze three measures of tag relatedness: tag co-occurrence, cosine similarity of co-occurrence distributions, and FolkRank, an adaptation of the PageRank algorithm to folksonomies. Each measure is computed on tags from a large-scale dataset crawled from the social bookmarking system del.icio.us. To provide a semantic grounding of our findings, a connection to WordNet (a semantic lexicon for the English language) is established by mapping tags…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Recommender Systems and Techniques · Web Data Mining and Analysis
