Exploiting Social Annotation for Automatic Resource Discovery
Anon Plangprasopchok, Kristina Lerman

TL;DR
This paper presents a probabilistic model leveraging social annotations from bookmarking sites like del.icio.us to improve automatic resource discovery, addressing limitations of traditional search methods in uncovering hidden web sources.
Contribution
It introduces a novel probabilistic model of user annotations and demonstrates its effectiveness in automatically discovering relevant information resources.
Findings
Effective in identifying relevant resources in experiments
Outperforms traditional keyword-based search methods
Shows promise for automating resource discovery tasks
Abstract
Information integration applications, such as mediators or mashups, that require access to information resources currently rely on users manually discovering and integrating them in the application. Manual resource discovery is a slow process, requiring the user to sift through results obtained via keyword-based search. Although search methods have advanced to include evidence from document contents, its metadata and the contents and link structure of the referring pages, they still do not adequately cover information sources -- often called ``the hidden Web''-- that dynamically generate documents in response to a query. The recently popular social bookmarking sites, which allow users to annotate and share metadata about various information sources, provide rich evidence for resource discovery. In this paper, we describe a probabilistic model of the user annotation process in a social…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPeer-to-Peer Network Technologies · Web Data Mining and Analysis · Spam and Phishing Detection
