Exploiting Social Annotation for Automatic Resource Discovery

Anon Plangprasopchok; Kristina Lerman

arXiv:0704.1675·cs.AI·September 8, 2016·35 cites

Exploiting Social Annotation for Automatic Resource Discovery

Anon Plangprasopchok, Kristina Lerman

PDF

Open Access

TL;DR

This paper presents a probabilistic model leveraging social annotations from bookmarking sites like del.icio.us to improve automatic resource discovery, addressing limitations of traditional search methods in uncovering hidden web sources.

Contribution

It introduces a novel probabilistic model of user annotations and demonstrates its effectiveness in automatically discovering relevant information resources.

Findings

01

Effective in identifying relevant resources in experiments

02

Outperforms traditional keyword-based search methods

03

Shows promise for automating resource discovery tasks

Abstract

Information integration applications, such as mediators or mashups, that require access to information resources currently rely on users manually discovering and integrating them in the application. Manual resource discovery is a slow process, requiring the user to sift through results obtained via keyword-based search. Although search methods have advanced to include evidence from document contents, its metadata and the contents and link structure of the referring pages, they still do not adequately cover information sources -- often called ``the hidden Web''-- that dynamically generate documents in response to a query. The recently popular social bookmarking sites, which allow users to annotate and share metadata about various information sources, provide rich evidence for resource discovery. In this paper, we describe a probabilistic model of the user annotation process in a social…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPeer-to-Peer Network Technologies · Web Data Mining and Analysis · Spam and Phishing Detection