Relating Web pages to enable information-gathering tasks
Amitabha Bagchi, Garima Lahoti

TL;DR
This paper introduces a novel approach to relate web pages based on user intent, using textual and link analysis to improve information-gathering tasks, and evaluates its effectiveness against existing methods.
Contribution
It defines three types of intentional relationships between web pages and develops scoring mechanisms using subnetworks and flow computations, enhancing web page relatedness measures.
Findings
The proposed scoring mechanisms outperform existing methods in relevance.
User evaluations show improved retrieval of related pages.
The approach effectively captures different user intents in web navigation.
Abstract
We argue that relationships between Web pages are functions of the user's intent. We identify a class of Web tasks - information-gathering - that can be facilitated by a search engine that provides links to pages which are related to the page the user is currently viewing. We define three kinds of intentional relationships that correspond to whether the user is a) seeking sources of information, b) reading pages which provide information, or c) surfing through pages as part of an extended information-gathering process. We show that these three relationships can be productively mined using a combination of textual and link information and provide three scoring mechanisms that correspond to them: {\em SeekRel}, {\em FactRel} and {\em SurfRel}. These scoring mechanisms incorporate both textual and link information. We build a set of capacitated subnetworks - each corresponding to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Web Data Mining and Analysis · Information Retrieval and Search Behavior
