Learning Visual Features from Snapshots for Web Search
Yixing Fan, Jiafeng Guo, Yanyan Lan, Jun Xu, Liang Pang, Xueqi Cheng

TL;DR
This paper introduces a novel approach to Web search ranking by automatically learning visual features from Web page snapshots, capturing layout information that complements traditional relevance signals.
Contribution
The work proposes a new visual perception model inspired by human visual search, enabling end-to-end learning of visual features from Web page layouts for relevance ranking.
Findings
Visual features significantly improve ranking performance.
The model effectively captures layout-based relevance signals.
Efficient online acquisition of visual features is feasible.
Abstract
When applying learning to rank algorithms to Web search, a large number of features are usually designed to capture the relevance signals. Most of these features are computed based on the extracted textual elements, link analysis, and user logs. However, Web pages are not solely linked texts, but have structured layout organizing a large variety of elements in different styles. Such layout itself can convey useful visual information, indicating the relevance of a Web page. For example, the query-independent layout (i.e., raw page layout) can help identify the page quality, while the query-dependent layout (i.e., page rendered with matched query words) can further tell rich structural information (e.g., size, position and proximity) of the matching signals. However, such visual information of layout has been seldom utilized in Web search in the past. In this work, we propose to learn…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
