Incorporating Semantic Knowledge into Latent Matching Model in Search

Shuxin Wang; Xin Jiang; Hang Li; Jun Xu; Bin Wang

arXiv:1604.06270·cs.IR·April 22, 2016

Incorporating Semantic Knowledge into Latent Matching Model in Search

Shuxin Wang, Xin Jiang, Hang Li, Jun Xu, Bin Wang

PDF

Open Access

TL;DR

This paper enhances latent matching models in search by integrating semantic knowledge, improving accuracy especially for tail queries with limited click data, through novel regularization techniques and optimization methods.

Contribution

It introduces a new approach to incorporate semantic knowledge into latent matching models, addressing data sparsity issues for tail queries.

Findings

01

Semantic knowledge improves matching accuracy.

02

Model performs well on tail queries.

03

Significant accuracy gains demonstrated on real datasets.

Abstract

The relevance between a query and a document in search can be represented as matching degree between the two objects. Latent space models have been proven to be effective for the task, which are often trained with click-through data. One technical challenge with the approach is that it is hard to train a model for tail queries and tail documents for which there are not enough clicks. In this paper, we propose to address the challenge by learning a latent matching model, using not only click-through data but also semantic knowledge. The semantic knowledge can be categories of queries and documents as well as synonyms of words, manually or automatically created. Specifically, we incorporate semantic knowledge into the objective function by including regularization terms. We develop two methods to solve the learning task on the basis of coordinate descent and gradient descent respectively,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Topic Modeling · Information Retrieval and Search Behavior