MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks
Yusi Zhang, Chuanjie Liu, Angen Luo, Hui Xue, Xuan Shan, Yuxiang Luo,, Yiqian Xia, Yuanchi Yan, Haidong Wang

TL;DR
This paper introduces MIRA, a deep neural network framework that leverages multi-intention co-click information via a graph attention network to improve web document retrieval, especially for hard tail queries, with scalable online inference.
Contribution
The paper proposes a novel Multi-Intention Co-click Graph and an attention-based encoding framework that effectively incorporates co-click neighbor information for large-scale web search.
Findings
Significant improvement over baselines in offline experiments.
Effective noise reduction in co-click relation extraction.
Enhancement of search quality through key concept and query term improvements.
Abstract
We study the problem of deep recall model in industrial web search, which is, given a user query, retrieve hundreds of most relevance documents from billions of candidates. The common framework is to train two encoding models based on neural embedding which learn the distributed representations of queries and documents separately and match them in the latent semantic space. However, all the exiting encoding models only leverage the information of the document itself, which is often not sufficient in practice when matching with query terms, especially for the hard tail queries. In this work we aim to leverage the additional information for each document from its co-click neighbour to help document retrieval. The challenges include how to effectively extract information and eliminate noise when involving co-click information in deep model while meet the demands of billion-scale data size…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Domain Adaptation and Few-Shot Learning
MethodsLinear Layer · Multi-Head Attention · Residual Connection · Attention Is All You Need · Attention Dropout · Weight Decay · Adam · Softmax · WordPiece · Dense Connections
