Mining Implicit Relevance Feedback from User Behavior for Web Question Answering
Linjun Shou, Shining Bo, Feixiang Cheng, Ming Gong, Jian Pei, Daxin, Jiang

TL;DR
This paper explores how to mine implicit user feedback from search logs to improve passage relevance in web question answering systems, reducing reliance on human labels and enhancing multi-language QA performance.
Contribution
It is the first to study the correlation between user behavior and passage relevance, proposing a novel approach for mining training data for Web QA from implicit feedback.
Findings
Significantly improves passage ranking accuracy
Reduces human labeling costs for low-resource languages
Successfully deployed in multi-language commercial search services
Abstract
Training and refreshing a web-scale Question Answering (QA) system for a multi-lingual commercial search engine often requires a huge amount of training examples. One principled idea is to mine implicit relevance feedback from user behavior recorded in search engine logs. All previous works on mining implicit relevance feedback target at relevance of web documents rather than passages. Due to several unique characteristics of QA tasks, the existing user behavior models for web documents cannot be applied to infer passage relevance. In this paper, we make the first study to explore the correlation between user behavior and passage relevance, and propose a novel approach for mining training data for Web QA. We conduct extensive experiments on four test datasets and the results show our approach significantly improves the accuracy of passage ranking without extra human labeled data. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
