User Model-Based Intent-Aware Metrics for Multilingual Search Evaluation
Alexey Drutsa (Yandex, Moscow, Russia), Andrey Shutovich (Yandex,, Moscow, Russia), Philipp Pushnyakov (Yandex, Moscow, Russia), Evgeniy, Krokhalyov (Yandex, Moscow, Russia), Gleb Gusev (Yandex, Moscow, Russia),, Pavel Serdyukov (Yandex, Moscow, Russia)

TL;DR
This paper introduces a new intent-aware metric for multilingual search evaluation that accounts for user language preferences and improves correlation with online satisfaction metrics.
Contribution
It develops a novel user behavior model for multilingual search, addressing limitations of existing models and enhancing offline evaluation accuracy.
Findings
New intent-aware metric correlates better with online satisfaction
Proposed user behavior model overcomes previous limitations
Improved offline evaluation for multilingual search quality
Abstract
Despite the growing importance of multilingual aspect of web search, no appropriate offline metrics to evaluate its quality are proposed so far. At the same time, personal language preferences can be regarded as intents of a query. This approach translates the multilingual search problem into a particular task of search diversification. Furthermore, the standard intent-aware approach could be adopted to build a diversified metric for multilingual search on the basis of a classical IR metric such as ERR. The intent-aware approach estimates user satisfaction under a user behavior model. We show however that the underlying user behavior models is not realistic in the multilingual case, and the produced intent-aware metric do not appropriately estimate the user satisfaction. We develop a novel approach to build intent-aware user behavior models, which overcome these limitations and convert…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Recommender Systems and Techniques · Expert finding and Q&A systems
