Relevance-based Word Embedding
Hamed Zamani, W. Bruce Croft

TL;DR
This paper introduces relevance-based word embedding models designed specifically for information retrieval tasks, outperforming traditional proximity-based embeddings like word2vec and GloVe in query expansion and classification.
Contribution
The paper develops two novel relevance-based embedding models trained on query-document relevance data, tailored for IR tasks, and demonstrates their superior performance over existing models.
Findings
Relevance-based embeddings outperform word2vec and GloVe in IR tasks.
Models trained on large query-document datasets improve query expansion.
Relevance-based embeddings enhance query classification accuracy.
Abstract
Learning a high-dimensional dense representation for vocabulary terms, also known as a word embedding, has recently attracted much attention in natural language processing and information retrieval tasks. The embedding vectors are typically learned based on term proximity in a large corpus. This means that the objective in well-known word embedding algorithms, e.g., word2vec, is to accurately predict adjacent word(s) for a given word or context. However, this objective is not necessarily equivalent to the goal of many information retrieval (IR) tasks. The primary objective in various IR tasks is to capture relevance instead of term proximity, syntactic, or even semantic similarity. This is the motivation for developing unsupervised relevance-based word embedding models that learn word representations based on query-document relevance information. In this paper, we propose two learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsGloVe Embeddings
