A Discriminative Semantic Ranker for Question Retrieval
Yinqiong Cai, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Yanyan Lan and, Xueqi Cheng

TL;DR
This paper introduces DenseTrans, a dense-connection Transformer model that enhances semantic embeddings for question retrieval, significantly improving recall over traditional and existing embedding methods.
Contribution
The paper proposes DenseTrans, a novel densely connected Transformer that maintains discriminative power in semantic embeddings for high-recall question retrieval.
Findings
DenseTrans outperforms term-based methods in recall.
DenseTrans surpasses existing embedding-based methods.
Significant improvements demonstrated on benchmark datasets.
Abstract
Similar question retrieval is a core task in community-based question answering (CQA) services. To balance the effectiveness and efficiency, the question retrieval system is typically implemented as multi-stage rankers: The first-stage ranker aims to recall potentially relevant questions from a large repository, and the latter stages attempt to re-rank the retrieved results. Most existing works on question retrieval mainly focused on the re-ranking stages, leaving the first-stage ranker to some traditional term-based methods. However, term-based methods often suffer from the vocabulary mismatch problem, especially on short texts, which may block the re-rankers from relevant questions at the very beginning. An alternative is to employ embedding-based methods for the first-stage ranker, which compress texts into dense vectors to enhance the semantic matching. However, these methods often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Batch Normalization · Concatenated Skip Connection · 1x1 Convolution · Kaiming Initialization
