Feature Engineering in Learning-to-Rank for Community Question Answering Task
Nafis Sajid, Md Rashidul Hasan, Muhammad Ibrahim

TL;DR
This paper explores feature engineering for learning-to-rank in community question answering, introducing a BERT-based semantic feature and combining question and answer features to improve ranking performance.
Contribution
It introduces a BERT-based semantic similarity feature and combines question and answer features, providing a comprehensive empirical analysis with various rank-learning algorithms in CQA.
Findings
Achieves state-of-the-art results on three CQA datasets.
Highlights the importance of combining question and answer features.
Demonstrates effectiveness of BERT-based semantic features in ranking.
Abstract
Community question answering (CQA) forums are Internet-based platforms where users ask questions about a topic and other expert users try to provide solutions. Many CQA forums such as Quora, Stackoverflow, Yahoo!Answer, StackExchange exist with a lot of user-generated data. These data are leveraged in automated CQA ranking systems where similar questions (and answers) are presented in response to the query of the user. In this work, we empirically investigate a few aspects of this domain. Firstly, in addition to traditional features like TF-IDF, BM25 etc., we introduce a BERT-based feature that captures the semantic similarity between the question and answer. Secondly, most of the existing research works have focused on features extracted only from the question part; features extracted from answers have not been explored extensively. We combine both types of features in a linear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems
