Feature Engineering in Learning-to-Rank for Community Question Answering   Task

Nafis Sajid; Md Rashidul Hasan; Muhammad Ibrahim

arXiv:2309.07610·cs.LG·September 15, 2023

Feature Engineering in Learning-to-Rank for Community Question Answering Task

Nafis Sajid, Md Rashidul Hasan, Muhammad Ibrahim

PDF

Open Access

TL;DR

This paper explores feature engineering for learning-to-rank in community question answering, introducing a BERT-based semantic feature and combining question and answer features to improve ranking performance.

Contribution

It introduces a BERT-based semantic similarity feature and combines question and answer features, providing a comprehensive empirical analysis with various rank-learning algorithms in CQA.

Findings

01

Achieves state-of-the-art results on three CQA datasets.

02

Highlights the importance of combining question and answer features.

03

Demonstrates effectiveness of BERT-based semantic features in ranking.

Abstract

Community question answering (CQA) forums are Internet-based platforms where users ask questions about a topic and other expert users try to provide solutions. Many CQA forums such as Quora, Stackoverflow, Yahoo!Answer, StackExchange exist with a lot of user-generated data. These data are leveraged in automated CQA ranking systems where similar questions (and answers) are presented in response to the query of the user. In this work, we empirically investigate a few aspects of this domain. Firstly, in addition to traditional features like TF-IDF, BM25 etc., we introduce a BERT-based feature that captures the semantic similarity between the question and answer. Secondly, most of the existing research works have focused on features extracted only from the question part; features extracted from answers have not been explored extensively. We combine both types of features in a linear…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExpert finding and Q&A systems