Enhancing Answer Selection in Community Question Answering with Pre-trained and Large Language Models
Xinghang Hu

TL;DR
This paper introduces a novel answer selection method in Community Question Answering using pre-trained BERT models and large language models (LLMs) for knowledge augmentation, achieving state-of-the-art results.
Contribution
It proposes the Question-Answer cross attention networks (QAN) with pre-trained models and leverages LLMs for external knowledge augmentation to improve answer selection accuracy.
Findings
QAN achieves state-of-the-art performance on SemEval datasets.
Knowledge augmentation with LLM improves answer selection accuracy.
Optimized prompts enhance LLM's ability to select correct answers.
Abstract
Community Question Answering (CQA) becomes increasingly prevalent in recent years. However, there are a large number of answers, which is difficult for users to select the relevant answers. Therefore, answer selection is a very significant subtask of CQA. In this paper, we first propose the Question-Answer cross attention networks (QAN) with pre-trained models for answer selection and utilize large language model (LLM) to perform answer selection with knowledge augmentation. Specifically, we apply the BERT model as the encoder layer to do pre-training for question subjects, question bodies and answers, respectively, then the cross attention mechanism selects the most relevant answer for different questions. Experiments show that the QAN model achieves state-of-the-art performance on two datasets, SemEval2015 and SemEval2017. Moreover, we use the LLM to generate external knowledge from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExpert finding and Q&A systems · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Weight Decay · Residual Connection · Dense Connections · Dropout · Softmax · Layer Normalization
