Incremental Improvement of a Question Answering System by Re-ranking   Answer Candidates using Machine Learning

Michael Barz; Daniel Sonntag

arXiv:1908.10149·cs.LG·June 17, 2021

Incremental Improvement of a Question Answering System by Re-ranking Answer Candidates using Machine Learning

Michael Barz, Daniel Sonntag

PDF

TL;DR

This paper presents a re-ranking method using machine learning to improve answer selection in a question answering system without retraining the entire model, demonstrating a 9.15% boost in mean reciprocal rank.

Contribution

It introduces a novel re-ranking approach for QA systems that enhances answer accuracy using n-gram features and gradient boosting, applicable post-deployment.

Findings

01

Re-ranking improves top-n accuracy and mean reciprocal rank.

02

The approach benefits deployed QA systems without retraining.

03

Achieved a 9.15% increase in mean reciprocal rank.

Abstract

We implement a method for re-ranking top-10 results of a state-of-the-art question answering (QA) system. The goal of our re-ranking approach is to improve the answer selection given the user question and the top-10 candidates. We focus on improving deployed QA systems that do not allow re-training or re-training comes at a high cost. Our re-ranking approach learns a similarity function using n-gram based features using the query, the answer and the initial system confidence as input. Our contributions are: (1) we generate a QA training corpus starting from 877 answers from the customer care domain of T-Mobile Austria, (2) we implement a state-of-the-art QA pipeline using neural sentence embeddings that encode queries in the same space than the answer index, and (3) we evaluate the QA pipeline and our re-ranking approach using a separately provided test set. The test set can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.