ConveRT for FAQ Answering

Maxime De Bruyn; Ehsan Lotfi; Jeska Buhmann; Walter Daelemans

arXiv:2108.00719·cs.CL·October 15, 2021·1 cites

ConveRT for FAQ Answering

Maxime De Bruyn, Ehsan Lotfi, Jeska Buhmann, Walter Daelemans

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new pre-training method to adapt the ConveRT conversational retrieval model for multilingual FAQ answering, demonstrated on Dutch COVID-19 vaccine FAQs, outperforming existing open-source models.

Contribution

A novel pre-training approach for adapting ConveRT to low-resource languages, applied to Dutch FAQ answering about COVID-19 vaccines.

Findings

01

Outperforms open-source alternatives in low-data settings

02

Effective in high-data regimes as well

03

First application of ConveRT adaptation to Dutch FAQ task

Abstract

Knowledgeable FAQ chatbots are a valuable resource to any organization. While powerful and efficient retrieval-based models exist for English, it is rarely the case for other languages for which the same amount of training data is not available. In this paper, we propose a novel pre-training procedure to adapt ConveRT, an English conversational retriever model, to other languages with less training data available. We apply it for the first time to the task of Dutch FAQ answering related to the COVID-19 vaccine. We show it performs better than an open-source alternative in both a low-data regime and a high-data regime.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clips/ADATaLKS
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications