A Practical Toolkit for Multilingual Question and Answer Generation

Asahi Ushio; Fernando Alva-Manchego; Jose Camacho-Collados

arXiv:2305.17416·cs.CL·May 30, 2023·2 cites

A Practical Toolkit for Multilingual Question and Answer Generation

Asahi Ushio, Fernando Alva-Manchego, Jose Camacho-Collados

PDF

Open Access 1 Repo

TL;DR

This paper introduces AutoQG, a multilingual question and answer generation toolkit with models in eight languages, providing both online and local options for practitioners to generate structured QA pairs easily.

Contribution

It presents AutoQG and lmqg, a comprehensive toolkit and models for multilingual question-answer generation, facilitating easy access and customization for users.

Findings

01

Released QA models in eight languages

02

Developed an online service and Python package for QAG

03

Enabled easy fine-tuning and evaluation of models

Abstract

Generating questions along with associated answers from a text has applications in several domains, such as creating reading comprehension tests for students, or improving document search by providing auxiliary questions and answers based on the query. Training models for question and answer generation (QAG) is not straightforward due to the expected structured output (i.e. a list of question and answer pairs), as it requires more than generating a single sentence. This results in a small number of publicly accessible QAG models. In this paper, we introduce AutoQG, an online service for multilingual QAG, along with lmqg, an all-in-one Python package for model fine-tuning, generation, and evaluation. We also release QAG models in eight languages fine-tuned on a few variants of pre-trained encoder-decoder language models, which can be used online via AutoQG or locally via lmqg. With these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asahi417/lm-question-generation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methodstravel james