FairytaleQA Translated: Enabling Educational Question and Answer Generation in Less-Resourced Languages
Bernardo Leite, Tom\'as Freitas Os\'orio, Henrique Lopes Cardoso

TL;DR
This paper introduces translated versions of the FairytaleQA dataset to support question answering and generation in less-resourced languages, using fine-tuned models and providing benchmarks and error analysis.
Contribution
It presents the first machine-translated FairytaleQA datasets for underrepresented languages, along with benchmarks and a case study for QA and QG tasks.
Findings
Established baseline benchmarks for QA and QG in translated datasets.
Analyzed error cases to guide future improvements.
Demonstrated feasibility of using modest models for multilingual narrative comprehension.
Abstract
Question Answering (QA) datasets are crucial in assessing reading comprehension skills for both machines and humans. While numerous datasets have been developed in English for this purpose, a noticeable void exists in less-resourced languages. To alleviate this gap, our paper introduces machine-translated versions of FairytaleQA, a renowned QA dataset designed to assess and enhance narrative comprehension skills in young children. By employing fine-tuned, modest-scale models, we establish benchmarks for both Question Generation (QG) and QA tasks within the translated datasets. In addition, we present a case study proposing a model for generating question-answer pairs, with an evaluation incorporating quality metrics such as question well-formedness, answerability, relevance, and children suitability. Our evaluation prioritizes quantifying and describing error cases, along with providing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗benjleite/ptt5-ptpt-qgmodel· 4 dl4 dl
- 🤗benjleite/ptt5-ptbr-qgmodel· 3 dl3 dl
- 🤗benjleite/t5s-spanish-qgmodel· 3 dl· ♡ 13 dl♡ 1
- 🤗benjleite/t5-french-qgmodel· 2 dl2 dl
- 🤗benjleite/t5-english-qgmodel· 4 dl4 dl
- 🤗benjleite/ptt5-ptpt-qamodel· 1 dl1 dl
- 🤗benjleite/ptt5-ptbr-qamodel· 2 dl2 dl
- 🤗benjleite/t5s-spanish-qamodel· 7 dl7 dl
- 🤗benjleite/t5-french-qamodel· 7 dl7 dl
- 🤗benjleite/t5-english-qamodel· 2 dl2 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
