From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns

Bernardo Leite; Henrique Lopes Cardoso; Pedro Pinto; Abel Ferreira; Lu\'is Abreu; Isabel Rangel; Sandra Monteiro

arXiv:2506.15598·cs.CL·June 19, 2025

From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns

Bernardo Leite, Henrique Lopes Cardoso, Pedro Pinto, Abel Ferreira, Lu\'is Abreu, Isabel Rangel, Sandra Monteiro

PDF

Open Access

TL;DR

This study evaluates the quality of AI-generated multiple-choice questions for Portuguese reading comprehension, highlighting strengths and challenges in semantic clarity, distractor quality, and real-world applicability for elementary education.

Contribution

It provides an empirical assessment of generative AI models for Portuguese MCQ creation, focusing on narrative alignment, difficulty levels, and psychometric evaluation.

Findings

01

AI models produce MCQs comparable to human-authored ones.

02

Semantic clarity and distractor quality remain challenging.

03

Generated MCQs show potential for educational use with improvements.

Abstract

While MCQs are valuable for learning and evaluation, manually creating them with varying difficulty levels and targeted reading skills remains a time-consuming and costly task. Recent advances in generative AI provide an opportunity to automate MCQ generation efficiently. However, assessing the actual quality and reliability of generated MCQs has received limited attention -- particularly regarding cases where generation fails. This aspect becomes particularly important when the generated MCQs are meant to be applied in real-world settings. Additionally, most MCQ generation studies focus on English, leaving other languages underexplored. This paper investigates the capabilities of current generative models in producing MCQs for reading comprehension in Portuguese, a morphologically rich language. Our study focuses on generating MCQs that align with curriculum-relevant narrative elements…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques