Generate-Then-Validate: A Novel Question Generation Approach Using Small Language Models
Yumou Wei, John Stamper, Paulo F. Carvalho

TL;DR
This paper introduces a generate-then-validate pipeline using small language models for high-quality automatic question generation, demonstrating its effectiveness through expert and LLM evaluations.
Contribution
The paper presents a novel question generation pipeline that combines generation and probabilistic validation using small language models, enhancing question quality.
Findings
Generated questions had clear answers and aligned with learning objectives.
Most judges agreed on the quality and relevance of questions.
SLMs can effectively generate high-quality questions with proper guidance.
Abstract
We explore the use of small language models (SLMs) for automatic question generation as a complement to the prevalent use of their large counterparts in learning analytics research. We present a novel question generation pipeline that leverages both the text generation and the probabilistic reasoning abilities of SLMs to generate high-quality questions. Adopting a "generate-then-validate" strategy, our pipeline first performs expansive generation to create an abundance of candidate questions and refine them through selective validation based on novel probabilistic reasoning. We conducted two evaluation studies, one with seven human experts and the other with a large language model (LLM), to assess the quality of the generated questions. Most judges (humans or LLMs) agreed that the generated questions had clear answers and generally aligned well with the intended learning objectives. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Intelligent Tutoring Systems and Adaptive Learning · Expert finding and Q&A systems
