MAGMA-Edu: Multi-Agent Generative Multimodal Framework for Text-Diagram Educational Question Generation
Zhenyu Wu, Jian Li, Hua Huang

TL;DR
MAGMA-Edu is a multi-agent framework that enhances educational question generation by integrating textual reasoning and diagrammatic synthesis through a self-reflective, iterative process, significantly improving pedagogical coherence and multimodal consistency.
Contribution
It introduces a novel multi-agent, self-reflective pipeline for structured multimodal educational question generation, outperforming existing models in accuracy and consistency.
Findings
Significantly improves textual and image-text metrics over state-of-the-art models.
Achieves new benchmarks in multimodal educational content quality.
Demonstrates the effectiveness of self-reflective multi-agent collaboration.
Abstract
Educational illustrations play a central role in communicating abstract concepts, yet current multimodal large language models (MLLMs) remain limited in producing pedagogically coherent and semantically consistent educational visuals. We introduce MAGMA-Edu, a self-reflective multi-agent framework that unifies textual reasoning and diagrammatic synthesis for structured educational problem generation. Unlike existing methods that treat text and image generation independently, MAGMA-Edu employs a two-stage co-evolutionary pipeline: (1) a generation-verification-reflection loop that iteratively refines question statements and solutions for mathematical accuracy, and (2) a code-based intermediate representation that enforces geometric fidelity and semantic alignment during image rendering. Both stages are guided by internal self-reflection modules that evaluate and revise outputs until…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Generative Adversarial Networks and Image Synthesis
