Million Tutoring Moves (MTM): An Open Multimodal Dataset for the Science of Tutoring
Ren\'e Kizilcec, Kirk Vanacore, Zhuqian Zhou, Doug Pietrzak, Jorge Dias, Haocheng Zhang, Bakhtawar Ahtisham, Joshua Marland, Rachel Slama, Justin Reich, Kenneth Koedinger

TL;DR
The paper introduces MTM, a large-scale, multimodal dataset of tutoring interactions designed to advance research, AI development, and practice in education.
Contribution
It presents the first release of MTM, a comprehensive open dataset of 4,654 math tutoring transcripts from an online platform.
Findings
MTM v1 contains 4,654 math tutoring transcripts.
The dataset is multimodal and designed for broad research and AI applications.
MTM aims to improve tutoring practices and educational technology.
Abstract
We introduce the Million Tutoring Moves (MTM) project, an open dataset initiative aimed at advancing the science of tutoring through large-scale, reusable, and multimodal interaction data. MTM is developed within the National Tutoring Observatory (NTO), a research infrastructure designed to study authentic tutoring interactions and translate them into actionable insights for research, practice, and AI-powered educational technology development. In this paper, we present the vision behind MTM and describe MTM v1, an initial release consisting of 4,654 math tutoring transcripts from a U.S.-based nonprofit online tutoring platform. MTM v1 serves as a first step toward a broader repository that is safe, open, large-scale, broad-coverage, and multimodal. By making tutoring interactions systematically observable and analyzable, MTM aims to support research on instructional processes, improve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
