MOOCdb: Developing Standards and Systems to Support MOOC Data Science
Kalyan Veeramachaneni, Sherif Halawa, Franck Dernoncourt, Una-May, O'Reilly, Colin Taylor, Chuong Do

TL;DR
This paper introduces MOOCdb, a platform-agnostic data model for capturing student interactions in MOOCs, facilitating data science research without sharing raw data.
Contribution
The paper develops a standardized, platform-independent data model for MOOC interactions, enabling collaborative data science frameworks.
Findings
Defines core student interaction modes: Observing, Submitting, Collaborating, Feedback.
Creates a shared terminology to map Coursera and edX data into MOOCdb.
Lays foundation for collaborative MOOC data science without data sharing.
Abstract
We present a shared data model for enabling data science in Massive Open Online Courses (MOOCs). The model captures students interactions with the online platform. The data model is platform agnostic and is based on some basic core actions that students take on an online learning platform. Students usually interact with the platform in four different modes: Observing, Submitting, Collaborating and giving feedback. In observing mode students are simply browsing the online platform, watching videos, reading material, reading book or watching forums. In submitting mode, students submit information to the platform. This includes submissions towards quizzes, homeworks, or any assessment modules. In collaborating mode students interact with other students or instructors on forums, collaboratively editing wiki or chatting on google hangout or other hangout venues. With this basic definitions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning · Scientific Computing and Data Management
