EduCoder: An Open-Source Annotation System for Education Transcript Data
Saad Ashraf, James Malamut, Vishal Kumar, Guanzhong Pan, Hyunji Nam, Mei Tan, Luc\'ia Langlois, Liliana Deonizio, Helen Higgins, Dorottya Demszky

TL;DR
EduCoder is an open-source annotation platform tailored for educational dialogue transcripts, enabling complex, collaborative, and reliable coding of pedagogical features with contextual support.
Contribution
It introduces a specialized tool that addresses the unique challenges of annotating educational dialogue, supporting complex codebooks and collaborative annotation.
Findings
Supports both categorical and open-ended annotations.
Allows comparison and calibration among multiple annotators.
Facilitates contextualized coding with external features.
Abstract
We introduce EduCoder, a domain-specialized tool designed to support utterance-level annotation of educational dialogue. While general-purpose text annotation tools for NLP and qualitative research abound, few address the complexities of coding education dialogue transcripts -- with diverse teacher-student and peer interactions. Common challenges include defining codebooks for complex pedagogical features, supporting both open-ended and categorical coding, and contextualizing utterances with external features, such as the lesson's purpose and the pedagogical value of the instruction. EduCoder is designed to address these challenges by providing a platform for researchers and domain experts to collaboratively define complex codebooks based on observed data. It incorporates both categorical and open-ended annotation types along with contextual materials. Additionally, it offers a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
