Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation

Chentianye Xu; Jionghao Lin; Tongshuang Wu; Vincent Aleven; Kenneth R. Koedinger

arXiv:2501.09824·cs.HC·October 21, 2025

Improving Automated Feedback Systems for Tutor Training in Low-Resource Scenarios through Data Augmentation

Chentianye Xu, Jionghao Lin, Tongshuang Wu, Vincent Aleven, Kenneth R. Koedinger

PDF

Open Access

TL;DR

This paper presents a data augmentation method using GPT-4o to generate synthetic training data, significantly improving automated feedback systems for tutor training in low-resource settings.

Contribution

It introduces a novel approach of leveraging GPT-4o for synthetic data generation to enhance tutor feedback models with limited labeled data.

Findings

01

Synthetic data improves praise type identification accuracy.

02

Augmentation enhances model performance in low-resource scenarios.

03

Method reduces reliance on extensive manual labeling.

Abstract

Tutoring is an effective instructional method for enhancing student learning, yet its success relies on the skill and experience of the tutors. This reliance presents challenges for the widespread implementation of tutoring, particularly in training novice tutors. To support tutor training programs, real-time automated feedback systems are essential for efficiently training large numbers of tutors. Lin et al.'s previous study employed Generative Pre-Trained Transformers (GPT) for sequence labeling to identify desirable and undesirable praise components in a tutor training dataset, providing explanatory feedback. However, this approach requires a significant amount of labeled data for fine-tuning, which is both labor-intensive and dependent on expert input. To address the challenges associated with extensive data labeling, the current study explores the use of prompting more advanced GPT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning