Sentence-level Feedback Generation for English Language Learners: Does Data Augmentation Help?

Shabnam Behzad; Amir Zeldes; Nathan Schneider

arXiv:2212.08999·cs.CL·December 20, 2022

Open Access

TL;DR

This paper investigates whether data augmentation with pseudo datasets improves feedback comment generation for English language learners, using large language models as baselines and analyzing the generated comments in detail.

Contribution

It introduces strong baselines for feedback comment generation and explores the impact of data augmentation with pseudo datasets on system performance.

Findings

01. Data augmentation improves feedback comment quality.

02. Large language models outperform traditional methods.

03. Extensive analysis guides future research in feedback generation.

Abstract

In this paper, we present strong baselines for the task of Feedback Comment Generation for Writing Learning. Given a sentence and an error span, the task is to generate a feedback comment explaining the error. Sentences and feedback comments are both in English. We experiment with LLMs and also create multiple pseudo datasets for the task, investigating how they affect the performance of our system. We present our results for the task, along with extensive analysis of the generated comments, with the aim of aiding future studies in feedback comment generation for English language learners.
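
The task input described in the abstract (a sentence plus an error span) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `FeedbackExample` class, the character-offset span representation, the `[ERR]` markers, and the example sentence are hypothetical and are not taken from the paper or its dataset. The sketch only shows one plausible way to make the span visible to a text-to-text model before it generates a comment.

```python
from dataclasses import dataclass


@dataclass
class FeedbackExample:
    """Hypothetical container for one task input (not the paper's format)."""
    sentence: str    # learner sentence, in English
    span_start: int  # character offset where the error span begins (inclusive)
    span_end: int    # character offset where the error span ends (exclusive)


def mark_error_span(ex: FeedbackExample,
                    open_tag: str = "[ERR]",
                    close_tag: str = "[/ERR]") -> str:
    """Wrap the error span in marker tokens so a model can locate it.

    The marker strings are illustrative assumptions.
    """
    if not (0 <= ex.span_start < ex.span_end <= len(ex.sentence)):
        raise ValueError("error span must lie inside the sentence")
    before = ex.sentence[:ex.span_start]
    span = ex.sentence[ex.span_start:ex.span_end]
    after = ex.sentence[ex.span_end:]
    return f"{before}{open_tag} {span} {close_tag}{after}"


# Invented example sentence; characters 2..10 cover "am agree".
example = FeedbackExample("I am agree with you.", 2, 10)
marked = mark_error_span(example)
print(marked)  # I [ERR] am agree [/ERR] with you.
```

The marked string would then serve as the source side of a generation model, with the feedback comment as the target; whether the authors encode the span this way is not stated in the abstract.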

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications