Automatic Reflection Level Classification in Hungarian Student Essays
Zsolt Csibi, M\'onika S\'andor, M\'onika Serf\H{o}z\H{o}, Kinga Gy\"ongy, Kristian Fenech

TL;DR
This study develops and compares machine learning and transformer models for automatically classifying reflection levels in Hungarian student essays, addressing class imbalance and providing a new dataset.
Contribution
First comprehensive Hungarian dataset and analysis of classical and transformer models for reflection classification in student essays.
Findings
Classical models with feature engineering achieve up to 71% performance.
Transformer models achieve 68% but better handle minority classes.
Addressing class imbalance improves overall classification robustness.
Abstract
Reflective thinking is a key competency in education, but assessing reflective writing remains a time-consuming and subjective task for education experts. While automated reflective analysis has been explored in several languages, Hungarian language was not researched extensively. In this paper, we present the first comprehensive study on automatic reflection level classification in Hungarian student essays. We used a large, expert-annotated Hungarian dataset consisting of 1,954 reflective essays collected over multiple academic years and labeled on a four-level reflection scale. We investigate two approaches: (1) classical machine learning models using TF-IDF and semantic embedding features, and (2) Hungarian-specific transformer models fine-tuned for document-level reflection classification. To address the strong class imbalance in the dataset, we systematically examine class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
