RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts
Darya Kharlamova, Irina Proskurina

TL;DR
This paper introduces RILEC, a large dataset and a generative framework for detecting and generating Russian L1 interference errors in English learner texts, improving error identification accuracy.
Contribution
The work presents a new dataset and a novel generative approach for modeling L1 interference errors, enhancing detection and understanding of native language influence in learner English.
Findings
Models fine-tuned on RILEC perform well on interference types
Augmentation pipeline significantly improves detection accuracy
Framework effectively generates realistic L1 interference errors
Abstract
Many errors in student essays can be explained by influence from the native language (L1). L1 interference refers to errors influenced by a speaker's first language, such as using stadion instead of stadium, reflecting lexical transliteration from Russian. In this work, we address the task of detecting such errors in English essays written by Russian-speaking learners. We introduce RILEC, a large-scale dataset of over 18,000 sentences, combining expert-annotated data from REALEC with synthetic examples generated through rule-based and neural augmentation. We propose a framework for generating L1-motivated errors using generative language models optimized with PPO, prompt-based control, and rule-based patterns. Models fine-tuned on RILEC achieve strong performance, particularly on word-level interference types such as transliteration and tense semantics. We find that the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText Readability and Simplification · Second Language Acquisition and Learning · Natural Language Processing Techniques
