Improving Bilingual Capabilities of Language Models to Support Diverse   Linguistic Practices in Education

Anand Syamkumar; Nora Tseng; Kaycie Barron; Shanglin Yang; Shamya; Karumbaiah; Rheeya Uppal; Junjie Hu

arXiv:2411.04308·cs.CL·November 8, 2024

Improving Bilingual Capabilities of Language Models to Support Diverse Linguistic Practices in Education

Anand Syamkumar, Nora Tseng, Kaycie Barron, Shanglin Yang, Shamya, Karumbaiah, Rheeya Uppal, Junjie Hu

PDF

Open Access

TL;DR

This study evaluates and improves multilingual large language models' ability to assess bilingual student writing, revealing biases and demonstrating enhanced performance after bilingual fine-tuning, thus supporting diverse linguistic practices in education.

Contribution

It identifies biases in LLM grading for bilingual writing and demonstrates improved performance through bilingual fine-tuning of open-source models.

Findings

01

Models show bias in bilingual grading performance.

02

Fine-tuning with bilingual data improves model accuracy.

03

Bilingual fine-tuning enhances support for diverse language practices.

Abstract

Large language models (LLMs) offer promise in generating educational content, providing instructor feedback, and reducing teacher workload on assessments. While prior studies have focused on studying LLM-powered learning analytics, limited research has examined how effective LLMs are in a bilingual context. In this paper, we study the effectiveness of multilingual large language models (MLLMs) across monolingual (English-only, Spanish-only) and bilingual (Spanglish) student writing. We present a learning analytics use case that details LLM performance in assessing acceptable and unacceptable explanations of Science and Social Science concepts. Our findings reveal a significant bias in the grading performance of pre-trained models for bilingual writing compared to English-only and Spanish-only writing. Following this, we fine-tune open-source MLLMs including Llama 3.1 and Mistral NeMo…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification

MethodsLLaMA