CHAI for LLMs: Improving Code-Mixed Translation in Large Language Models through Reinforcement Learning with AI Feedback

Wenbo Zhang; Aditya Majumdar; Amulya Yadav

arXiv:2411.09073·cs.CL·July 10, 2025

CHAI for LLMs: Improving Code-Mixed Translation in Large Language Models through Reinforcement Learning with AI Feedback

Wenbo Zhang, Aditya Majumdar, Amulya Yadav

PDF

Open Access

TL;DR

This paper introduces CHAI, a reinforcement learning framework that uses AI-generated annotations to enhance multilingual LLMs' performance on code-mixed translation tasks, significantly outperforming existing models.

Contribution

The paper presents a novel framework, CHAI, which improves code-mixed language translation in LLMs by leveraging AI feedback and reinforcement learning, filling a key research gap.

Findings

01

CHAI-powered LLMs outperform state-of-the-art open-source LLMs by 25.66% in code-mixed translation.

02

The framework effectively uses AI-generated annotations for reinforcement learning.

03

Experimental results demonstrate significant improvements across various datasets.

Abstract

Large Language Models (LLMs) have demonstrated remarkable capabilities across various NLP tasks but struggle with code-mixed (or code-switched) language understanding. For example, prior work benchmarking the performance of multilingual LLMs on code-mixed translation tasks has demonstrated that current state-of-the-art multilingual LLMs are ineffective in dealing with code-mixed languages. However, the question of how to improve the capability of multilingual LLMs to handle code-mixed language has not received any attention to date. In this paper, we tackle this research gap by proposing CHAI, a novel general-purpose framework for improving the ability of multilingual LLMs to handle code-mixed languages. CHAI relies on three novel contributions made in this paper. First, we explore the ability of LLMs to provide accurate annotations for code-mixed translation tasks. Second, we leverage…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Text Readability and Simplification