CHAI for LLMs: Improving Code-Mixed Translation in Large Language Models through Reinforcement Learning with AI Feedback
Wenbo Zhang, Aditya Majumdar, Amulya Yadav

TL;DR
This paper introduces CHAI, a reinforcement learning framework that uses AI-generated annotations to enhance multilingual LLMs' performance on code-mixed translation tasks, significantly outperforming existing models.
Contribution
The paper presents a novel framework, CHAI, which improves code-mixed language translation in LLMs by leveraging AI feedback and reinforcement learning, filling a key research gap.
Findings
CHAI-powered LLMs outperform state-of-the-art open-source LLMs by 25.66% in code-mixed translation.
The framework effectively uses AI-generated annotations for reinforcement learning.
Experimental results demonstrate significant improvements across various datasets.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities across various NLP tasks but struggle with code-mixed (or code-switched) language understanding. For example, prior work benchmarking the performance of multilingual LLMs on code-mixed translation tasks has demonstrated that current state-of-the-art multilingual LLMs are ineffective in dealing with code-mixed languages. However, the question of how to improve the capability of multilingual LLMs to handle code-mixed language has not received any attention to date. In this paper, we tackle this research gap by proposing CHAI, a novel general-purpose framework for improving the ability of multilingual LLMs to handle code-mixed languages. CHAI relies on three novel contributions made in this paper. First, we explore the ability of LLMs to provide accurate annotations for code-mixed translation tasks. Second, we leverage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification
