Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning

Niyati Bafna; Ryan Soh-Eun Shim; Barbara Plank; David Yarowsky; Hale Sirin

arXiv:2603.22497·cs.CL·March 25, 2026

Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning

Niyati Bafna, Ryan Soh-Eun Shim, Barbara Plank, David Yarowsky, Hale Sirin

PDF

Open Access

TL;DR

The paper introduces Rashid, a framework that uses reversible ciphering of high-resource languages to create truly unseen languages, enabling comprehensive study and evaluation of in-context learning across diverse languages and tasks.

Contribution

Rashid provides a novel method to simulate unseen languages from high-resource ones, facilitating large-scale, resource-efficient exploration of in-context language learning phenomena.

Findings

01

Rashid enables systematic evaluation of ICLL methods across multiple languages.

02

Utilizing resources can improve ICLL performance.

03

The framework reveals insights into current ICLL capabilities and limitations.

Abstract

Where there is growing interest in in-context language learning (ICLL) for unseen languages with large language models, such languages usually suffer from the lack of NLP tools, data resources, and researcher expertise. This means that progress is difficult to assess, the field does not allow for cheap large-scale experimentation, and findings on ICLL are often limited to very few languages and tasks. In light of such limitations, we introduce a framework (Rashid), for studying ICLL wherein we reversibly cipher high-resource languages (HRLs) to construct truly unseen languages with access to a wide range of resources available for HRLs, unlocking previously impossible exploration of ICLL phenomena. We use our framework to assess current methods in the field with SOTA evaluation tools and manual analysis, explore the utility of potentially expensive resources in improving ICLL, and test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Natural Language Processing Techniques · Multimodal Machine Learning Applications