DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models

Niyati Bafna; Emily Chang; Nathaniel R. Robinson; David R. Mortensen; Kenton Murray; David Yarowsky; Hale Sirin

arXiv:2501.16581·cs.CL·October 22, 2025

DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models

Niyati Bafna, Emily Chang, Nathaniel R. Robinson, David R. Mortensen, Kenton Murray, David Yarowsky, Hale Sirin

PDF

Open Access 1 Video

TL;DR

This paper introduces DialUp, a dual approach to improve machine translation for low-resource dialects by adapting models during training and inference, leveraging linguistic regularities and synthetic data.

Contribution

The paper presents DialUp, a novel method combining training-time and inference-time adaptations to enhance MT robustness to dialectal variation.

Findings

01

Significant performance improvements across multiple dialects and language families.

02

Synthetic data exposure enhances model robustness to unseen dialects.

03

Low baseline MT performance varieties benefit most from these methods.

Abstract

Most of the world's languages and dialects are low-resource, and lack support in mainstream machine translation (MT) models. However, many of them have a closely-related high-resource language (HRL) neighbor, and differ in linguistically regular ways from it. This underscores the importance of model robustness to dialectal variation and cross-lingual generalization to the HRL dialect continuum. We present DialUp, consisting of a training-time technique for adapting a pretrained model to dialectal data (M->D), and an inference-time intervention adapting dialectal data to the model expertise (D->M). M->D induces model robustness to potentially unseen and unknown dialects by exposure to synthetic data exemplifying linguistic mechanisms of dialectal variation, whereas D->M treats dialectal divergence for known target dialects. These methods show considerable performance gains for several…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models· underline

Taxonomy

TopicsNatural Language Processing Techniques