Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation

Keunhyeung Park; Seunguk Yu; Youngbin Kim

arXiv:2511.06680·cs.CL·March 18, 2026

Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation

Keunhyeung Park, Seunguk Yu, Youngbin Kim

PDF

Open Access

TL;DR

This paper introduces DIA-REFINE, an iterative framework guiding large language models to produce faithful Korean dialect translations, addressing dialect gaps and evaluation issues with new metrics and feedback loops.

Contribution

The paper proposes DIA-REFINE, a novel iterative refinement framework with new metrics for dialect translation, improving faithfulness and evaluation accuracy in Korean dialect translation tasks.

Findings

01

DIA-REFINE improves dialect fidelity across experiments.

02

New metrics distinguish true dialect translation from false positives.

03

In-context examples enhance dialect translation performance.

Abstract

Standard-to-dialect machine translation remains challenging due to a persistent dialect gap in large language models and evaluation distortions inherent in n-gram metrics, which favor source copying over authentic dialect translation. In this paper, we propose the dialect refinement (DIA-REFINE) framework, which guides LLMs toward faithful target dialect outputs through an iterative loop of translation, verification, and feedback using external dialect classifiers. To address the limitations of n-gram-based metrics, we introduce the dialect fidelity score (DFS) to quantify linguistic shift and the target dialect ratio (TDR) to measure the success of dialect translation. Experiments on Korean dialects across zero-shot and in-context learning baselines demonstrate that DIA-REFINE consistently enhances dialect fidelity. The proposed metrics distinguish between False Success cases, where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Topic Modeling