Steering LLMs toward Korean Local Speech: Iterative Refinement Framework for Faithful Dialect Translation
Keunhyeung Park, Seunguk Yu, Youngbin Kim

TL;DR
This paper introduces DIA-REFINE, an iterative framework guiding large language models to produce faithful Korean dialect translations, addressing dialect gaps and evaluation issues with new metrics and feedback loops.
Contribution
The paper proposes DIA-REFINE, a novel iterative refinement framework with new metrics for dialect translation, improving faithfulness and evaluation accuracy in Korean dialect translation tasks.
Findings
DIA-REFINE improves dialect fidelity across experiments.
New metrics distinguish true dialect translation from false positives.
In-context examples enhance dialect translation performance.
Abstract
Standard-to-dialect machine translation remains challenging due to a persistent dialect gap in large language models and evaluation distortions inherent in n-gram metrics, which favor source copying over authentic dialect translation. In this paper, we propose the dialect refinement (DIA-REFINE) framework, which guides LLMs toward faithful target dialect outputs through an iterative loop of translation, verification, and feedback using external dialect classifiers. To address the limitations of n-gram-based metrics, we introduce the dialect fidelity score (DFS) to quantify linguistic shift and the target dialect ratio (TDR) to measure the success of dialect translation. Experiments on Korean dialects across zero-shot and in-context learning baselines demonstrate that DIA-REFINE consistently enhances dialect fidelity. The proposed metrics distinguish between False Success cases, where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Topic Modeling
