Quality-Aware Translation Tagging in Multilingual RAG system

Hoyeon Moon; Byeolhee Kim; Nikhil Verma

arXiv:2510.23070 · cs.CL · October 28, 2025

TL;DR

This paper introduces QTT-RAG, a method that evaluates and tags translation quality in multilingual retrieval-augmented generation, improving factual accuracy and naturalness in low-resource language QA tasks.

Contribution

QTT-RAG explicitly assesses translation quality along three dimensions and attaches the resulting scores as metadata to improve multilingual response generation, without altering the translated content itself.

Findings

1. QTT-RAG outperforms baseline models in low-resource language QA benchmarks.

2. The approach preserves factual integrity and translation reliability.

3. Effective across multiple languages and model sizes.

Abstract

Multilingual Retrieval-Augmented Generation (mRAG) often retrieves English documents and translates them into the query language for low-resource settings. However, poor translation quality degrades response generation performance. Existing approaches either assume that translation quality is sufficient or rewrite the translated text, which introduces factual distortion and hallucinations. To mitigate these problems, we propose Quality-Aware Translation Tagging in mRAG (QTT-RAG), which explicitly evaluates translation quality along three dimensions (semantic equivalence, grammatical accuracy, and naturalness and fluency) and attaches these scores as metadata without altering the original content. We evaluate QTT-RAG against CrossRAG and DKM-RAG as baselines on two open-domain QA benchmarks (XORQA, MKQA) using six instruction-tuned LLMs ranging from 2.4B to 14B parameters, covering two low-resource…
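The tagging idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `QualityScores`, `tag_passage`, and `build_context`, the bracketed header format, and the numeric score values are all assumptions made for illustration, and the scores are taken as given rather than produced by an evaluator. The only property carried over from the abstract is that the three scores travel alongside the translated passage as metadata while the passage text stays unchanged.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QualityScores:
    """Hypothetical container for the three dimensions named in the abstract."""
    semantic_equivalence: float
    grammatical_accuracy: float
    naturalness_fluency: float


def tag_passage(translated_text: str, scores: QualityScores) -> str:
    """Prefix a translated passage with its scores; the text itself is not modified.

    The header layout is an assumption, not the format used in the paper.
    """
    header = (
        f"[semantic_equivalence={scores.semantic_equivalence:.1f} "
        f"grammatical_accuracy={scores.grammatical_accuracy:.1f} "
        f"naturalness_fluency={scores.naturalness_fluency:.1f}]"
    )
    return f"{header}\n{translated_text}"


def build_context(passages: list[tuple[str, QualityScores]]) -> str:
    """Join tagged passages into one context string for the generator prompt."""
    return "\n\n".join(tag_passage(text, scores) for text, scores in passages)


if __name__ == "__main__":
    # Placeholder passages and scores, purely illustrative.
    context = build_context([
        ("First translated passage.", QualityScores(4.0, 3.5, 2.0)),
        ("Second translated passage.", QualityScores(2.0, 4.0, 3.0)),
    ])
    print(context)
```

Because the scores are prepended rather than used to rewrite the passage, the generator sees the original translation together with a signal about how far to trust it, which is the contrast with rewriting-based approaches that the abstract draws.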
