Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in   Lower-Resource Languages

Tsan Tsai Chan; Xin Tong; Thi Thu Uyen Hoang; Barbare Tepnadze,; Wojciech Stempniak

arXiv:2502.17664·cs.CL·March 11, 2025

Towards Typologically Aware Rescoring to Mitigate Unfaithfulness in Lower-Resource Languages

Tsan Tsai Chan, Xin Tong, Thi Thu Uyen Hoang, Barbare Tepnadze,, Wojciech Stempniak

PDF

TL;DR

This paper explores using lightweight auxiliary models to rescore and improve faithfulness in multilingual language models, especially for resource-limited and typologically diverse languages, demonstrating promising results across multiple languages and tasks.

Contribution

It introduces a typologically aware rescoring approach using small BERT models trained from scratch, showing effectiveness in identifying faithful summaries without fine-tuning.

Findings

01

Small BERT models achieve 88.33% accuracy in faithfulness detection.

02

Morphologically complex languages benefit from dropout regularization.

03

Shallow architectures and standard BERT training improve downstream performance.

Abstract

Multilingual large language models (LLMs) are known to more frequently generate non-faithful output in resource-constrained languages (Guerreiro et al., 2023 - arXiv:2303.16104), potentially because these typologically diverse languages are underrepresented in their training data. To mitigate unfaithfulness in such settings, we propose using computationally light auxiliary models to rescore the outputs of larger architectures. As proof of the feasibility of such an approach, we show that monolingual 4-layer BERT models pretrained from scratch on less than 700 MB of data without fine-tuning are able to identify faithful summaries with a mean accuracy of 88.33% in three genetically unrelated languages that differ in their morphological complexity - Vietnamese, Polish and Georgian. The same hyperparameter combination moreover generalises well to three other tasks, suggesting applications…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAdam · Softmax · Dropout · Weight Decay · Dense Connections · Attention Dropout · Linear Layer · Layer Normalization · Residual Connection · Refunds@Expedia|||How do I get a full refund from Expedia?