Polite on the Surface, Wrong in Practice: A Curated Dataset for Fixing Honorific Failures in Multilingual Bangla Generation

Md. Asaduzzaman Shuvo; Mahedi Hasan; Md. Tashin Parvez; Azizul Haque Noman; Md. Shafayet Hossain Ovi

arXiv:2605.22487·cs.CL·May 22, 2026

TL;DR

This paper introduces BLADE, a curated dataset and benchmarking framework for improving honorific and cultural accuracy in Bangla language generation by fine-tuning open-weight models.

Contribution

It presents a novel dataset and evaluation framework for culturally aligned Bangla dialogue generation, enabling systematic fine-tuning of large language models.

Findings

01. Models fine-tuned on BLADE show improved honorific and structural fidelity.

02. Parameter-efficient fine-tuning with LoRA enhances model performance.

03. The dataset provides a rigorous benchmark for low-resource multilingual text generation.

Abstract

Recent advances in Multilingual Large Language Models (MLLMs) have significantly enhanced cross-lingual conversational capabilities, yet modeling culturally nuanced and context-dependent communication remains a critical bottleneck. Specifically, existing state-of-the-art models exhibit a severe pragmatic gap when handling structural variations, regional idioms, and honorific consistency in low-resource contexts like Bangla. To address this limitation, we introduce BLADE (BangLa Application and DialoguE generation), a novel, culturally aligned instruction-tuning dataset and benchmarking framework comprising 4,196 meticulously curated interaction pairs. We leverage this resource to systematically fine-tune and evaluate leading open-weight architectures, including DeepSeek-8B and LLaMA-3.2-3B, utilizing parameter-efficient fine-tuning via LoRA adapters in a 4-bit…
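For readers unfamiliar with the LoRA adapters mentioned in the abstract, the following is a minimal pure-Python sketch of the general idea: the base weight matrix stays frozen and only a low-rank product is trained and added to it. It is not the authors' training code. The function names, matrix sizes, rank, and scaling value are all illustrative assumptions, and the 4-bit quantization of the base weights is not modeled here.

```python
# Minimal LoRA sketch in pure Python (no external libraries).
# All names and sizes are illustrative assumptions, not values from the paper.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where r is the adapter rank.

    w: frozen base weight, shape (d_out, d_in)
    b: trainable matrix, shape (d_out, r)
    a: trainable matrix, shape (r, d_in)
    """
    r = len(a)                 # rank = number of rows in A
    scale = alpha / r          # standard LoRA scaling factor
    delta = matmul(b, a)       # low-rank update, shape (d_out, d_in)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def trainable_fraction(d_out, d_in, r):
    """Parameters in a rank-r adapter relative to the full weight matrix."""
    return (r * (d_out + d_in)) / (d_out * d_in)


# Toy example: a 2x3 frozen weight with a rank-1 adapter.
w = [[1.0, 0.0, 0.0],
     [0.0, 1.0, 0.0]]
b = [[1.0],
     [2.0]]
a = [[0.5, 0.0, 1.0]]

w_eff = lora_effective_weight(w, a, b, alpha=2.0)
print(w_eff)

# Hypothetical 4096x4096 projection with rank 16: under 1% of the
# parameters of the full matrix would be trained.
print(trainable_fraction(4096, 4096, 16))
```

The second function illustrates why this style of fine-tuning is called parameter-efficient: for the hypothetical sizes shown, the adapter holds a small fraction of the parameters in the matrix it modifies.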

Code & Models

Repositories

ashuvo25/Bangla_Application_LLM/tree/main
github

Datasets

mdshuvo25/BLADE
dataset · 30 dl
