BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation

Amit Bin Tariqul; A N M Zahid Hossain Milkan; Sahab-Al-Chowdhury; Syed Rifat Raiyan; Hasan Mahmud; Md Kamrul Hasan

arXiv:2601.04534·cs.CL·January 9, 2026

BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation

Amit Bin Tariqul, A N M Zahid Hossain Milkan, Sahab-Al-Chowdhury, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan

PDF

Open Access

TL;DR

This paper evaluates and improves watermarking techniques for Bangla LLM-generated text, addressing robustness issues under translation attacks by proposing a layered watermarking strategy that enhances detection accuracy in low-resource language settings.

Contribution

It systematically assesses existing watermarking methods for Bangla, identifies their vulnerabilities under translation attacks, and introduces a layered watermarking approach that significantly improves robustness without retraining.

Findings

01

KGW and EXP achieve >88% detection accuracy under benign conditions.

02

RTT attacks reduce detection accuracy to 9-13%.

03

Layered watermarking improves post-RTT detection accuracy by 25-35%.

Abstract

As large language models (LLMs) are increasingly deployed for text generation, watermarking has become essential for authorship attribution, intellectual property protection, and misuse detection. While existing watermarking methods perform well in high-resource languages, their robustness in low-resource languages remains underexplored. This work presents the first systematic evaluation of state-of-the-art text watermarking methods: KGW, Exponential Sampling (EXP), and Waterfall, for Bangla LLM text generation under cross-lingual round-trip translation (RTT) attacks. Under benign conditions, KGW and EXP achieve high detection accuracy (>88%) with negligible perplexity and ROUGE degradation. However, RTT causes detection accuracy to collapse below RTT causes detection accuracy to collapse to 9-13%, indicating a fundamental failure of token-level watermarking. To address this, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Hate Speech and Cyberbullying Detection · Advanced Malware Detection Techniques