Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers

Md. Rakibul Islam; Most. Sharmin Sultana Samu; Md. Zahid Hossain; Farhad Uz Zaman; Md. Kamrozzaman Bhuiyan

arXiv:2512.21709·cs.CL·December 29, 2025

Detecting AI-Generated Paraphrases in Bengali: A Comparative Study of Zero-Shot and Fine-Tuned Transformers

Md. Rakibul Islam, Most. Sharmin Sultana Samu, Md. Zahid Hossain, Farhad Uz Zaman, Md. Kamrozzaman Bhuiyan

PDF

Open Access

TL;DR

This study evaluates transformer models for detecting AI-generated Bengali text, showing that fine-tuning significantly enhances accuracy from near chance to over 90%, thus advancing Bengali AI text detection capabilities.

Contribution

It provides the first comprehensive comparison of zero-shot and fine-tuned transformer models for Bengali AI text detection, highlighting the importance of task-specific fine-tuning.

Findings

01

Zero-shot models perform near chance levels (~50% accuracy).

02

Fine-tuning improves performance to around 91% accuracy and F1-score.

03

IndicBERT shows limited effectiveness after fine-tuning.

Abstract

Large language models (LLMs) can produce text that closely resembles human writing. This capability raises concerns about misuse, including disinformation and content manipulation. Detecting AI-generated text is essential to maintain authenticity and prevent malicious applications. Existing research has addressed detection in multiple languages, but the Bengali language remains largely unexplored. Bengali's rich vocabulary and complex structure make distinguishing human-written and AI-generated text particularly challenging. This study investigates five transformer-based models: XLMRoBERTa-Large, mDeBERTaV3-Base, BanglaBERT-Base, IndicBERT-Base and MultilingualBERT-Base. Zero-shot evaluation shows that all models perform near chance levels (around 50% accuracy) and highlight the need for task-specific fine-tuning. Fine-tuning significantly improves performance, with XLM-RoBERTa,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Artificial Intelligence in Healthcare and Education