Towards Explainable Khmer Polarity Classification

Marry Kong; Rina Buoy; Sovisal Chenda; Nguonly Taing

arXiv:2511.09313·cs.CL·November 13, 2025

Towards Explainable Khmer Polarity Classification

Marry Kong, Rina Buoy, Sovisal Chenda, Nguonly Taing

PDF

Open Access

TL;DR

This paper introduces an explainable Khmer polarity classifier using a fine-tuned Qwen-3 model that predicts sentiment labels and provides rationales, supported by a new Khmer polarity dataset.

Contribution

It presents a novel explainable sentiment classification approach for Khmer using instruction-based reasoning and releases a new dataset and models.

Findings

01

The model accurately predicts polarity labels.

02

It provides self-explanations supporting its predictions.

03

The dataset is publicly available for further research.

Abstract

Khmer polarity classification is a fundamental natural language processing task that assigns a positive, negative, or neutral label to a given Khmer text input. Existing Khmer models typically predict the label without explaining the rationale behind the prediction. This paper proposes an explainable Khmer polarity classifier by fine-tuning an instruction-based reasoning Qwen-3 model. The notion of explainability in this paper is limited to self-explanations, which the model uses to rationalize its predictions. Experimental results show that the fine-tuned model not only predicts labels accurately but also provides reasoning by identifying polarity-related keywords or phrases to support its predictions. In addition, we contribute a new Khmer polarity dataset consisting of short- to medium-length casual, romanized, and mixed-code Khmer expressions. This dataset was constructed using both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Computational and Text Analysis Methods