ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features

Ye Bhone Lin; Thura Aung; Ye Kyaw Thu; Thazin Myint Oo

arXiv:2511.21088·cs.CL·November 27, 2025

ASR Error Correction in Low-Resource Burmese with Alignment-Enhanced Transformers using Phonetic Features

Ye Bhone Lin, Thura Aung, Ye Kyaw Thu, Thazin Myint Oo

PDF

Open Access

TL;DR

This study develops a Transformer-based error correction model for low-resource Burmese ASR, integrating phonetic and alignment features to significantly improve transcription accuracy.

Contribution

It introduces the first ASR error correction approach for Burmese, combining IPA and alignment features within Transformer models for enhanced performance.

Findings

01

WER reduced from 51.56 to 39.82 with AEC

02

chrF++ score improved from 0.5864 to 0.627

03

Consistent gains over baseline ASR outputs

Abstract

This paper investigates sequence-to-sequence Transformer models for automatic speech recognition (ASR) error correction in low-resource Burmese, focusing on different feature integration strategies including IPA and alignment information. To our knowledge, this is the first study addressing ASR error correction specifically for Burmese. We evaluate five ASR backbones and show that our ASR Error Correction (AEC) approaches consistently improve word- and character-level accuracy over baseline outputs. The proposed AEC model, combining IPA and alignment features, reduced the average WER of ASR models from 51.56 to 39.82 before augmentation (and 51.56 to 43.59 after augmentation) and improving chrF++ scores from 0.5864 to 0.627, demonstrating consistent gains over the baseline ASR outputs without AEC. Our results highlight the robustness of AEC and the importance of feature design for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Phonetics and Phonology Research