Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR

Sotheara Leang (CADT; M-PSI); \'Eric Castelli (M-PSI); Dominique Vaufreydaz (M-PSI); Sethserey Sam (CADT)

arXiv:2507.22964·eess.AS·August 1, 2025

Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR

Sotheara Leang (CADT, M-PSI), \'Eric Castelli (M-PSI), Dominique Vaufreydaz (M-PSI), Sethserey Sam (CADT)

PDF

Open Access

TL;DR

This paper introduces dynamic spectral parameters combined with traditional features to improve Vietnamese ASR, achieving lower error rates and greater gender independence by capturing speech's temporal and tonal characteristics.

Contribution

It proposes a novel set of dynamic parameters based on SSCFs and their combination with MFCCs, enhancing gender robustness in Vietnamese ASR.

Findings

01

Significantly reduces word error rates.

02

Exhibits greater gender independence.

03

Improves robustness of tonal information representation.

Abstract

The dynamic characteristics of speech signal provides temporal information and play an important role in enhancing Automatic Speech Recognition (ASR). In this work, we characterized the acoustic transitions in a ratio plane of Spectral Subband Centroid Frequencies (SSCFs) using polar parameters to capture the dynamic characteristics of the speech and minimize spectral variation. These dynamic parameters were combined with Mel-Frequency Cepstral Coefficients (MFCCs) in Vietnamese ASR to capture more detailed spectral information. The SSCF0 was used as a pseudo-feature for the fundamental frequency (F0) to describe the tonal information robustly. The findings showed that the proposed parameters significantly reduce word error rates and exhibit greater gender independence than the baseline MFCCs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Fault Detection and Control Systems