Evaluating the Impact of Discriminative and Generative E2E Speech Enhancement Models on Syllable Stress Preservation
Rangavajjala Sankara Bharadwaj, Jhansi Mallela, Sai Harshitha Aluru,, Chiranjeevi Yarra

TL;DR
This study investigates how discriminative and generative speech enhancement models affect the preservation of syllable stress patterns in noisy speech, with implications for improving language learning tools.
Contribution
It provides a comparative analysis of SE models' impact on stress detection and introduces effective feature sets for stress preservation in noisy conditions.
Findings
Generative SE models better preserve stress patterns with heuristic features.
Stress detection remains robust with generative models across various SNRs.
Perceptual study aligns with stress detection results, confirming model effectiveness.
Abstract
Automatic syllable stress detection is a crucial component in Computer-Assisted Language Learning (CALL) systems for language learners. Current stress detection models are typically trained on clean speech, which may not be robust in real-world scenarios where background noise is prevalent. To address this, speech enhancement (SE) models, designed to enhance speech by removing noise, might be employed, but their impact on preserving syllable stress patterns is not well studied. This study examines how different SE models, representing discriminative and generative modeling approaches, affect syllable stress detection under noisy conditions. We assess these models by applying them to speech data with varying signal-to-noise ratios (SNRs) from 0 to 20 dB, and evaluating their effectiveness in maintaining stress patterns. Additionally, we explore different feature sets to determine which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhonetics and Phonology Research
