Self-imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training
Seung Hee Yang, Minhwa Chung

TL;DR
This paper introduces a GAN-based method for generating self-imitating feedback in pronunciation training, transforming non-native speech spectrograms to native-like feedback with improved correction capabilities.
Contribution
It applies GANs with cycle consistency loss to produce more natural and effective pronunciation feedback, surpassing traditional prosodic transplantation techniques.
Findings
GAN successfully transforms non-native spectrograms to native-like feedback.
The method captures segmental corrections not achievable by PSOLA.
Perceptual tests favor the generative approach over baseline methods.
Abstract
Self-imitating feedback is an effective and learner-friendly method for non-native learners in Computer-Assisted Pronunciation Training. Acoustic characteristics in native utterances are extracted and transplanted onto learner's own speech input, and given back to the learner as a corrective feedback. Previous works focused on speech conversion using prosodic transplantation techniques based on PSOLA algorithm. Motivated by the visual differences found in spectrograms of native and non-native speeches, we investigated applying GAN to generate self-imitating feedback by utilizing generator's ability through adversarial training. Because this mapping is highly under-constrained, we also adopt cycle consistency loss to encourage the output to preserve the global structure, which is shared by native and non-native utterances. Trained on 97,200 spectrogram images of short utterances produced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsCycle Consistency Loss · Convolution · Dogecoin Customer Service Number +1-833-534-1729
