Robust Nasality Representation Learning for Cleft Palate-Related Velopharyngeal Dysfunction Screening in Real-World Settings
Weixin Liu, Bowen Qu, Amy Stone, Maria E. Powell, Shama Dufresne, Stephane Braun, Izabela Galdyn, Michael Golinko, Bradley Malin, Zhijun Yin, Matthew E. Pontell

TL;DR
This paper introduces a robust two-stage speech representation learning framework for screening velopharyngeal dysfunction, significantly improving performance in real-world, noisy, and device-diverse settings.
Contribution
It proposes a nasality-focused supervised contrastive pre-training method combined with lightweight classifiers, enhancing robustness over existing speech-based VPD screening models.
Findings
Achieved perfect screening performance on in-domain data.
Outperformed baseline methods on out-of-domain internet recordings.
Demonstrated improved robustness to recording artifacts and domain shifts.
Abstract
Velopharyngeal dysfunction (VPD) is characterized by inadequate velopharyngeal closure during speech and often causes hypernasality and reduced intelligibility. Although speech-based machine learning models can perform well under standardized clinical recording conditions, their performance often drops in real-world settings because of domain shift caused by differences in devices, channels, noise, and room acoustics. To improve robustness, we propose a two-stage framework for VPD screening. First, a nasality-focused speech representation is learned by supervised contrastive pre-training on an auxiliary corpus with phoneme alignments, using oral-context versus nasal-context supervision. Second, the encoder is frozen and used with lightweight classifiers on 0.5-second speech chunks, whose probabilities are aggregated to produce recording-level decisions with a fixed threshold. On an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCleft Lip and Palate Research · Voice and Speech Disorders · Nasal Surgery and Airway Studies
