Domain Adaptation For Formant Estimation Using Deep Learning
Yehoshua Dissen, Joseph Keshet, Jacob Goldberger, Cynthia Clopper

TL;DR
This paper introduces a domain adaptation method for formant estimation with deep learning, enabling a single model to perform well across diverse speakers and speech styles by training an adaptation layer.
Contribution
It proposes a novel domain adaptation approach that freezes a trained network and adds an adaptation layer for universal formant estimation across varied speech domains.
Findings
Adapted network performs well across multiple datasets.
Method compares favorably with existing formant estimation techniques.
Adaptation layer improves model generalization to different speakers.
Abstract
In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
