Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods
Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda

TL;DR
This paper enhances a deep learning formant tracker by integrating linear prediction methods, improving accuracy and noise resilience without additional training, demonstrated on vocal tract resonance data.
Contribution
It introduces a simple, plug-in refinement method combining data-driven and model-driven formant estimation techniques for improved tracking.
Findings
Refined DeepFormants outperform traditional trackers.
QCP-FB analysis yields the best performance.
Refined trackers are more noise-robust.
Abstract
In this study, formant tracking is investigated by refining the formants tracked by an existing data-driven tracker, DeepFormants, using the formants estimated in a model-driven manner by linear prediction (LP)-based methods. As LP-based formant estimation methods, conventional covariance analysis (LP-COV) and the recently proposed quasi-closed phase forward-backward (QCP-FB) analysis are used. In the proposed refinement approach, the contours of the three lowest formants are first predicted by the data-driven DeepFormants tracker, and the predicted formants are replaced frame-wise with local spectral peaks shown by the model-driven LP-based methods. The refinement procedure can be plugged into the DeepFormants tracker with no need for any new data learning. Two refined DeepFormants trackers were compared with the original DeepFormants and with five known traditional trackers using the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
