Evince the artifacts of Spoof Speech by blending Vocal Tract and Voice Source Features
Tadipatri Uday Kiran Reddy, Sahukari Chaitanya Varun, Kota Pranav, Kumar Sankala Sreekanth, Kodukula Sri Rama Murty

TL;DR
This paper introduces a system that detects spoof speech and identifies the generating algorithm by analyzing Vocal Tract and Voice Source features, achieving high accuracy and providing insights into speech artifacts.
Contribution
It proposes a novel approach combining VTS and VS features for spoof detection and algorithm classification, enhancing robustness through model fusion.
Findings
VS features focus on phoneme transitions
VTS features emphasize stationary speech segments
Fusion improves overall classification robustness
Abstract
With the rapid advancement in synthetic speech generation technologies, great interest in differentiating spoof speech from the natural speech is emerging in the research community. The identification of these synthetic signals is a difficult task not only for the cutting-edge classification models but also for humans themselves. To prevent potential adverse effects, it becomes crucial to detect spoof signals. From a forensics perspective, it is also important to predict the algorithm which generated them to identify the forger. This needs an understanding of the underlying attributes of spoof signals which serve as a signature for the synthesizer. This study emphasizes the segments of speech signals critical in identifying their authenticity by utilizing the Vocal Tract System(\textit{VTS}) and Voice Source(\textit{VS}) features. In this paper, we propose a system that detects spoof…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Phonetics and Phonology Research
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Softmax · Tanh Activation · WaveRNN
