Proposal of protocols for speech materials acquisition and presentation assisted by tools based on structured test signals
Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei Yatabe

TL;DR
This paper introduces protocols and tools based on structured test signals, including a new TSP family, to improve speech material acquisition, presentation, and evaluation for subjective experiments, leveraging modern computational resources.
Contribution
It presents novel protocols and tools utilizing structured test signals and a new TSP family for speech material management and evaluation.
Findings
Protocols enable reusable speech materials for future research
Tools facilitate compatibility assessment of speech materials
Implementation leverages advanced computational resources
Abstract
We propose protocols for acquiring speech materials, making them reusable for future investigations, and presenting them for subjective experiments. We also provide means to evaluate existing speech materials' compatibility with target applications. We built these protocols and tools based on structured test signals and analysis methods, including a new family of the Time-Stretched Pulse (TSP). Computational resources (including software development resources) that are over a billion times more powerful than those of half a century ago made it possible to make these protocols and tools accessible to under-resourced environments.
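For readers unfamiliar with TSP signals, the following is a minimal sketch of how a classic TSP can be generated and used to recover an impulse response. It uses a commonly cited frequency-domain definition of the conventional TSP (unit magnitude, quadratic phase), not the new TSP family proposed in the paper, whose definition is not given in this summary. The function names `make_tsp` and `estimate_impulse_response`, the length `n`, and the stretch parameter `m` are illustrative assumptions.

```python
import numpy as np


def make_tsp(n, m):
    """Build a classic TSP of even length n (assumed definition, see lead-in).

    The spectrum has unit magnitude and a phase that grows with the square
    of the frequency bin index; the integer m controls how far the pulse is
    stretched in time.
    """
    if n % 2 != 0:
        raise ValueError("n must be even")
    k = np.arange(n // 2 + 1)                 # non-negative frequency bins
    phase = 4.0 * np.pi * m * k**2 / n**2     # quadratic phase
    spectrum = np.exp(1j * phase)             # |spectrum| == 1 at every bin
    tsp = np.fft.irfft(spectrum, n)           # real-valued time-domain signal
    return tsp, spectrum


def estimate_impulse_response(recorded, spectrum):
    """Recover one period of a system response from a recorded TSP.

    Because the TSP spectrum has unit magnitude, its inverse filter is simply
    the complex conjugate. This assumes the recording covers exactly one
    period of a periodically repeated TSP (circular convolution).
    """
    n = 2 * (len(spectrum) - 1)
    rec_spec = np.fft.rfft(recorded, n)
    return np.fft.irfft(rec_spec * np.conj(spectrum), n)


if __name__ == "__main__":
    n = 4096
    tsp, spectrum = make_tsp(n, m=512)

    # Hypothetical system under test: a short three-tap response.
    h = np.zeros(n)
    h[:3] = [1.0, 0.5, -0.25]

    # Simulate playback through the system as a circular convolution.
    recorded = np.fft.irfft(np.fft.rfft(tsp) * np.fft.rfft(h), n)

    estimate = estimate_impulse_response(recorded, spectrum)
    print("max error:", np.max(np.abs(estimate - h)))
```

In an actual measurement the recorded signal would come from a loudspeaker and microphone chain rather than a simulated convolution, and background noise and nonlinearity would limit the accuracy of the recovered response.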
Taxonomy
Topics: Speech Recognition and Synthesis
