Proposal of protocols for speech materials acquisition and presentation   assisted by tools based on structured test signals

Hideki Kawahara; Ken-Ichi Sakakibara; Mitsunori Mizumachi; Kohei; Yatabe

arXiv:2409.20516·eess.AS·October 1, 2024

Proposal of protocols for speech materials acquisition and presentation assisted by tools based on structured test signals

Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei, Yatabe

PDF

Open Access

TL;DR

This paper introduces protocols and tools based on structured test signals, including a new TSP family, to improve speech material acquisition, presentation, and evaluation for subjective experiments, leveraging modern computational resources.

Contribution

It presents novel protocols and tools utilizing structured test signals and a new TSP family for speech material management and evaluation.

Findings

01

Protocols enable reusable speech materials for future research

02

Tools facilitate compatibility assessment of speech materials

03

Implementation leverages advanced computational resources

Abstract

We propose protocols for acquiring speech materials, making them reusable for future investigations, and presenting them for subjective experiments. We also provide means to evaluate existing speech materials' compatibility with target applications. We built these protocols and tools based on structured test signals and analysis methods, including a new family of the Time-Stretched Pulse (TSP). Over a billion times more powerful computational (including software development) resources than a half-century ago enabled these protocols and tools to be accessible to under-resourced environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis