A review-based study on different Text-to-Speech technologies

Md. Jalal Uddin Chowdhury; Ashab Hussan

arXiv:2312.11563 · cs.SD · December 20, 2023 · 2 cites

Open Access

TL;DR

This paper reviews Text-to-Speech (TTS) technologies, comparing their advantages and limitations and covering recent advances such as neural and hybrid TTS, to guide future research and application development.

Contribution

It offers a detailed comparison of TTS technologies, including traditional and modern neural approaches, highlighting their respective strengths and limitations.

Findings

01. Neural TTS offers more natural voice quality.
02. Concatenative TTS is simpler but less flexible.
03. Hybrid TTS combines advantages of multiple methods.

Abstract

This research paper presents a comprehensive review-based study on various Text-to-Speech (TTS) technologies. TTS technology is an important aspect of human-computer interaction, enabling machines to convert written text into audible speech. The paper examines the different TTS technologies available, including concatenative TTS, formant synthesis TTS, and statistical parametric TTS. The study focuses on comparing the advantages and limitations of these technologies in terms of their naturalness of voice, the level of complexity of the system, and their suitability for different applications. In addition, the paper explores the latest advancements in TTS technology, including neural TTS and hybrid TTS. The findings of this research will provide valuable insights for researchers, developers, and users who want to understand the different TTS technologies and their suitability for…

Taxonomy

Topics: Speech Recognition and Synthesis · Speech and Audio Processing · Speech and dialogue systems