Text to speech synthesis

Harini s; Manoj G M

arXiv:2401.13891 · cs.SE · January 26, 2024 · 1 citation

Open Access

TL;DR

This paper reviews the core technologies, applications, and recent advancements in text-to-speech synthesis, highlighting its role in improving accessibility and naturalness in speech generation.

Contribution

It provides a comprehensive overview of TTS synthesis, discussing technological challenges and recent progress in naturalness, multilingual support, and emotional expression.

Findings

01

Advancements in neural network-based TTS improve speech naturalness.

02

Multilingual TTS systems are increasingly effective.

03

Incorporation of emotional expression enhances user experience.

Abstract

Text-to-speech (TTS) synthesis is a technology that converts written text into spoken words, enabling a natural and accessible means of communication. This abstract explores the key aspects of TTS synthesis, encompassing its underlying technologies, applications, and implications for various sectors. The technology uses advanced algorithms and linguistic models to convert textual information into lifelike speech, enhancing user experiences in diverse contexts such as accessibility tools, navigation systems, and virtual assistants. The abstract also covers the challenges and advancements in TTS synthesis, including naturalness, multilingual support, and emotional expression in synthesized speech.

Taxonomy

Topics: Speech and dialogue systems · Speech Recognition and Synthesis · Natural Language Processing Techniques